platform/upstream/libvpx.git
11 years agoRemoving find_seg_id and using vp9_get_pred_mi_segid instead.
Dmitry Kovalev [Thu, 20 Jun 2013 22:52:47 +0000 (15:52 -0700)]
Removing find_seg_id and using vp9_get_pred_mi_segid instead.

Change-Id: Ia40229903c08f14020e90e94cfdf494aba1be827

11 years agoMerge "Add subtract_block SSE2 version and unit test."
Ronald S. Bultje [Fri, 21 Jun 2013 19:49:50 +0000 (12:49 -0700)]
Merge "Add subtract_block SSE2 version and unit test."

11 years agoMerge "SSE2/SSSE3 optimizations and unit test for sub_pixel_avg_variance()."
Ronald S. Bultje [Fri, 21 Jun 2013 19:49:43 +0000 (12:49 -0700)]
Merge "SSE2/SSSE3 optimizations and unit test for sub_pixel_avg_variance()."

11 years agoAdd subtract_block SSE2 version and unit test.
Ronald S. Bultje [Fri, 21 Jun 2013 16:35:37 +0000 (09:35 -0700)]
Add subtract_block SSE2 version and unit test.

3% faster overall (3min35.0 to 3min28.5).

Change-Id: I5ff8a5c2c91586b6632ca5009ad1ea51ce94af5e

11 years agoMerge "Get some speed back for cpuused 1"
Yaowu Xu [Fri, 21 Jun 2013 05:37:01 +0000 (22:37 -0700)]
Merge "Get some speed back for cpuused 1"

11 years agoGet some speed back for cpuused 1
Yaowu Xu [Thu, 20 Jun 2013 22:23:37 +0000 (15:23 -0700)]
Get some speed back for cpuused 1

and remove unused code.

Change-Id: If380440c4450294b5450b7a9eeb94a376846ec01

11 years agoMerge "rename variables to avoid build error in MSVC"
Yaowu Xu [Fri, 21 Jun 2013 02:04:30 +0000 (19:04 -0700)]
Merge "rename variables to avoid build error in MSVC"

11 years agorename variables to avoid build error in MSVC
Yaowu Xu [Thu, 20 Jun 2013 18:48:08 +0000 (11:48 -0700)]
rename variables to avoid build error in MSVC

Change-Id: I7960178c95c54d5c4497e44cfc8c493566294b34

11 years agoMerge "Implement sse2 and ssse3 versions for all sub_pixel_variance sizes."
Yaowu Xu [Fri, 21 Jun 2013 00:42:50 +0000 (17:42 -0700)]
Merge "Implement sse2 and ssse3 versions for all sub_pixel_variance sizes."

11 years agoSSE2/SSSE3 optimizations and unit test for sub_pixel_avg_variance().
Ronald S. Bultje [Thu, 20 Jun 2013 22:59:48 +0000 (15:59 -0700)]
SSE2/SSSE3 optimizations and unit test for sub_pixel_avg_variance().

Encoding of bus @ 1500kbps (first 50 frames) goes from 3min57 to
3min35, i.e. approximately a 10.5% speedup. Note that the SIMD versions
which use a bilinear filter (x_offset & 7 || y_offset & 7) aren't
perfectly interleaved, and can probably be improved further in the
future. I've marked this with a few TODOs/FIXMEs in the code.

Change-Id: I5c9e900c0f0d32e431a50fecae213b510b2549f9

11 years agoMerge "clean out libvpx-srcs.txt if built"
Jim Bankoski [Thu, 20 Jun 2013 22:10:16 +0000 (15:10 -0700)]
Merge "clean out libvpx-srcs.txt if built"

11 years agoclean out libvpx-srcs.txt if built
Jim Bankoski [Thu, 20 Jun 2013 22:05:42 +0000 (15:05 -0700)]
clean out libvpx-srcs.txt if built

Change-Id: Idfd69e66e8982275eb00d8007a55efd1a4f86a98

11 years agoMerge "Revert "test_libvpx: disable pthreads in gtest""
James Zern [Thu, 20 Jun 2013 22:02:27 +0000 (15:02 -0700)]
Merge "Revert "test_libvpx: disable pthreads in gtest""

11 years agoFix win64 warning.
Frank Galligan [Thu, 20 Jun 2013 21:05:17 +0000 (14:05 -0700)]
Fix win64 warning.

- size_t vs int.

Change-Id: Ib47ebd932a4b69db9f52a43000bb69d0a96b9134

11 years agoRevert "test_libvpx: disable pthreads in gtest"
James Zern [Thu, 20 Jun 2013 19:49:15 +0000 (12:49 -0700)]
Revert "test_libvpx: disable pthreads in gtest"

This reverts commit 90a9900abb79fabfd44189a959d14ca677c2777a

Seems to break the Mac build:
src/include/gtest/internal/gtest-port.h:1208:: pthread_mutex_lock(&mutex_)failed with error 22
Abort trap: 6

Change-Id: Icbe31161d7c27f1b0a28d33409e7712430bbf0ae

11 years agoMerge "Add unit tests for 4x4 ADST"
Jingning Han [Thu, 20 Jun 2013 17:22:40 +0000 (10:22 -0700)]
Merge "Add unit tests for 4x4 ADST"

11 years agoMerge "Cast value to avoid size_t/int warning on win64"
Johann [Thu, 20 Jun 2013 17:19:39 +0000 (10:19 -0700)]
Merge "Cast value to avoid size_t/int warning on win64"

11 years agoMerge "Renaming 'nmv' to 'mv' for several functions."
Dmitry Kovalev [Thu, 20 Jun 2013 17:17:12 +0000 (10:17 -0700)]
Merge "Renaming 'nmv' to 'mv' for several functions."

11 years agoMerge "Function decomposition inside vp9_decodemv.c file."
Dmitry Kovalev [Thu, 20 Jun 2013 17:17:05 +0000 (10:17 -0700)]
Merge "Function decomposition inside vp9_decodemv.c file."

11 years agoImproving model rd with variance and quant step
Deb Mukherjee [Wed, 19 Jun 2013 23:23:21 +0000 (16:23 -0700)]
Improving model rd with variance and quant step

Improves the rd modeling function and implements them using interpolation
from a table which is a little faster. Also uses sse as input to the
modeling function rather than var - since there is no dc prediction
used and as a result the sse works a little better.

derfraw300: +0.05%
Speedup: ~1%

Change-Id: I151353c6451e0e8fe3ae18ab9842f8f67e5151ff

11 years agoCast value to avoid size_t/int warning on win64
Johann [Thu, 20 Jun 2013 16:52:08 +0000 (09:52 -0700)]
Cast value to avoid size_t/int warning on win64

dboolhuff.c(50) : warning C4267: 'initializing' : conversion from
'size_t' to 'int'

Change-Id: I6b85759efb2fa19f362f406623d8a7583a55c036

11 years agoadds force partitioning greater than or less than block size
Jim Bankoski [Thu, 20 Jun 2013 16:24:04 +0000 (09:24 -0700)]
adds force partitioning greater than or less than block size

adds a new speed feature to force partitioning to be greater than
or less than a certain size

Change-Id: I8c048eeeef93700ae822eccf98f8751a45b2e7d0

11 years agoadds a set partitioning to speed features
Jim Bankoski [Thu, 20 Jun 2013 14:46:51 +0000 (07:46 -0700)]
adds a set partitioning to speed features

this feature lets you set a partitioning size to be used by the entire
frame.

Change-Id: I208a4c8c701375cbb054418266f677768b6f8f06

11 years agopartition by variance using var from last frame
Jim Bankoski [Thu, 20 Jun 2013 14:17:01 +0000 (07:17 -0700)]
partition by variance using var from last frame

This uses variance to split partition. Variance is calculated using
nearest mv,  always from last ref frame.

Change-Id: Idd015b4a9aa3bc82591759eac239680c07496896

11 years agoconvert all speed things to speed features
Jim Bankoski [Wed, 19 Jun 2013 22:53:47 +0000 (15:53 -0700)]
convert all speed things to speed features

Change-Id: Ie24489a4d39f3e53e816eeebf75a1c9c7d94515a

11 years agonew partition via variance
Jim Bankoski [Wed, 19 Jun 2013 21:26:49 +0000 (14:26 -0700)]
new partition via variance

Change-Id: Ideee45cad8b38087c509cd404484728e85d0c427

11 years agofix to set up new speed feature
Jim Bankoski [Wed, 19 Jun 2013 19:16:45 +0000 (12:16 -0700)]
fix to set up new speed feature

This uses the speed feature functionality for code.

Change-Id: I9cd16c0c5f98520ae27ebba81aa2c178546587f8

11 years agodon't copy partitions for key frames or altrefs
Jim Bankoski [Wed, 19 Jun 2013 18:05:34 +0000 (11:05 -0700)]
don't copy partitions for key frames or altrefs

force us to go through slow partitioning for keyframes, altref and
overlays.

Change-Id: I1a286361bf74083e71973575a7296be46eb98742

11 years agoImplement sse2 and ssse3 versions for all sub_pixel_variance sizes.
Ronald S. Bultje [Thu, 20 Jun 2013 16:34:25 +0000 (09:34 -0700)]
Implement sse2 and ssse3 versions for all sub_pixel_variance sizes.

Overall speedup around 5% (bus @ 1500kbps first 50 frames 4min10 ->
3min58). Specific changes to timings for each function compared to
original assembly-optimized versions (or just new version timings if
no previous assembly-optimized version was available):

sse2   4x4:    99 ->   82 cycles
sse2   4x8:           128 cycles
sse2   8x4:           121 cycles
sse2   8x8:   149 ->  129 cycles
sse2   8x16:  235 ->  245 cycles (?)
sse2  16x8:   269 ->  203 cycles
sse2  16x16:  441 ->  349 cycles
sse2  16x32:          641 cycles
sse2  32x16:          643 cycles
sse2  32x32: 1733 -> 1154 cycles
sse2  32x64:         2247 cycles
sse2  64x32:         2323 cycles
sse2  64x64: 6984 -> 4442 cycles

ssse3  4x4:           100 cycles (?)
ssse3  4x8:           103 cycles
ssse3  8x4:            71 cycles
ssse3  8x8:           147 cycles
ssse3  8x16:          158 cycles
ssse3 16x8:   188 ->  162 cycles
ssse3 16x16:  316 ->  273 cycles
ssse3 16x32:          535 cycles
ssse3 32x16:          564 cycles
ssse3 32x32:          973 cycles
ssse3 32x64:         1930 cycles
ssse3 64x32:         1922 cycles
ssse3 64x64:         3760 cycles

Change-Id: I81ff6fe51daf35a40d19785167004664d7e0c59d

11 years agodisable speed > 1 speed corrections in firstpass
Jim Bankoski [Thu, 20 Jun 2013 16:28:11 +0000 (09:28 -0700)]
disable speed > 1 speed corrections in firstpass

need to rework these

Change-Id: I17dc2c88d2faadd2f8fb117c52c25f04ea2e9856

11 years agonew debug modes code
Jim Bankoski [Wed, 19 Jun 2013 23:03:27 +0000 (16:03 -0700)]
new debug modes code

The new print out includes skips and has prefixed sections so you can
grep to find things like transforms chosen on each frame.

Change-Id: I195043424647d9514cfc3ff6720a5b20d010fa1b

11 years agoMerge "copy partitioning from last fame"
Jim Bankoski [Thu, 20 Jun 2013 16:32:43 +0000 (09:32 -0700)]
Merge "copy partitioning from last fame"

11 years agocopy partitioning from last fame
Jim Bankoski [Thu, 20 Jun 2013 16:32:19 +0000 (09:32 -0700)]
copy partitioning from last fame

Change-Id: I26e80ede80cb4389378a95afa95d229092a9859a

11 years agoAdd unit tests for 4x4 ADST
Jingning Han [Tue, 18 Jun 2013 17:46:33 +0000 (10:46 -0700)]
Add unit tests for 4x4 ADST

Enable sign bias check and round-trip error unit tests for 4x4 hybrid
transform modules.

Change-Id: Icd3d839f098d4b92b00ff76eac146765b039d0d3

11 years agoMerge "test_libvpx: disable pthreads in gtest"
John Koleszar [Thu, 20 Jun 2013 16:05:58 +0000 (09:05 -0700)]
Merge "test_libvpx: disable pthreads in gtest"

11 years agoRemoved a number of unnecessary check on ref_frame
Yaowu Xu [Thu, 20 Jun 2013 00:34:49 +0000 (17:34 -0700)]
Removed a number of unnecessary check on ref_frame

Since intra block decoding is handled by decode_sb_intra() separately.

Change-Id: I42d757884714084c92fc23ec5d35d4dc946f4b15

11 years agoFunction decomposition inside vp9_decodemv.c file.
Dmitry Kovalev [Wed, 19 Jun 2013 20:09:34 +0000 (13:09 -0700)]
Function decomposition inside vp9_decodemv.c file.

Change-Id: Iab96e6a50aec543c63e15cd134f9d5f01ca7ceff

11 years agotest_libvpx: disable pthreads in gtest
James Zern [Wed, 19 Jun 2013 02:19:20 +0000 (19:19 -0700)]
test_libvpx: disable pthreads in gtest

currently threading is internal to libvpx so thread safety is unneeded
in libgtest -- visual studio builds already operate in this way as they
do not have pthread.h available by default.

this removes an unconditional link to libpthread using $(extralibs)
should libvpx require it.

Change-Id: Ieae1d693406653a54b54fba818c598836797d33b

11 years agoMerge "Add two-pass quantization"
Yunqing Wang [Wed, 19 Jun 2013 18:35:40 +0000 (11:35 -0700)]
Merge "Add two-pass quantization"

11 years agoAdd two-pass quantization
Yunqing Wang [Wed, 19 Jun 2013 00:09:50 +0000 (17:09 -0700)]
Add two-pass quantization

Optimized the quantization function by making it a two-pass
process. The first pass does a quick checking of the transform
coefficients against the base ZBIN, and only keep the good
enough set of coefficients for quantization. A skipping
check is added. If all coefficients are within the base ZBIN, no
quantization is needed. The second pass is the actual quantization
pass, which only processes the coefficient subset determined
in first pass. This reduces the computation. Furthermore, an
alternitive method is used for large transform size, which often
has sparse nonzero quantized coefficients.

Overall, the encoder speedup is about 4%. The quantization function
itself gets 20% faster.

Change-Id: I3a9dd0da6db030260b6d9c314a9fa48ecae89f22

11 years agoRemove unnecessary copying of probs.
Yaowu Xu [Wed, 19 Jun 2013 01:18:25 +0000 (18:18 -0700)]
Remove unnecessary copying of probs.

Change-Id: Ic924f07c6ab0c929c6cdf11880d3c625806e272c

11 years agoRenaming 'nmv' to 'mv' for several functions.
Dmitry Kovalev [Wed, 19 Jun 2013 01:28:10 +0000 (18:28 -0700)]
Renaming 'nmv' to 'mv' for several functions.

Change-Id: I183a38997a9d01e4a1b869e92509f6915216fa09

11 years agoMerge "tests: clear system state after non-API calls"
John Koleszar [Tue, 18 Jun 2013 23:40:15 +0000 (16:40 -0700)]
Merge "tests: clear system state after non-API calls"

11 years agoMerge "Make fdct32 computation flow within 16bit range"
Jingning Han [Tue, 18 Jun 2013 18:40:14 +0000 (11:40 -0700)]
Merge "Make fdct32 computation flow within 16bit range"

11 years agotests: clear system state after non-API calls
James Zern [Tue, 18 Jun 2013 05:58:40 +0000 (22:58 -0700)]
tests: clear system state after non-API calls

add ClearSystemState() to reset MMX registers avoiding corrupting
subsequent tests.

Change-Id: I668deb09aa7aa467709776e5819f936910698bc0

11 years agoMerge "Code cleanup inside the decoder code."
Dmitry Kovalev [Tue, 18 Jun 2013 17:16:46 +0000 (10:16 -0700)]
Merge "Code cleanup inside the decoder code."

11 years agoMerge "Removing vp9_invtrans.{c, h} files."
Dmitry Kovalev [Tue, 18 Jun 2013 17:16:25 +0000 (10:16 -0700)]
Merge "Removing vp9_invtrans.{c, h} files."

11 years agoMake fdct32 computation flow within 16bit range
Jingning Han [Fri, 14 Jun 2013 18:28:56 +0000 (11:28 -0700)]
Make fdct32 computation flow within 16bit range

This commit makes use of dual fdct32x32 versions for rate-distortion
optimization loop and encoding process, respectively. The one for
rd loop requires only 16 bits precision for intermediate steps.
The original fdct32x32 that allows higher intermediate precision (18
bits) was retained for the encoding process only.

This allows speed-up for fdct32x32 in the rd loop. No performance
loss observed.

Change-Id: I3237770e39a8f87ed17ae5513c87228533397cc3

11 years agoMerge "Move subpixel variance function from common/ to encoder/."
Ronald S. Bultje [Tue, 18 Jun 2013 16:07:21 +0000 (09:07 -0700)]
Merge "Move subpixel variance function from common/ to encoder/."

11 years agoMerge "Use assembly-optimized variance functions in sub_pixel_{avg}_var()."
Ronald S. Bultje [Tue, 18 Jun 2013 16:07:11 +0000 (09:07 -0700)]
Merge "Use assembly-optimized variance functions in sub_pixel_{avg}_var()."

11 years agoMerge "vpx_ports/x86.h: de-dup #elif block"
John Koleszar [Tue, 18 Jun 2013 15:26:54 +0000 (08:26 -0700)]
Merge "vpx_ports/x86.h: de-dup #elif block"

11 years agoconvolve_test: align filter arrays
James Zern [Tue, 18 Jun 2013 06:11:01 +0000 (23:11 -0700)]
convolve_test: align filter arrays

fixes issue #583

Change-Id: I4b855a5b5b168c8961410cef6ab5e6d86f14d301

11 years agovpx_ports/x86.h: de-dup #elif block
James Zern [Tue, 18 Jun 2013 04:58:00 +0000 (21:58 -0700)]
vpx_ports/x86.h: de-dup #elif block

Change-Id: I052647e13dd24354888c890f6b4a987d989552ae

11 years agoCode cleanup inside the decoder code.
Dmitry Kovalev [Fri, 14 Jun 2013 22:11:18 +0000 (15:11 -0700)]
Code cleanup inside the decoder code.

Change-Id: I927c7223996cdeb44f46e0e6c2e2054d458c300b

11 years agoMove subpixel variance function from common/ to encoder/.
Ronald S. Bultje [Mon, 17 Jun 2013 23:54:09 +0000 (16:54 -0700)]
Move subpixel variance function from common/ to encoder/.

This seems to only be used in the encoder. Also remove an empty wrapper
file that contained forward declarations for this function, but didn't
actually define any actual functions.

Change-Id: Ifc561eef7ebe374a7d03698055e51e105f6d614b

11 years agoRemoving vp9_invtrans.{c, h} files.
Dmitry Kovalev [Mon, 17 Jun 2013 23:09:03 +0000 (16:09 -0700)]
Removing vp9_invtrans.{c, h} files.

Moving single function from vp9_invtrans.c to vp9_encodemb.c.

Change-Id: I26bf6bb90de342a3036c0dbfba78a7dd75a61fe7

11 years agoUse assembly-optimized variance functions in sub_pixel_{avg}_var().
Ronald S. Bultje [Mon, 17 Jun 2013 21:57:13 +0000 (14:57 -0700)]
Use assembly-optimized variance functions in sub_pixel_{avg}_var().

2.5% faster when encoding first 50 frames of bus @ 1500kbps.

Change-Id: I5a64703996cf7fd39b07e32c72311c4b125ec6d4

11 years agoMerge "Fixing compilation error on Mac OS."
Dmitry Kovalev [Mon, 17 Jun 2013 21:40:29 +0000 (14:40 -0700)]
Merge "Fixing compilation error on Mac OS."

11 years agoMerge "Fix typo ('weight' instead of 'width')."
Ronald S. Bultje [Mon, 17 Jun 2013 21:30:57 +0000 (14:30 -0700)]
Merge "Fix typo ('weight' instead of 'width')."

11 years agoFix typo ('weight' instead of 'width').
Ronald S. Bultje [Mon, 17 Jun 2013 20:56:11 +0000 (13:56 -0700)]
Fix typo ('weight' instead of 'width').

Change-Id: I5d3944051d091b4bf3eb13e2a30132d34203ef74

11 years agoFixing compilation error on Mac OS.
Dmitry Kovalev [Mon, 17 Jun 2013 20:50:22 +0000 (13:50 -0700)]
Fixing compilation error on Mac OS.

The error happened because of vp8_decrypt_cb typedef redefinition in both
treereader.h and vp8dx.h. Removing typedef from vp8dx.h in favor of raw
function pointer declaration.

Change-Id: I0266eb341ce433d40caf0abf8748694d505ee786

11 years agoMerge "Removed hardcoded global->limit"
John Koleszar [Mon, 17 Jun 2013 19:33:53 +0000 (12:33 -0700)]
Merge "Removed hardcoded global->limit"

11 years agoRemoved hardcoded global->limit
Scott LaVarnway [Mon, 17 Jun 2013 19:28:45 +0000 (15:28 -0400)]
Removed hardcoded global->limit

Looks like test code.

Change-Id: I5deae2bf14ea6fdcbb9b9d993966c9abef95eb2e

11 years agoChange the encryption feature to use a callback for decryption.
Jeff Petkau [Thu, 13 Jun 2013 19:16:58 +0000 (12:16 -0700)]
Change the encryption feature to use a callback for decryption.

This allows code calling the library can choose an arbitrary
encryption algorithm.

Decoder control parameter VP8_SET_DECRYPT_KEY is renamed to
VP8D_SET_DECRYPTOR, and now takes an small config struct instead
of just a byte array.

Change-Id: I0462b3388d8d45057e4f79a6b6777fe713dc546e

11 years agoMerge "Add vp9 test vectors unit test"
John Koleszar [Mon, 17 Jun 2013 17:32:08 +0000 (10:32 -0700)]
Merge "Add vp9 test vectors unit test"

11 years agoMerge "Remove unused need_to_clamp_mvs"
John Koleszar [Mon, 17 Jun 2013 17:31:58 +0000 (10:31 -0700)]
Merge "Remove unused need_to_clamp_mvs"

11 years agoMerge "vpxdec: add --loops option"
John Koleszar [Mon, 17 Jun 2013 17:31:54 +0000 (10:31 -0700)]
Merge "vpxdec: add --loops option"

11 years agovs9 build: avoid deprecated /Wp64 flag
James Zern [Sat, 6 Apr 2013 02:25:48 +0000 (19:25 -0700)]
vs9 build: avoid deprecated /Wp64 flag

quiets a warning on every file; the preference is to use a 64-bit
compiler, which is readily available at and above this version.

Change-Id: I56e7eb569022e7148249d93fe386ad5ea0eee3fc

11 years agoMerge "Fix type mismatch in array definition"
John Koleszar [Sat, 15 Jun 2013 00:07:22 +0000 (17:07 -0700)]
Merge "Fix type mismatch in array definition"

11 years agoMerge "Remove constant vp9_coef_update_prob table"
John Koleszar [Sat, 15 Jun 2013 00:07:19 +0000 (17:07 -0700)]
Merge "Remove constant vp9_coef_update_prob table"

11 years agoFix type mismatch in array definition
John Koleszar [Fri, 14 Jun 2013 23:38:42 +0000 (16:38 -0700)]
Fix type mismatch in array definition

vp9_default_inter_mode_probs was being accessed with a different type
than it was defined with. Ensure that its declaration is included
prior to its definition.

Change-Id: I2f963f513ab2f4e339f8a3c17e3d0f03749eba16

11 years agoRemove constant vp9_coef_update_prob table
John Koleszar [Fri, 14 Jun 2013 22:12:31 +0000 (15:12 -0700)]
Remove constant vp9_coef_update_prob table

All elements of this table are equal to 252, so replace it with a
single constant VP9_COEF_UPDATE_PROB.

Change-Id: I1e2d1d284326ce6df9899a740c2fc344b3ec81c9

11 years agoMerge "Enable sse2 version of sad8x4/4x8"
Jingning Han [Fri, 14 Jun 2013 20:15:49 +0000 (13:15 -0700)]
Merge "Enable sse2 version of sad8x4/4x8"

11 years agoEnable sse2 version of sad8x4/4x8
Jingning Han [Thu, 13 Jun 2013 18:07:12 +0000 (11:07 -0700)]
Enable sse2 version of sad8x4/4x8

The encoding time for bus at CIF goes from 661s to 625s. This commit
also enabled unit test of sad8x4/4x8 in sad_test.cc.

Change-Id: If3d10ebb56bda584bdb69bcf056599d580b12cb1

11 years agoMerge "Avoid inverse transform/recon without residual"
John Koleszar [Fri, 14 Jun 2013 15:49:58 +0000 (08:49 -0700)]
Merge "Avoid inverse transform/recon without residual"

11 years agoSome cleanups in rd motion search
Deb Mukherjee [Thu, 13 Jun 2013 22:10:42 +0000 (15:10 -0700)]
Some cleanups in rd motion search

No bitstream or output change - only cosmetics.

Change-Id: Ic8c1d7ad010a87dcf27d12a38cd7dd5adba683a7

11 years agoAvoid inverse transform/recon without residual
John Koleszar [Thu, 13 Jun 2013 23:37:52 +0000 (16:37 -0700)]
Avoid inverse transform/recon without residual

Avoid calling decode_block, inverse transform/add in the block is
a skip block for SBs smaller than 8x8 and intra-coded SBs.

Change-Id: I1684182f4a0050c8d6bb46cba6830d9425e7127d

11 years agoMerge "Fix win64 warning."
Frank Galligan [Thu, 13 Jun 2013 23:37:52 +0000 (16:37 -0700)]
Merge "Fix win64 warning."

11 years agoFix win64 warning.
Frank Galligan [Thu, 13 Jun 2013 22:17:23 +0000 (15:17 -0700)]
Fix win64 warning.

- size_t is 64bits in win64. int is 32 bits.

Change-Id: I4e756427ad42c841098a01a216469f65313987e7

11 years agoEnable sse2 version of sad8x4/4x8
Jingning Han [Thu, 13 Jun 2013 18:07:12 +0000 (11:07 -0700)]
Enable sse2 version of sad8x4/4x8

The encoding time for bus at CIF goes from 661s to 625s. This commit
also enabled unit test of sad8x4/4x8 in sad_test.cc.

Change-Id: If3d10ebb56bda584bdb69bcf056599d580b12cb1

11 years agoMerge "libs.mk: allow tests to be sharded"
John Koleszar [Thu, 13 Jun 2013 23:14:33 +0000 (16:14 -0700)]
Merge "libs.mk: allow tests to be sharded"

11 years agolibs.mk: allow tests to be sharded
James Zern [Fri, 31 May 2013 23:08:09 +0000 (16:08 -0700)]
libs.mk: allow tests to be sharded

Creates test_shard.#n [0,9] targets, usable with make -j [jobs]

Change-Id: Ied86dcc89ae72d1690564aa0dd86de10e06d25f9

11 years agoAdd vp9 test vectors unit test
John Koleszar [Thu, 13 Jun 2013 19:42:05 +0000 (12:42 -0700)]
Add vp9 test vectors unit test

These files can stand in until we get proper syntax vectors. They
should provide some additional assurance against inadvertant
bitstream changes.

Change-Id: I12f6c9a5f054e30df40a7ff1f33145abf7e1d59d

11 years agoRemove unused need_to_clamp_mvs
John Koleszar [Wed, 12 Jun 2013 23:50:14 +0000 (16:50 -0700)]
Remove unused need_to_clamp_mvs

This flag no longer needed.

Change-Id: If13482015ddb92d225792ea5c0ee455d2285d1f6

11 years agoQuick modifications to mb loopfilter intrinsic functions
Scott LaVarnway [Wed, 12 Jun 2013 23:23:03 +0000 (19:23 -0400)]
Quick modifications to mb loopfilter intrinsic functions

Modified to work with 8x8 blocks of memory.  Will revisit
later for further optimizations.  For the HD clip used, the
decoder improved by almost 20%.

Change-Id: Iaa4785be293a32a42e8db07141bd699f504b8c67

11 years agovpxdec: add --loops option
John Koleszar [Wed, 12 Jun 2013 23:08:35 +0000 (16:08 -0700)]
vpxdec: add --loops option

Allows vpxdec to operate multiple times on the same input. Mostly
useful for debugging/development.

Change-Id: Icf25ece22e387052eade4438971eee5ff4f798ba

11 years agoMerge "Quick modifications to wide loopfilter intrinsic functions"
Yaowu Xu [Wed, 12 Jun 2013 22:16:11 +0000 (15:16 -0700)]
Merge "Quick modifications to wide loopfilter intrinsic functions"

11 years agoImplement SSE version for sad4x8x4d and SSE2 version for sad8x4x4d.
Ronald S. Bultje [Tue, 11 Jun 2013 22:19:14 +0000 (15:19 -0700)]
Implement SSE version for sad4x8x4d and SSE2 version for sad8x4x4d.

Encoding time of crew (CIF, first 50 frames) @ 1500kbps goes from 4min56
to 4min42.

Change-Id: I92c0c8b32980d2ae7c6dafc8b883a2c7fcd14a9f

11 years agoQuick modifications to wide loopfilter intrinsic functions
Scott LaVarnway [Wed, 12 Jun 2013 20:40:22 +0000 (16:40 -0400)]
Quick modifications to wide loopfilter intrinsic functions

Modified to work with 8x8 blocks of memory.  Will revisit
later for further optimizations.  For the HD clip used, the
decoder improved my 20%.

Change-Id: Ia0057f55d66d1445882351ea6c43b595a5a980e5

11 years agoMerge changes I86fe51b0,I4c9a9e0f
John Koleszar [Wed, 12 Jun 2013 20:43:30 +0000 (13:43 -0700)]
Merge changes I86fe51b0,I4c9a9e0f

* changes:
  Remove unused vp9_idct_add_{y,uv}_block
  Remove some unused loopfilter code

11 years agoMerge "Fix compile warnings on windows."
Johann [Wed, 12 Jun 2013 20:36:50 +0000 (13:36 -0700)]
Merge "Fix compile warnings on windows."

11 years agoMerge "Enable mmx loop filter routines"
John Koleszar [Wed, 12 Jun 2013 19:52:04 +0000 (12:52 -0700)]
Merge "Enable mmx loop filter routines"

11 years agoMerge "Fix row tiling."
Ronald S. Bultje [Wed, 12 Jun 2013 19:41:57 +0000 (12:41 -0700)]
Merge "Fix row tiling."

11 years agoMerge "Fix chroma output when scaling"
John Koleszar [Wed, 12 Jun 2013 19:39:10 +0000 (12:39 -0700)]
Merge "Fix chroma output when scaling"

11 years agoRemove unused vp9_idct_add_{y,uv}_block
John Koleszar [Wed, 12 Jun 2013 19:24:22 +0000 (12:24 -0700)]
Remove unused vp9_idct_add_{y,uv}_block

These functions are not used, and appear to have been superceded.

Change-Id: I86fe51b088264f6b1b8d4d232bba97b371b98120

11 years agoFix the comments in boundary block partition check
Jingning Han [Wed, 12 Jun 2013 19:05:06 +0000 (12:05 -0700)]
Fix the comments in boundary block partition check

Change-Id: Ic6b2881d8d495269edbc514b33376ca963798b45

11 years agoRemove some unused loopfilter code
John Koleszar [Wed, 12 Jun 2013 18:36:00 +0000 (11:36 -0700)]
Remove some unused loopfilter code

This code is unreachable, and not useful for later reference.

Change-Id: I4c9a9e0fbf859c1081bbcfbcda9710afb4b4741f

11 years agoFix compile warnings on windows.
Frank Galligan [Wed, 12 Jun 2013 18:31:28 +0000 (11:31 -0700)]
Fix compile warnings on windows.

Change-Id: If74bc6110016bc75ea3883ab136fbbac88f6a913

11 years agoMerge "Quick fix to stop vpxdec infinite loop"
Yaowu Xu [Wed, 12 Jun 2013 18:28:46 +0000 (11:28 -0700)]
Merge "Quick fix to stop vpxdec infinite loop"

11 years agoEnable mmx loop filter routines
John Koleszar [Wed, 12 Jun 2013 18:28:21 +0000 (11:28 -0700)]
Enable mmx loop filter routines

The mmx routines work as expected for the loop filter, so enable them.

Change-Id: I2bbd9b99a4445fcba17bb95002f1fb6e01fe8f85