Yunqing Wang [Mon, 17 Nov 2014 17:10:28 +0000 (09:10 -0800)]
Merge "Code cleanup: remove unused members in RD_OPT"
Yunqing Wang [Mon, 17 Nov 2014 16:57:38 +0000 (08:57 -0800)]
Merge "vp9_ethread: combine encoder counts in separate struct"
Debargha Mukherjee [Sat, 15 Nov 2014 05:11:27 +0000 (21:11 -0800)]
Merge "Added sse2 acceleration for highbitdepth variance"
James Zern [Sat, 15 Nov 2014 04:26:28 +0000 (20:26 -0800)]
Merge "Visual studio build fix using explicit cast"
Yunqing Wang [Sat, 15 Nov 2014 00:33:25 +0000 (16:33 -0800)]
Code cleanup: remove unused members in RD_OPT
These 2 members in RD_OPT were moved to TileDataEnc struct
already, and therefore were removed here.
Change-Id: I22fee3b67f96e473a58e194a7edc76dbd48bfa04
Yunqing Wang [Sat, 15 Nov 2014 00:04:15 +0000 (16:04 -0800)]
vp9_ethread: combine encoder counts in separate struct
Several frame counters in encoder are updated at SB level. Combine
those counters and put them in a separate struct, which allows us
to allocate one copy for each thread.
Change-Id: I00366296a13c0ada4d8fa12f5e07728388b6cab7
Peter de Rivaz [Thu, 16 Oct 2014 13:00:54 +0000 (14:00 +0100)]
Added sse2 acceleration for highbitdepth variance
Change-Id: I446bdf3a405e4e9d2aa633d6281d66ea0cdfd79f
(cherry picked from commit
d7422b2b1eb9f0011a8c379c2be680d6892b16bc)
(cherry picked from commit
6d741e4d76a7d9ece69ca117d1d9e2f9ee48ef8c)
Deb Mukherjee [Fri, 14 Nov 2014 23:12:01 +0000 (15:12 -0800)]
Visual studio build fix using explicit cast
Change-Id: If74510370723e497f4f33d988b8b398124edf69b
Yunqing Wang [Fri, 14 Nov 2014 23:11:47 +0000 (15:11 -0800)]
Merge "vp9_ethread: include a pointer to mb in VP9_COMP"
Yunqing Wang [Fri, 14 Nov 2014 20:31:06 +0000 (12:31 -0800)]
vp9_ethread: include a pointer to mb in VP9_COMP
Modified VP9_COMP struct to include MACROBLOCK *mb. This change
makes it feasible in multi-thread case to allocate a mb for each
thread.
Change-Id: I624d6d1aa9c132362200753e5d90b581b1738d6e
Marco [Fri, 14 Nov 2014 19:36:05 +0000 (11:36 -0800)]
Merge "Add more rate control stats to vpx_temporal_svc_encoder."
Marco [Thu, 13 Nov 2014 23:03:45 +0000 (15:03 -0800)]
Add more rate control stats to vpx_temporal_svc_encoder.
Average and variance of actual encoding bitrate over fixed short-time window.
Change-Id: Icc3d6d93792c81e1fb9fd3b56008bf47a503b204
Yunqing Wang [Fri, 14 Nov 2014 02:35:01 +0000 (18:35 -0800)]
Merge "vp9_ethread: modify the cyclic refresh struct"
Yaowu Xu [Thu, 13 Nov 2014 23:50:30 +0000 (15:50 -0800)]
Merge "adapt the adjustment limit for rate correction factor in RTC mode"
Yunqing Wang [Thu, 13 Nov 2014 21:38:23 +0000 (13:38 -0800)]
vp9_ethread: modify the cyclic refresh struct
Two members in struct CYCLIC_REFRESH
int64_t projected_rate_sb;
int64_t projected_dist_sb;
are updated at the superblock level, which makes them shared data
in the multi-thread situation, and requires extra work to handle
them. However, those values are updated and used immediately, and
therefore can be removed. This patch cleaned up the code and
removed the two members.
Change-Id: I2c6ee4552bf49fb63ce590cdb47f9723974fffb1
Adrian Grange [Thu, 13 Nov 2014 23:01:49 +0000 (15:01 -0800)]
Merge "Prepare for dynamic frame resizing in the recode loop"
Paul Wilkins [Thu, 13 Nov 2014 23:00:41 +0000 (15:00 -0800)]
Merge "Fix 32 bit build emms problem."
Johann [Thu, 13 Nov 2014 22:49:22 +0000 (14:49 -0800)]
Merge "Revert "Fix SSSE3 partial_idct_test detection""
Jingning Han [Thu, 13 Nov 2014 21:59:02 +0000 (13:59 -0800)]
Merge "Use reconstructed pixels for intra prediction"
Jingning Han [Thu, 13 Nov 2014 21:58:21 +0000 (13:58 -0800)]
Merge "Refactor nonrd_use_partition coding process"
Adrian Grange [Mon, 10 Nov 2014 17:51:38 +0000 (09:51 -0800)]
Prepare for dynamic frame resizing in the recode loop
Prepare for the introduction of frame-size change
logic into the recode loop.
Separated the speed dependent features into
separate static and dynamic parts, the latter being
those features that are dependent on the frame size.
Change-Id: Ia693e28c5cf069a1a7bf12e49ecf83e440e1d313
Paul Wilkins [Wed, 12 Nov 2014 17:36:49 +0000 (09:36 -0800)]
Fix 32 bit build emms problem.
Add extra vp9_clear_system_state() calls to fix
double / mmx issue introduced into first pass
code for 32 bit builds.
Change-Id: I84cd2986b80d83650a091ab25c43755efeb82e03
James Zern [Thu, 13 Nov 2014 18:59:54 +0000 (10:59 -0800)]
Revert "Fix SSSE3 partial_idct_test detection"
This reverts commit
7d07f512cd87446eef541e9af4af19b1e8c6342a.
this breaks visual studio builds:
'#' : invalid character : possibly the result of a macro expansion
Change-Id: I77170d549afb71e75a878fa0f6acd204fe8d9e67
Yaowu Xu [Thu, 13 Nov 2014 19:20:04 +0000 (11:20 -0800)]
adapt the adjustment limit for rate correction factor in RTC mode
Rate correction factor is used to correct the estimated rate for any
given quantizer, and feeds into rate control for quantizer selection.
We make use of the actual bits used to calculate this rate correction
factor with an adjustment limit to prevent over-adjustment.
This commit adapts the adjustment limit to the difference between the
estimated bits and the actual bits, allows the adjustment limit to vary
between 0.125 (when estimate is close to actual) and 0.625 (when there
is >10X factor off between estimated and actual bits). By doing this,
the commit appears to have largely corrected two observed issues:
1. Adjustment is too slow when the actual bits used is way off from
estimate due to the small adjustment limit.
2. Extreme oscillating quantizer choices due to the feedback loop.
Change-Id: I4ee148d2c9d26d173b6c48011313ddb07ce2d7d6
Deb Mukherjee [Thu, 13 Nov 2014 11:27:11 +0000 (03:27 -0800)]
Merge "Vidyo: Turn off keyframes in higher spatial layers"
Debargha Mukherjee [Thu, 13 Nov 2014 05:20:34 +0000 (21:20 -0800)]
Merge "Added highbitdepth sse2 SAD acceleration and tests"
Marco [Thu, 13 Nov 2014 01:34:54 +0000 (17:34 -0800)]
Remove vp9_temporal_denoising from experimental list.
Change-Id: I7a941c0efbb5912aa97348da6fd1aba28df4482a
Peter de Rivaz [Thu, 16 Oct 2014 12:41:55 +0000 (13:41 +0100)]
Added highbitdepth sse2 SAD acceleration and tests
Change-Id: I1a74a1b032b198793ef9cc526327987f7799125f
(cherry picked from commit
b1a6f6b9cb47eafe0ce86eaf0318612806091fe5)
Johann [Wed, 12 Nov 2014 20:20:36 +0000 (12:20 -0800)]
Merge "Fix SSSE3 partial_idct_test detection"
Yaowu Xu [Wed, 12 Nov 2014 16:00:26 +0000 (08:00 -0800)]
Merge "Use normal rate_correction_factor for gf in CBR mode"
Deb Mukherjee [Wed, 12 Nov 2014 00:09:07 +0000 (16:09 -0800)]
Vidyo: Turn off keyframes in higher spatial layers
Change-Id: Icdd5e71cd6a2b59bc4b3b972af9e4d4a36821792
Deb Mukherjee [Wed, 12 Nov 2014 00:03:11 +0000 (16:03 -0800)]
Merge "Vidyo: Support for one-pass rc-enabled SVC encoder"
Jingning Han [Sat, 8 Nov 2014 01:50:55 +0000 (17:50 -0800)]
Use reconstructed pixels for intra prediction
This commit makes the speed -6 and above use the reconstructed
boundary pixels for precise intra prediction. This allows more
intra prediction modes to be tested in the non-RD coding process.
Enabling horizontal and vertical intra prediction modes can
improve the speed -6 compression performance for rtc set
by 0.331%.
Change-Id: I3a99f9d12c6af54de2bdbf28c76eab8e0905f744
Paul Wilkins [Tue, 11 Nov 2014 17:17:33 +0000 (09:17 -0800)]
Merge "AQ1 - remove first pass weights."
Yaowu Xu [Mon, 10 Nov 2014 19:46:58 +0000 (11:46 -0800)]
Use normal rate_correction_factor for gf in CBR mode
I0c5f010 changed to allow update golden reference buffer in CBR mode,
this commit changes the use of rate_correction_factor for those frames
to be aligned with the new usage. This commit attempts to solve two
issues:
a. Initialization of rate correction factor for Golden Frame
Prior to this patch, even the regular inter frame has been update
the rate correction factor based on content and encoding results,
the first golden frame would still use the ininitialized value
that can be way off.
b. Allowing rate correction factor update to be slightly faster
Prior to this patch, when the rate correction factor is off, the
update to the factor is too slow, the factor could not get close
to a semi-correct value even after many frames.
The commit helps all clips in psnr/ssim metric, but especially to
a few clip in RTC set that rate correction was way off. For example
thaloundeskmtgvga gained about .5dB for both overall/average psnr.
Change-Id: I0be5c41691be57891d824505348b64be87fa3545
Deb Mukherjee [Fri, 7 Nov 2014 19:01:53 +0000 (11:01 -0800)]
Vidyo: Support for one-pass rc-enabled SVC encoder
Adds support for one-pass rc-enabled SVC encoder with callbacks for
getting per-layer packets.
- the callback function registration is implemented as an encoder
control function.
- if the callback function is not registered, the old way of
aggregating packets with superframe will take effect.
- one more control function “VP9E_GET_SVC_LAYER_ID” has been
implemented to get the temporal/spatial id from the encoder
within the callback. This can be used to get the ids to put on RTP
packet.
Change-Id: I1a90e00135dde65da128b758e6c00b57299a111a
Deb Mukherjee [Mon, 10 Nov 2014 23:39:46 +0000 (15:39 -0800)]
Merge "Iadst transforms to use internal low precision"
Johann [Mon, 10 Nov 2014 18:22:09 +0000 (10:22 -0800)]
Merge "Remove asm offset dependencies"
Alex Converse [Sat, 8 Nov 2014 00:21:37 +0000 (16:21 -0800)]
Merge "Fix LAST SKIP when considering GOLDEN"
Deb Mukherjee [Thu, 6 Nov 2014 20:50:33 +0000 (12:50 -0800)]
Iadst transforms to use internal low precision
Change-Id: I266777d40c300bc53b45b205144520b85b0d6e58
(cherry picked from commit
a1b726117f5470f227bc90cd030b7d25045dc510)
Deb Mukherjee [Fri, 7 Nov 2014 18:19:46 +0000 (10:19 -0800)]
Test name changes to use SSE/SSE2 exactly
Change-Id: I3b5a478d198868c2796366f0ac59d0e2036308b8
Alex Converse [Fri, 7 Nov 2014 21:44:17 +0000 (13:44 -0800)]
Fix LAST SKIP when considering GOLDEN
Change-Id: I39d9f13fa34984ee9dad0c4f303ef672635f420e
Paul Wilkins [Fri, 7 Nov 2014 17:22:12 +0000 (09:22 -0800)]
Merge "Add intra complexity and brightness weight to first pass."
Yaowu Xu [Fri, 7 Nov 2014 14:24:59 +0000 (06:24 -0800)]
Merge "Change the use of a reserved color space entry"
Paul Wilkins [Fri, 7 Nov 2014 14:11:29 +0000 (14:11 +0000)]
AQ1 - remove first pass weights.
Removed redundant weighting function tied for AQ1 from first
pass code.
Improvment in baseline AQ1 results:-
Derf opsnr +0.142% SSIm +0.258%
YT opsnr +0.173% SSIm +0.3%
Change-Id: I16ef91caf2d7f302cd5940cc5e2626d48ebcb212
Yaowu Xu [Fri, 7 Nov 2014 03:09:30 +0000 (19:09 -0800)]
Change the use of a reserved color space entry
This commit rename a reserved color space entry to BT_2020, it intends
to provide support for VP9 bitstream to pass along the color space
type defined in BT.2020(Rec.2020)
please note this entry does not have any effect on encoding/decoding
behavior, but allow applications to the pass the information along
from encoding end to decoding end.
Change-Id: I4678520e89141ea5e8900f7bd1c0e95b710b7091
Jingning Han [Thu, 6 Nov 2014 19:53:18 +0000 (11:53 -0800)]
Refactor nonrd_use_partition coding process
This commit integrates the non-RD mode decision process and the
encoding process into a single recursion scheme.
Change-Id: I6a7e72a0b84d567554801ebbe01ec75d54c1f77d
Johann [Thu, 31 Jul 2014 21:19:31 +0000 (14:19 -0700)]
Remove asm offset dependencies
The obj_int_extract code is no longer worth maintaining. It creates
significant issues when adapting for different build systems and no
longer offers as significant of a performance benefit due to
improvements in intrinsics.
Source files will remain until the various third-party builds are updated.
The neon fast quantizer has been moved to intrinsics. The armv6 version
has been removed because so few remaining targets require it.
Compilers and processors have improved significantly since the
pack_tokens code was written. The assembly is no longer faster than the
C code.
pack_tokens were the only optimizations for the armv5te targets so the targets
will be removed after the test infrastructure has been updated.
BUG=710
Change-Id: Ic785b167cd9f95eeff31c7c76b7b736c07fb30eb
Yunqing Wang [Thu, 6 Nov 2014 21:08:57 +0000 (13:08 -0800)]
Merge "Modify the frame context memory deallocation"
Jingning Han [Thu, 6 Nov 2014 20:03:15 +0000 (12:03 -0800)]
Merge "Remove unused is_background function"
Jingning Han [Thu, 6 Nov 2014 20:03:07 +0000 (12:03 -0800)]
Merge "Rework cut-off decisions in cyclic refresh aq mode"
Yunqing Wang [Thu, 30 Oct 2014 01:38:18 +0000 (18:38 -0700)]
Modify the frame context memory deallocation
This patch was to fix the vpxdec fuzzing3 test failure. When an
error occurs, setjmp() is invoked, which calls the decoder
removing routine. In multiple thread situation, other threads
could try to access the frame context memory that is already
deallocated, thus causing a segfault.
An invalid unit test was added for this issue.
Change-Id: Ida7442154f3d89759483f0f4fe0324041fffb952
Paul Wilkins [Tue, 4 Nov 2014 13:13:39 +0000 (13:13 +0000)]
Add intra complexity and brightness weight to first pass.
The aim of this patch is to apply a positive weighting to
frames that have a significant number of blocks that are
of low spatial complexity and are dark. The rationale behind
this is that artifacts tend to be more visible in such frames.
In this patch the weight is only applied in regard to the distribution
of bits between frames. Hence if all the frames share similar
characteristics (as is the case for most of our short test clips) there
will be little or no net effect.
However, the effect can be seen on some longer form test content.
For example Tears of steel baseline test:
2323.09 Kbit/s opsnr 39.915 ssim 74.729
With this patch:-
2213.34 Kbit/s opsnr 39.963 ssim 74.808
(Sligtly better metrics and about 5% smaller)
The weighting may well need some further tuning along side changes
to the aq modes.
Change-Id: Ieced379bca03938166ab87b2b97f55d94948904c
Jingning Han [Tue, 4 Nov 2014 22:42:07 +0000 (14:42 -0800)]
Remove unused is_background function
Change-Id: Ia540eac5f066ae95280c2f898370eddf0110c279
Jingning Han [Tue, 4 Nov 2014 22:32:04 +0000 (14:32 -0800)]
Rework cut-off decisions in cyclic refresh aq mode
This commit removes the cyclic aq mode dependency on
in_static_area and reworks the corresponding cut-off thresholds.
It improves the compression performance of speed -5 by 1.47% in
PSNR and 2.07% in SSIM, and the compression performance of speed
-6 by 3.10% in PSNR and 5.25% in SSIM. Speed wise, about 1% faster
in both settings at high bit-rates.
Change-Id: I1ffc775afdc047964448d9dff5751491ba4ff4a9
hkuang [Thu, 6 Nov 2014 01:48:47 +0000 (17:48 -0800)]
Merge "Totally remove prev_mi in VP9 decoder."
hkuang [Mon, 3 Nov 2014 19:23:22 +0000 (11:23 -0800)]
Totally remove prev_mi in VP9 decoder.
This will save the memory and improve the decode speed due to
removing unnecessary memset of big prev_mi array for
all the key frames.
Decoding a all key frames 1080p video shows speed improve around 2%.
Change-Id: I6284a445c1291056e3c15135c3c20d502f791c10
Yaowu Xu [Wed, 5 Nov 2014 23:34:13 +0000 (15:34 -0800)]
Merge "Fix visual studio 2013 compiler warnings"
Johann [Wed, 5 Nov 2014 22:05:26 +0000 (14:05 -0800)]
Merge "vp8 quantize is now in intrinsics, not asm"
Yaowu Xu [Wed, 5 Nov 2014 19:29:21 +0000 (11:29 -0800)]
Fix visual studio 2013 compiler warnings
For configured with --enable-vp9-highbitdepth
Change-Id: I2b181519d7192f8d7a241ad5760c3578255f24e6
Hui Su [Wed, 5 Nov 2014 21:33:09 +0000 (13:33 -0800)]
Merge "Simplify interface of write_selected_tx_size and read_tx_size"
Johann [Wed, 5 Nov 2014 20:56:04 +0000 (12:56 -0800)]
Fix SSSE3 partial_idct_test detection
The test filter is not a prefix matcher. It requires test type to
contain no more than the optimization type. In this example, SSSE3_64
fails to match and the test is not skipped even when SSSE3 is not
available.
Change-Id: Ia74229a167c88da4e6da169012a7a77d438c3f75
Johann [Wed, 5 Nov 2014 18:58:26 +0000 (10:58 -0800)]
vp8 quantize is now in intrinsics, not asm
Change-Id: Ie106c5335c9ba5aac81e23150e7026fb6ea6196e
Jingning Han [Wed, 5 Nov 2014 20:04:10 +0000 (12:04 -0800)]
Merge "Skip ref frame mode search conditioned on predicted mv residuals"
James Zern [Wed, 5 Nov 2014 12:11:34 +0000 (13:11 +0100)]
quantize_test: delete QuantizePair related tests
functions were removed in:
2134eb2 Remove pair quantization
fixes ARM build
Change-Id: I634de50c2752408381d70afb0f5088b61052853c
Johann [Wed, 5 Nov 2014 00:29:19 +0000 (16:29 -0800)]
Merge "Remove pair quantization"
Johann [Wed, 5 Nov 2014 00:28:46 +0000 (16:28 -0800)]
Merge "vp8 quantization -> intrinsics"
Hui Su [Fri, 31 Oct 2014 00:56:36 +0000 (17:56 -0700)]
Simplify interface of write_selected_tx_size and read_tx_size
Change-Id: Ia2b2a895deefaaf7b34bf26df86add56dbab082c
Minghai Shang [Tue, 4 Nov 2014 23:57:16 +0000 (15:57 -0800)]
Merge "[spatial svc] Make spatial svc working for one pass rate control"
hkuang [Tue, 4 Nov 2014 22:43:47 +0000 (14:43 -0800)]
Merge "Fix the memory leak due to missing free frame_mvs."
Johann [Tue, 4 Nov 2014 22:28:45 +0000 (14:28 -0800)]
Merge "Avoid divide-by-zero in vp8 initialization"
hkuang [Mon, 3 Nov 2014 20:25:18 +0000 (12:25 -0800)]
Fix the memory leak due to missing free frame_mvs.
Change-Id: I2ceee7341d906259002c0ea31ea009ae32c04bfd
Johann [Tue, 4 Nov 2014 18:46:24 +0000 (10:46 -0800)]
Avoid divide-by-zero in vp8 initialization
Check that the numerator is not zero. If it is, guess 30fps.
Fixes a clang IOC error in the quantize test. It's very unlikely for
this to occur in the wild because the setup in the quantize test is very
nonstandard.
Change-Id: Icdab7b81d4e168d3423e14db20787f960052e0c3
Minghai Shang [Tue, 4 Nov 2014 19:46:48 +0000 (11:46 -0800)]
[spatial svc] Make spatial svc working for one pass rate control
Change-Id: Ibd9114485c3d747f9d148f64f706bf873ea473ac
Johann [Tue, 4 Nov 2014 18:32:51 +0000 (10:32 -0800)]
Align structures in quantize test
MACROBLOCKD structures require alignment.
This fixes an issue caught by clang IOC.
Change-Id: Ibb5bcc122f531b4302a87e1144e4feaf46c1de64
Johann [Tue, 4 Nov 2014 16:20:25 +0000 (08:20 -0800)]
Add some indirection to the quantize test
Visual Studio and XCode require the address of the functions be passed.
Change-Id: Id39cab8e50061fdc6ac6018371a3a158c713e14b
Jingning Han [Tue, 4 Nov 2014 17:11:15 +0000 (09:11 -0800)]
Merge "Refactor sub-pixel motion search unit"
Jingning Han [Mon, 3 Nov 2014 19:36:34 +0000 (11:36 -0800)]
Skip ref frame mode search conditioned on predicted mv residuals
This commit makes the RTC coding mode to conditionally skip the
reference frame mode search, when the predicted motion vector of
the current reference frame gives more than two times sum of
absolute difference compared to that of other reference frames.
It reduces the runtim by 1% - 4% for speed -5 and -6. The average
compression performance is improved by about 0.1% in both settings.
It is of particular benefit to light change scenarios. The
compression performance of test clip mmmovingvga.y4m is improved by
6.39% and 15.69% at high bit rates for speed -5 and -6, respectively.
Speed -5
vidyo1 16555 b/f, 40.818 dB, 12422 ms ->
16552 b/f, 40.804 dB, 12100 ms
nik 33211 b/f, 39.138 dB, 11341 ms ->
33228 b/f, 39.139 dB, 11023 ms
mmmoving 33263 b/f, 40.935 dB, 13508 ms ->
33256 b/f, 41.068 dB, 12861 ms
Speed -6
vidyo1 16541 b/f, 40.227 dB, 8437 ms ->
16540 b/f, 40.220 dB, 8216 ms
nik 33272 b/f, 38.399 dB, 7610 ms ->
33267 b/f, 38.414 dB, 7490 ms
mmmoving 33255 b/f, 40.555 dB, 7523 ms ->
33257 b/f, 40.975 dB, 7493 ms
Change-Id: Id2aef76ef74a3cba5e9a82a83b792144948c6a91
Yunqing Wang [Tue, 4 Nov 2014 00:56:38 +0000 (16:56 -0800)]
Merge "WORKAROUND FIX FOR GCC4.9.1"
Johann [Mon, 3 Nov 2014 23:45:21 +0000 (15:45 -0800)]
Merge "vp8 quantize test"
Marco [Mon, 3 Nov 2014 22:38:05 +0000 (14:38 -0800)]
Merge "Allow disable of refresh golden for more than 1 layer encoding."
Johann [Mon, 20 Oct 2014 17:04:45 +0000 (10:04 -0700)]
vp8 quantize test
Test for Regular, Fast and Pair quantization
Change-Id: I0a26c164afe632db869099402189826c0d43f9a2
Jingning Han [Sun, 2 Nov 2014 18:45:32 +0000 (10:45 -0800)]
Refactor sub-pixel motion search unit
This commit unfolds the legacy macro definitions used in the
sub-pixel motion search and refactors the operational flow for
later optimizations.
Change-Id: I3e3f770cad961d03d1a6eb0b2a0186cc77eaf2b8
Jingning Han [Mon, 3 Nov 2014 16:46:42 +0000 (08:46 -0800)]
Merge "Fix the THR_MODES array used in vp9_pick_inter_mode"
Yaowu Xu [Mon, 3 Nov 2014 16:02:58 +0000 (08:02 -0800)]
Merge "Fix speed 7 and speed 12 for rt"
Marco [Mon, 3 Nov 2014 06:10:56 +0000 (22:10 -0800)]
Allow disable of refresh golden for more than 1 layer encoding.
The current logic was allowing for disabling golden refresh only
for two pass svc encoding. This change disables it as long as
more than 1 layer encoding is used (for example temporal layers under 1pass CBR).
Change-Id: I4dc5204a7ad365c821ec7963e93b59da82e1826b
Jingning Han [Sun, 2 Nov 2014 20:18:53 +0000 (12:18 -0800)]
Fix the THR_MODES array used in vp9_pick_inter_mode
Fix the alignment of entries fo intra prediction modes.
Change-Id: Ie32ad87cf90694efd591a4b1cc29c916c4cd56f7
levytamar82 [Thu, 16 Oct 2014 23:56:37 +0000 (16:56 -0700)]
WORKAROUND FIX FOR GCC4.9.1
In the function mb_lpf_horizontal_edge_w_avx2_16 the usage of the intrinsic
_mm256_cvtepu8_epi16 cause a compiler bug in gcc 4.9.1.
until it will be fixed I created a workaround that create the up convert by
using broadcast128+shuffle.
The bug was reported here:
https://code.google.com/p/webm/issues/detail?id=867
Change-Id: I73452e6806f42e0fadcde96b804ea3afa7eeb351
Yaowu Xu [Sat, 1 Nov 2014 00:27:54 +0000 (17:27 -0700)]
Fix speed 7 and speed 12 for rt
A recent change has introduced big quality drops for speed 7 and 12
for --rt mode. The change reverted the big drop and improved quality
by 9.5% for speed 7 and 13.4% for speed 12.
Change-Id: I07b82e3bb6002a73af486a083458c88877bdad01
hkuang [Mon, 27 Oct 2014 23:19:04 +0000 (16:19 -0700)]
Bind motion vectors with frame buffer structure.
This will save a lot of memory for decoder due to removing of prev_mi,
but prev_mi is still needed in encoder. So this will increase a little bit
memory for encoder.
Change-Id: I24b2f1a423ebffa55a9bd2fcee1077dac995b2ed
Johann [Fri, 31 Oct 2014 20:42:55 +0000 (13:42 -0700)]
Remove pair quantization
The intrinsics version of the pair quant is slower than running it
individually.
Change-Id: I7b4ea8599d4aab04be0a5a0c59b8b29a7fc283f4
Johann [Mon, 20 Oct 2014 23:03:28 +0000 (16:03 -0700)]
vp8 quantization -> intrinsics
Use intrinsics for neon quantization. Slight loss (<5%) of performance
compared to the assembly. Roughly 10x faster on arm64 because that was
running C code before.
Change-Id: I7cf5242d8f29b7eab5bca6a1c20c89c9fc9ca66d
Jingning Han [Fri, 31 Oct 2014 15:55:40 +0000 (08:55 -0700)]
Merge "Fix mode index use case in vp9_pick_inter_mode"
Jingning Han [Fri, 31 Oct 2014 15:55:28 +0000 (08:55 -0700)]
Merge "Refactor vp9_update_rd_thresh_fact"
Jingning Han [Fri, 31 Oct 2014 15:55:24 +0000 (08:55 -0700)]
Merge "Rework pred pixel buffer system in non-RD coding mode"
Jingning Han [Thu, 30 Oct 2014 02:30:12 +0000 (19:30 -0700)]
Fix mode index use case in vp9_pick_inter_mode
This improves coding performance of speed -5 and -6 by 0.6%,
respectively.
Change-Id: Ic5a7746a88c73285f0b14333d35dc16b02152c25
Jingning Han [Thu, 30 Oct 2014 00:37:54 +0000 (17:37 -0700)]
Refactor vp9_update_rd_thresh_fact
Reduce the scope of function parameters.
Change-Id: Ifef2cfb559908a97498ffdbd6ea53da1cd45a73c
Jingning Han [Wed, 29 Oct 2014 23:37:16 +0000 (16:37 -0700)]
Rework pred pixel buffer system in non-RD coding mode
This commit makes the inter prediction buffer system to support
hybrid partition search. It reduces the runtime of speed -5 by
about 3%. No compression performance change.
vidyo1 720p 1000 kbps
11831 ms -> 11497 ms
nik 720p 1000 kbps
10919 ms -> 10645 ms
Change-Id: I5b2da747c6395c253cd074d3907f5402e1840c36
Hui Su [Thu, 30 Oct 2014 18:05:04 +0000 (11:05 -0700)]
Merge "Move the definition of switchable filter numbers into enum INTERP_FILTER; Modify the macro ADD_MV_REF_LIST and IF_DIFF_REF_FRAME_ADD_MV."
Hui Su [Thu, 30 Oct 2014 18:02:31 +0000 (11:02 -0700)]
Merge "Combine vp9_encode_block_intra and encode_block_intra"