Enable SSSE3 inverse 2D-DCT with 10 non-zero coeffs
authorJingning Han <jingning@google.com>
Wed, 28 May 2014 17:51:09 +0000 (10:51 -0700)
committerJingning Han <jingning@google.com>
Wed, 28 May 2014 17:53:33 +0000 (10:53 -0700)
commit6d21cbd20b07dd66f60add80cadd330470f5f529
tree13a13cb5bbf5bdd4c5c9b56736a18d31302e921c
parent239e68ddbfa038e29b9e332503721d6bbae806c3
Enable SSSE3 inverse 2D-DCT with 10 non-zero coeffs

This commit enables SSSE3 implementation of the inverse 2D-DCT
with only first 10 coefficients non-zero. It reduces the runtime
of SSE2 version from 745 cycles to 538 cycles, i.e., 27% speed-up.

Change-Id: I18ba4128859b09c704a6ee361d69a86c09fe8dfe
test/partial_idct_test.cc
vp9/common/vp9_idct.h
vp9/common/vp9_rtcd_defs.pl
vp9/common/x86/vp9_idct_intrin_sse2.c
vp9/common/x86/vp9_idct_intrin_sse2.h
vp9/common/x86/vp9_idct_intrin_ssse3.c