Upgrade fwht4x4_mmx() to fwht4x4_sse2() for vp9 and vp10.
author Linfeng Zhang <linfengz@google.com>
Tue, 17 May 2016 19:42:55 +0000 (12:42 -0700)
committer Linfeng Zhang <linfengz@google.com>
Fri, 27 May 2016 16:51:16 +0000 (09:51 -0700)
commit af7fb17c09158e9039bfe6d72c591347b7c0ceb9
tree b4a5d179a8a7173ded8982b137a66be7db189819
parent bdfbd6325ef8582beb0cb382f7d0f12cc488ecee

A function-level timing test shows about a 27% time saving on
a Xeon E5-2680 v2 desktop.

Rename vp9_dct_sse2.c to vp9_dct_intrin_sse2.c for vp9 and
rename dct_sse2.c to dct_intrin_sse2.c for vp10 to avoid
duplicate basenames.

Note that vp9_fwht4x4_mmx/sse2() and vp10_fwht4x4_mmx/sse2()
are identical. TODO: They should be unified later unless the
duplicate is being kept intentionally.

Change-Id: I3e537b7bbd9ba417c606cd7c68c4dbbfa583f77d
test/fdct4x4_test.cc
vp10/common/vp10_rtcd_defs.pl
vp10/encoder/x86/dct_intrin_sse2.c [moved from vp10/encoder/x86/dct_sse2.c with 100% similarity]
vp10/encoder/x86/dct_mmx.asm [deleted file]
vp10/encoder/x86/dct_sse2.asm [new file with mode: 0644]
vp10/vp10cx.mk
vp9/common/vp9_rtcd_defs.pl
vp9/encoder/x86/vp9_dct_intrin_sse2.c [moved from vp9/encoder/x86/vp9_dct_sse2.c with 100% similarity]
vp9/encoder/x86/vp9_dct_mmx.asm [deleted file]
vp9/encoder/x86/vp9_dct_sse2.asm [new file with mode: 0644]
vp9/vp9cx.mk