Slow pshufb removal in 3 intra prediction functions.
authorLinfeng Zhang <linfengz@google.com>
Tue, 24 May 2016 21:32:49 +0000 (14:32 -0700)
committerLinfeng Zhang <linfengz@google.com>
Thu, 2 Jun 2016 17:55:58 +0000 (10:55 -0700)
commitad0646cb848e9facce33b856d3b05a095fc929f2
tree15c64f3bfff7608d6323a89064929ad019edf93e
parent6f397b8a5bbd08b37f2ee8820cc25d218a21ad5d
Slow pshufb removal in 3 intra prediction functions.

Replaced vpx_d45_predictor_4x4_ssse3(), vpx_d45_predictor_8x8_ssse3()
and vpx_d207_predictor_4x4_ssse3() with
created vpx_d45_predictor_4x4_sse2(), vpx_d45_predictor_8x8_sse2()
and vpx_d207_predictor_4x4_sse2() respectively.
It's mostly neutral or slightly worse than ssse3 in good cases and
better than ssse3 in the bad cases (but still worse than using the mmx
regs).

Change-Id: Ib0237ceb71d2c57b8a93fd3170330cfed9d56bdd
test/test_intra_pred_speed.cc
vpx_dsp/vpx_dsp_rtcd_defs.pl
vpx_dsp/x86/intrapred_sse2.asm
vpx_dsp/x86/intrapred_ssse3.asm