crypto: arm/chacha20 - always use vrev for 16-bit rotates
authorEric Biggers <ebiggers@google.com>
Wed, 25 Jul 2018 01:29:07 +0000 (18:29 -0700)
committerHerbert Xu <herbert@gondor.apana.org.au>
Fri, 3 Aug 2018 10:06:05 +0000 (18:06 +0800)
commit4e34e51f48ab7f77a4022aa810a786daa3eb3e22
treedde19d19ab296050fa3d7a2f362124bdbf7d04f5
parentf53ad3e1b35a558528f6d0041568016d1f623a9d
crypto: arm/chacha20 - always use vrev for 16-bit rotates

The 4-way ChaCha20 NEON code implements 16-bit rotates with vrev32.16,
but the one-way code (used on remainder blocks) implements it with
vshl + vsri, which is slower.  Switch the one-way code to vrev32.16 too.

Signed-off-by: Eric Biggers <ebiggers@google.com>
Acked-by: Ard Biesheuvel <ard.biesheuvel@linaro.org>
Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
arch/arm/crypto/chacha20-neon-core.S