aarch64: Use RTL builtins for [su]mull_high_n intrinsics
author Jonathan Wright <jonathan.wright@arm.com>
Wed, 3 Feb 2021 14:46:00 +0000 (14:46 +0000)
committer Jonathan Wright <jonathan.wright@arm.com>
Thu, 4 Feb 2021 13:56:50 +0000 (13:56 +0000)
commit 1d6228454c4bca003c6ecedad67866515503b910
tree b7e41f4de1dd3085a3f6a129f9e785544eb9a68c
parent 7002a33d1ba81e4577d965fb9daaee146b31faa8

Rewrite [su]mull_high_n Neon intrinsics to use RTL builtins rather
than inline assembly code, allowing for better scheduling and
optimization.
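
For reference, the arithmetic these intrinsics are expected to perform can be modelled in portable scalar C. The sketch below is not taken from the patch: the function names ref_vmull_high_n_s16 and ref_vmull_high_n_u16 are hypothetical, and it assumes the usual Neon meaning of "high" (lanes 4..7 of the 8-lane input) with each product widened to 32 bits. The 32-bit-lane variants (vmull_high_n_s32, vmull_high_n_u32) would follow the same pattern with lanes 2..3 widened to 64 bits.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical scalar model of vmull_high_n_s16 (not GCC code):
   multiply the upper four 16-bit lanes of `a` by the scalar `b`,
   widening each product to 32 bits so it cannot overflow. */
void ref_vmull_high_n_s16(const int16_t a[8], int16_t b, int32_t out[4])
{
    for (int i = 0; i < 4; i++)
        out[i] = (int32_t)a[i + 4] * (int32_t)b;
}

/* Hypothetical unsigned counterpart, modelling vmull_high_n_u16. */
void ref_vmull_high_n_u16(const uint16_t a[8], uint16_t b, uint32_t out[4])
{
    for (int i = 0; i < 4; i++)
        out[i] = (uint32_t)a[i + 4] * (uint32_t)b;
}
```

A model like this can serve as an oracle when checking that the builtin-based intrinsics produce the same results as the previous inline-asm versions.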

gcc/ChangeLog:

2021-02-03  Jonathan Wright  <jonathan.wright@arm.com>

* config/aarch64/aarch64-simd-builtins.def: Add [su]mull_hi_n
builtin generator macros.
* config/aarch64/aarch64-simd.md
(aarch64_<su>mull_hi_n<mode>_insn): Define.
(aarch64_<su>mull_hi_n<mode>): Define.
* config/aarch64/arm_neon.h (vmull_high_n_s16): Use RTL builtin
instead of inline asm.
(vmull_high_n_s32): Likewise.
(vmull_high_n_u16): Likewise.
(vmull_high_n_u32): Likewise.
gcc/config/aarch64/aarch64-simd-builtins.def
gcc/config/aarch64/aarch64-simd.md
gcc/config/aarch64/arm_neon.h