[Clang][AArch64][SME] Add outer product intrinsics
This patch adds support for the following SME ACLE intrinsics (as defined
in https://arm-software.github.io/acle/main/acle.html):
- svmopa_za32[_bf16]_m // also for s8, u8, f16, f32
- svmops_za32[_bf16]_m // also for s8, u8, f16, f32
- svsumopa_za32[_s8]_m
- svsumops_za32[_s8]_m
- svusmopa_za32[_u8]_m
- svusmops_za32[_u8]_m
When the sme-f64f64 feature is enabled, the following intrinsics are supported:
- svmopa_za64_f64_m
- svmops_za64_f64_m
When the sme-i16i64 feature is enabled, the following intrinsics are supported:
- svmopa_za64[_s16]_m // also for u16
- svmops_za64[_s16]_m // also for u16
- svsumopa_za64[_s16]_m
- svsumops_za64[_s16]_m
- svusmopa_za64[_u16]_m
- svusmops_za64[_u16]_m
Co-authored-by: Sagar Kulkarni <sagar.kulkarni1@huawei.com>
Reviewed By: sdesmalen
Differential Revision: https://reviews.llvm.org/D134681