[X86] Improve combineCastedMaskArithmetic to fold (bitcast (vXi1 (and/or/xor X, C...
authorCraig Topper <craig.topper@intel.com>
Thu, 8 Feb 2018 22:26:39 +0000 (22:26 +0000)
committerCraig Topper <craig.topper@intel.com>
Thu, 8 Feb 2018 22:26:39 +0000 (22:26 +0000)
commit9e030c9e0010b423e877d7a7c8ef3742b16a079f
treea089d03822df489973a6d207a0af2bc4321fe3c2
parent1b5b4ccb77b0bd17a09015722c73698ebd72cc95
[X86] Improve combineCastedMaskArithmetic to fold (bitcast (vXi1 (and/or/xor X, C)))->(vXi1 (and/or/xor (bitcast X), (bitcast C)) where C is a constant build_vector.

Most vxi1 constant build vectors have to be implemented in the scalar domain anyway so we'll probably end up with a cast there later. But by then its too late to do the combine to get rid of it.

llvm-svn: 324662
llvm/lib/Target/X86/X86ISelLowering.cpp
llvm/test/CodeGen/X86/avx512-intrinsics-upgrade.ll
llvm/test/CodeGen/X86/avx512-mask-op.ll
llvm/test/CodeGen/X86/avx512-schedule.ll