review.tizen.org Git - platform/upstream/llvm.git/commit

author	David Green <david.green@arm.com>
	Wed, 22 Sep 2021 11:07:52 +0000 (12:07 +0100)
committer	David Green <david.green@arm.com>
	Wed, 22 Sep 2021 11:07:52 +0000 (12:07 +0100)
commit	02cd8a6b915a9dab32fdd91167f875ce5f67ebd4
tree	b09ab74185441018efd80ed802ac11cc5a7fd068	tree \| snapshot
parent	a5e1c746b870d79142419a07a8aecc471eacfed1	commit \| diff

[ARM] Allow smaller VMOVL in tail predicated loops

This allows VMOVL in tail predicated loops so long as the the vector
size the VMOVL is extending into is less than or equal to the size of
the VCTP in the tail predicated loop. These cases represent a
sign-extend-inreg (or zero-extend-inreg), which needn't block tail
predication as in https://godbolt.org/z/hdTsEbx8Y.

For this a vecsize has been added to the TSFlag bits of MVE
instructions, which stores the size of the elements that the MVE
instruction operates on. In the case of multiple size (such as a
MVE_VMOVLs8bh that extends from i8 to i16, the largest size was be
chosen). The sizes are encoded as 00 = i8, 01 = i16, 10 = i32 and 11 =
i64, which often (but not always) comes from the instruction encoding
directly. A unit test was added, and although only a subset of the
vecsizes are currently used, the rest should be useful for other cases.

Differential Revision: https://reviews.llvm.org/D109706

llvm/lib/Target/ARM/ARMInstrFormats.td		diff \| blob \| history
llvm/lib/Target/ARM/ARMInstrMVE.td		diff \| blob \| history
llvm/lib/Target/ARM/ARMLowOverheadLoops.cpp		diff \| blob \| history
llvm/lib/Target/ARM/MCTargetDesc/ARMBaseInfo.h		diff \| blob \| history
llvm/test/CodeGen/Thumb2/mve-vmovlloop.ll		diff \| blob \| history
llvm/unittests/Target/ARM/MachineInstrTest.cpp		diff \| blob \| history