review.tizen.org Git - platform/upstream/llvm.git/commit

[X86] Take advantage of the lzcnt instruction on btver2 architectures when ORing comparisons to zero.

This change adds transformations such as:
  zext(or(setcc(eq, (cmp x, 0)), setcc(eq, (cmp y, 0))))
  To:
  srl(or(ctlz(x), ctlz(y)), log2(bitsize(x)))
This optimisation is beneficial on the Jaguar architecture only, where lzcnt has a good reciprocal throughput.
Other architectures such as Intel's Haswell/Broadwell or AMD's Bulldozer/Piledriver do not benefit from it.
For this reason the change also adds a "HasFastLZCNT" feature which gets enabled for Jaguar.
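
The equivalence behind the transformation can be checked with a small standalone sketch. It is not the code from this commit: it assumes 32-bit operands (so log2(bitsize(x)) is 5), and the helper names ctlz32, orOfZeroCompares and orOfZeroComparesViaCtlz are hypothetical, chosen for illustration only. The key point is that a count of leading zeros with lzcnt semantics equals 32 only when the input is zero, so bit 5 of the result acts as the "is zero" flag.

```cpp
#include <cassert>
#include <cstdint>

// Portable count-leading-zeros with lzcnt semantics (hypothetical helper):
// returns 32 when x == 0, otherwise a value in the range 0..31.
inline uint32_t ctlz32(uint32_t x) {
  uint32_t n = 0;
  for (uint32_t mask = 0x80000000u; mask != 0 && (x & mask) == 0; mask >>= 1)
    ++n;
  return n;
}

// Original form: zext(or(setcc(eq, (cmp x, 0)), setcc(eq, (cmp y, 0))))
inline uint32_t orOfZeroCompares(uint32_t x, uint32_t y) {
  return static_cast<uint32_t>((x == 0) | (y == 0));
}

// Transformed form: srl(or(ctlz(x), ctlz(y)), log2(bitsize(x)))
// Bit 5 of ctlz32(v) is set only when v == 0, so after ORing the two counts
// and shifting right by 5, the result is 1 iff either input is zero.
inline uint32_t orOfZeroComparesViaCtlz(uint32_t x, uint32_t y) {
  return (ctlz32(x) | ctlz32(y)) >> 5; // log2(32) == 5
}

// Compares both forms on a few sample inputs, including the edge cases.
inline void selfCheck() {
  const uint32_t samples[] = {0u, 1u, 2u, 0x80000000u, 0xFFFFFFFFu};
  for (uint32_t x : samples)
    for (uint32_t y : samples)
      assert(orOfZeroCompares(x, y) == orOfZeroComparesViaCtlz(x, y));
}
```

The transformed form replaces two compare-and-set sequences with two lzcnt instructions, one or and one shift, which only pays off where lzcnt is cheap, as the commit message notes for Jaguar.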

Differential Revision: https://reviews.llvm.org/D23446

llvm-svn: 284248

author	Pierre Gousseau <pierregousseau14@gmail.com>
	Fri, 14 Oct 2016 16:41:38 +0000 (16:41 +0000)
committer	Pierre Gousseau <pierregousseau14@gmail.com>
	Fri, 14 Oct 2016 16:41:38 +0000 (16:41 +0000)
commit	b6d652adb5b12b7d1fc7e973a5afc019875cb547
tree	a8196cccb70b3ebb8eba63d17f04efa2f7f31fde
parent	6d6eca5cdc995fdb8850fd5c79d1018893a44988

llvm/lib/Target/X86/X86.td
llvm/lib/Target/X86/X86ISelLowering.cpp
llvm/lib/Target/X86/X86ISelLowering.h
llvm/lib/Target/X86/X86InstrInfo.td
llvm/lib/Target/X86/X86Subtarget.cpp
llvm/lib/Target/X86/X86Subtarget.h
llvm/test/CodeGen/X86/lzcnt-zext-cmp.ll	[new file with mode: 0644]