On Sandy Bridge, loading unaligned 256 bits using two XMM loads (vmovups and vinsertf128...
author Nadav Rotem <nrotem@apple.com>
Fri, 18 Jan 2013 23:10:30 +0000 (23:10 +0000)
committer Nadav Rotem <nrotem@apple.com>
Fri, 18 Jan 2013 23:10:30 +0000 (23:10 +0000)
commit 7431211214d54d0cd8cc0d069447abd22c5da0cb
tree 74da55584f564ef5a3c8e07f1ea4df7c9e7c6c4a
parent 2affc1ea6d27dbd9258cef614725f92c7d2770b3
On Sandy Bridge, loading an unaligned 256-bit value with two XMM loads (vmovups and vinsertf128) is faster than using a single vmovups instruction.

llvm-svn: 172868
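
The split load described in the commit message can be sketched by hand with AVX intrinsics. This is an illustration only, not the code added to X86ISelLowering.cpp: the function name load256_split is hypothetical, the scalar fallback exists only so the sketch builds without AVX enabled, and a compiler is free to select different instructions than the ones named in the comments.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#if defined(__AVX__)
#include <immintrin.h>

/* Hypothetical helper: read eight floats from a possibly unaligned
 * address as two 128-bit halves, then combine them into one 256-bit
 * value and store it to dst. */
static void load256_split(const float *src, float *dst) {
    assert(src != NULL && dst != NULL);
    __m128 lo = _mm_loadu_ps(src);       /* low 128 bits (vmovups xmm) */
    __m128 hi = _mm_loadu_ps(src + 4);   /* high 128 bits              */
    /* Place hi in the upper lane (vinsertf128). */
    __m256 v = _mm256_insertf128_ps(_mm256_castps128_ps256(lo), hi, 1);
    _mm256_storeu_ps(dst, v);
}
#else
/* Assumption: portable fallback so the sketch still compiles when AVX
 * is not enabled; it copies the same two 16-byte halves. */
static void load256_split(const float *src, float *dst) {
    assert(src != NULL && dst != NULL);
    memcpy(dst, src, 4 * sizeof(float));
    memcpy(dst + 4, src + 4, 4 * sizeof(float));
}
#endif
```

Built with AVX enabled (for example -mavx on GCC or Clang), the first variant expresses the two-load form directly. Whether the generated code keeps the two loads or merges them depends on the compiler version and the target CPU it tunes for.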
llvm/lib/Target/X86/X86ISelLowering.cpp
llvm/test/CodeGen/X86/sandybridge-loads.ll [new file with mode: 0644]
llvm/test/CodeGen/X86/v8i1-masks.ll