[Test] We can benefit from pipelining of ymm load/stores
author Max Kazantsev <mkazantsev@azul.com>
Thu, 15 Jul 2021 09:40:34 +0000 (16:40 +0700)
committer Max Kazantsev <mkazantsev@azul.com>
Thu, 15 Jul 2021 10:15:14 +0000 (17:15 +0700)
commit 69a3acffdf1b3f5fc040aaeafc1c77588a607d1a
tree bb4c26c6e830e02abbc17262d01c9413827ed32b
parent dfa76933c29626d08a3538fcc66f120a5bc563b7

This patch demonstrates a scenario where we need to load/store a single
64-byte value, which is done with 2 ymm loads and 2 ymm stores in AVX. The
current codegen chooses the following sequence:

  load ymm0
  load ymm1
  store ymm1
  store ymm0

If we instead stored ymm0 before ymm1, the 2nd load and the 1st store could
execute in parallel, since the store of ymm0 depends only on the 1st load.
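
For illustration only, here is a minimal C sketch of the kind of source that
leads to this pattern. It is not the contents of the new test file; the names
block64 and copy_block64 are hypothetical. The assumption is that, with AVX
enabled (for example clang -O2 -mavx), a 64-byte copy like this is typically
lowered to two 32-byte ymm loads and two ymm stores, which is where the
ordering of the stores starts to matter.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical 64-byte value; the type name is illustrative only. */
typedef struct {
    uint8_t bytes[64];
} block64;

/* Copy one 64-byte value from src to dst.
 * Assumption: with AVX enabled the compiler may split this into two
 * 32-byte halves (bytes 0..31 and bytes 32..63), each moved through
 * one ymm register. The instruction order is the compiler's choice. */
void copy_block64(block64 *dst, const block64 *src) {
    memcpy(dst, src, sizeof(*dst));
}
```

The function itself is order-agnostic: whichever half is stored first, the
result is the same, so reordering the stores is purely a scheduling decision.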

llvm/test/CodeGen/X86/ymm-ordering.ll [new file with mode: 0644]