[NVPTX, CUDA] Added support for m8n32k16 and m32n8k16 variants of wmma instructions.
authorArtem Belevich <tra@google.com>
Wed, 18 Apr 2018 21:51:48 +0000 (21:51 +0000)
committerArtem Belevich <tra@google.com>
Wed, 18 Apr 2018 21:51:48 +0000 (21:51 +0000)
commit0ae8590354b8688e1ec9926abc909b896ea49038
tree3c803aae33ad4fd575d7d3672138519c7b20e0fc
parentc310bfa19397e15903a8f5386b51366aade414b9
[NVPTX, CUDA] Added support for m8n32k16 and m32n8k16 variants of wmma instructions.

The new instructions were added added for sm_70+ GPUs in CUDA-9.1.

Differential Revision: https://reviews.llvm.org/D45068

llvm-svn: 330296
clang/include/clang/Basic/BuiltinsNVPTX.def
clang/lib/CodeGen/CGBuiltin.cpp
clang/lib/Driver/ToolChains/Cuda.cpp
clang/test/CodeGen/builtins-nvptx-sm_70.cu
llvm/include/llvm/IR/IntrinsicsNVVM.td
llvm/lib/Target/NVPTX/NVPTX.td
llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
llvm/lib/Target/NVPTX/NVPTXInstrInfo.td
llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
llvm/test/CodeGen/NVPTX/wmma.py