[nnc] Support thread level parallelism in fused kernels (#63386)
authorBert Maher <bertrand@fb.com>
Fri, 20 Aug 2021 18:11:49 +0000 (11:11 -0700)
committerFacebook GitHub Bot <facebook-github-bot@users.noreply.github.com>
Fri, 20 Aug 2021 18:18:17 +0000 (11:18 -0700)
commitd6d86efb1c839ddafd1398d6dab9caa4f31a9f0b
treef40c59edf03d64b2d2cf8d95d1650058a3763708
parentc78ab28441f5616899d363d57317f4de29147e5c
[nnc] Support thread level parallelism in fused kernels (#63386)

Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63386

Test Plan: Imported from OSS

Reviewed By: navahgar

Differential Revision: D30360382

Pulled By: bertmaher

fbshipit-source-id: 29acf4e932c669ce0f35823faea9099bcd8119b6
test/cpp/tensorexpr/test_kernel.cpp
torch/csrc/jit/tensorexpr/kernel.cpp
torch/csrc/jit/tensorexpr/llvm_codegen.cpp
torch/csrc/jit/tensorexpr/llvm_jit.h
torch/csrc/jit/tensorexpr/loopnest.cpp