Optimize fused_dropout_kernel launch bounds for AMD hardware
authorJohannes M Dieterich <johannes.dieterich@amd.com>
Mon, 11 Mar 2019 21:39:07 +0000 (14:39 -0700)
committerFacebook Github Bot <facebook-github-bot@users.noreply.github.com>
Mon, 11 Mar 2019 21:45:42 +0000 (14:45 -0700)
commitfa29c179b708881ae0985fbe6ad4065256e769bb
treeb75e81f8f75d5a210074fc711cf5ccdc736de490
parent3f1d0ee5d5f48bb0fbef433a61cef0be9ad40a76
Optimize fused_dropout_kernel launch bounds for AMD hardware

Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/17870

Differential Revision: D14409990

Pulled By: ezyang

fbshipit-source-id: 0452282f459770823641b2527f47b1186ab14666
aten/src/ATen/native/cuda/Dropout.cu