Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63416
Fix a missing thread local specifier introduced by recent PR
https://github.com/pytorch/pytorch/pull/61002
Test Plan: Unit Tests
Reviewed By: ngimel
Differential Revision:
D30376154
fbshipit-source-id:
c70d37ec85c3eba88eb87f766f1c4e7aeff8eaf9
thread_local at::ScalarType autocast_cpu_dtype = at::kBFloat16;
// autocast_gpu_dtype is the lower_precision_fp used by AutocastGPU.
-at::ScalarType autocast_gpu_dtype = at::kHalf;
+thread_local at::ScalarType autocast_gpu_dtype = at::kHalf;
}
void clear_cache() {