multi-dim standard deviation for CUDA. (#14990)
author    Brennan Vincent <btv@fb.com>
          Thu, 20 Dec 2018 16:53:44 +0000 (08:53 -0800)
committer Facebook Github Bot <facebook-github-bot@users.noreply.github.com>
          Thu, 20 Dec 2018 16:56:32 +0000 (08:56 -0800)
commit    7a764fe270ef06f364e6e504db2ce5959660bd8f
tree      a80c571fe50512c4dfd030404bb10f150f35cfb1
parent    5e624948b65ff32f927eed7e4fa1002b4113f8c1
multi-dim standard deviation for CUDA. (#14990)

Summary:
This is the CUDA version of #14535.
It refactors Reduce.cuh to allow more general classes of reductions to be performed: we no longer assume that the temporary data produced during the reduction is a single scalar, and instead allow an arbitrary accumulate type.
We also allow 64-bit indexing when necessary, since in general we will no longer be able to accumulate directly in the output. (In the cases where we can, we continue to split the tensors until they can be addressed with 32 bits, as before.)
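
To make the refactor concrete, here is a minimal Python sketch (not the actual CUDA kernel) of a Welford-style accumulation of the kind the std reduction relies on: each partial result is a (mean, m2, n) triple rather than a scalar of the output dtype, and pairs of partials are merged with the parallel-variance combine step. All names here are illustrative:

```python
# Minimal sketch of a Welford-style reduction state: the accumulator
# is a (mean, m2, n) triple, not a single output-dtype scalar.

def welford_combine(a, b):
    # Merge two partial states (Chan et al. parallel variance update).
    mean_a, m2_a, n_a = a
    mean_b, m2_b, n_b = b
    n = n_a + n_b
    if n == 0:
        return (0.0, 0.0, 0)
    delta = mean_b - mean_a
    mean = mean_a + delta * n_b / n
    m2 = m2_a + m2_b + delta * delta * n_a * n_b / n
    return (mean, m2, n)

def welford_std(xs, unbiased=True):
    # Reduce a sequence to its standard deviation; the projection to a
    # single output scalar happens only in this final step.
    state = (0.0, 0.0, 0)
    for x in xs:
        state = welford_combine(state, (float(x), 0.0, 1))
    mean, m2, n = state
    return (m2 / (n - 1 if unbiased else n)) ** 0.5
```

Because the partial state is wider than an output element, the kernel can no longer stash partials in the output tensor itself, which is what motivates the 64-bit indexing fallback above.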
As an initial use case, we implement `std` over multiple dimensions.
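
For instance, after this change something like the following works on CUDA tensors (shapes and dims are arbitrary, chosen for illustration):

```python
import torch

# Reduce std over several dimensions at once on a CUDA tensor.
x = torch.randn(4, 5, 6, device='cuda')
out = x.std(dim=(0, 2))  # reduces dims 0 and 2; result shape: (5,)

# Cross-check by flattening the reduced dims and using the 1-D path.
ref = x.permute(1, 0, 2).reshape(5, -1).std(dim=1)
print(torch.allclose(out, ref))  # expected: True
```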
Pull Request resolved: https://github.com/pytorch/pytorch/pull/14990

Differential Revision: D13405097

Pulled By: umanwizard

fbshipit-source-id: a56c24dc2fd5326d417632089bd3f5c4f9f0d2cb
12 files changed:
aten/src/ATen/cuda/detail/OffsetCalculator.cuh
aten/src/ATen/detail/FunctionTraits.h
aten/src/ATen/native/ReduceOps.cpp
aten/src/ATen/native/SharedReduceOps.h [new file with mode: 0644]
aten/src/ATen/native/cpu/Reduce.h
aten/src/ATen/native/cpu/ReduceOpsKernel.cpp
aten/src/ATen/native/cuda/DeviceSqrt.cuh [new file with mode: 0644]
aten/src/ATen/native/cuda/Normalization.cuh
aten/src/ATen/native/cuda/Reduce.cuh
aten/src/ATen/native/cuda/ReduceOpsKernel.cu
test/test_torch.py
torch/_torch_docs.py