Change Softmax on CUDA to use fp32 for denominator when input/output are fp16.
author    James Qin <jamesqin@google.com>
          Wed, 21 Mar 2018 22:55:30 +0000 (15:55 -0700)
committer TensorFlower Gardener <gardener@tensorflow.org>
          Wed, 21 Mar 2018 22:58:16 +0000 (15:58 -0700)
commit    942a32bc71291994c14625b6311268319dd27808
tree      1ed34c04d06867fd34ef2dcba46351fb7fe6c5bc
parent    9cd65e9a9081640934b2b78cf84b6e51ddd69796
Change Softmax on CUDA to use fp32 for denominator when input/output are fp16.

This avoids potential overflow in the denominator and ensures that the accumulation
is done in high precision.

PiperOrigin-RevId: 189982655
tensorflow/core/kernels/softmax_op_gpu.cu.cc
tensorflow/python/framework/test_util.py
tensorflow/python/kernel_tests/BUILD
tensorflow/python/kernel_tests/softmax_op_test.py
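
Below is a minimal, self-contained CUDA sketch of the idea, not the actual TensorFlow
kernel in softmax_op_gpu.cu.cc: fp16 inputs are widened to fp32 before the max and
exp-sum reductions, so the softmax denominator is accumulated in high precision and
only the final result is rounded back to fp16. The kernel name, single-row layout,
and serial loops are assumptions made for brevity; the real kernel uses parallel
reductions.

// softmax_fp16_fp32_accum.cu -- illustrative sketch only, not the TF kernel.
#include <cuda_fp16.h>
#include <cstdio>

__global__ void SoftmaxFp16Fp32Accum(const __half* in, __half* out, int n) {
  // One thread handles one small row; a production kernel would reduce in parallel.
  if (threadIdx.x != 0 || blockIdx.x != 0) return;

  // Row max in fp32 for numerical stability.
  float max_val = __half2float(in[0]);
  for (int i = 1; i < n; ++i) {
    float v = __half2float(in[i]);
    if (v > max_val) max_val = v;
  }

  // Denominator accumulated in fp32: fp16 saturates near 65504 and loses
  // precision when many terms are summed.
  float denom = 0.0f;
  for (int i = 0; i < n; ++i) {
    denom += expf(__half2float(in[i]) - max_val);
  }

  // Only the final quotient is rounded back to fp16.
  for (int i = 0; i < n; ++i) {
    out[i] = __float2half(expf(__half2float(in[i]) - max_val) / denom);
  }
}

int main() {
  const int n = 8;
  __half h_in[n], h_out[n];
  for (int i = 0; i < n; ++i) h_in[i] = __float2half(static_cast<float>(i));

  __half *d_in, *d_out;
  cudaMalloc(&d_in, n * sizeof(__half));
  cudaMalloc(&d_out, n * sizeof(__half));
  cudaMemcpy(d_in, h_in, n * sizeof(__half), cudaMemcpyHostToDevice);

  SoftmaxFp16Fp32Accum<<<1, 1>>>(d_in, d_out, n);
  cudaMemcpy(h_out, d_out, n * sizeof(__half), cudaMemcpyDeviceToHost);

  for (int i = 0; i < n; ++i) printf("%g ", __half2float(h_out[i]));
  printf("\n");

  cudaFree(d_in);
  cudaFree(d_out);
  return 0;
}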