Introduce output fusion for (A dot B) * alpha
authorA. Unique TensorFlower <gardener@tensorflow.org>
Tue, 13 Mar 2018 15:49:17 +0000 (08:49 -0700)
committerTensorFlower Gardener <gardener@tensorflow.org>
Tue, 13 Mar 2018 15:59:46 +0000 (08:59 -0700)
commit975467c6579ecc91ea1ee53bd1a77392818b0fae
tree356ce46de7affdf0756d9c30349804e726d682b1
parentc0cc5f0efa227eb27022b8eeb6a137d550e71519
Introduce output fusion for (A dot B) * alpha

We allow now to fuse (A dot B) times a constant, which is available in cublas.

PiperOrigin-RevId: 188878210
tensorflow/compiler/xla/service/gpu/gemm_thunk.cc
tensorflow/compiler/xla/service/gpu/gemm_thunk.h
tensorflow/compiler/xla/service/gpu/instruction_fusion.cc
tensorflow/compiler/xla/service/gpu/instruction_fusion_test.cc
tensorflow/compiler/xla/service/gpu/ir_emission_utils.cc
tensorflow/compiler/xla/service/gpu/ir_emitter_unnested.cc