Amit Patankar [Mon, 30 Apr 2018 19:43:18 +0000 (12:43 -0700)]
Merge pull request #18972 from bstriner/fix_mklroot
Use MKLROOT
Peter Hawkins [Mon, 30 Apr 2018 19:37:00 +0000 (15:37 -0400)]
Merge pull request #18987 from eliben/update-xla-readme
Update xla README
Eli Bendersky [Mon, 30 Apr 2018 18:27:41 +0000 (11:27 -0700)]
Merge branch 'update-xla-readme' of github.com:eliben/tensorflow into update-xla-readme
Eli Bendersky [Mon, 30 Apr 2018 18:16:48 +0000 (11:16 -0700)]
Add XLA logo and beef up the README
Eli Bendersky [Mon, 30 Apr 2018 18:20:02 +0000 (11:20 -0700)]
Reorder
Eli Bendersky [Mon, 30 Apr 2018 18:19:25 +0000 (11:19 -0700)]
Center-align the logo image and set size
Eli Bendersky [Mon, 30 Apr 2018 18:16:48 +0000 (11:16 -0700)]
Add XLA logo and beef up the README
Shanqing Cai [Mon, 30 Apr 2018 15:36:21 +0000 (11:36 -0400)]
Merge pull request #18983 from caisq/branch_194768567
Branch
194768567
Amit Patankar [Mon, 30 Apr 2018 15:20:05 +0000 (08:20 -0700)]
Merge pull request #18882 from nfelt/tb-nightly-dep
Update tb-nightly dep to >= 1.9.0a0, < 1.10.0a0
Amit Patankar [Mon, 30 Apr 2018 15:19:05 +0000 (08:19 -0700)]
Merge pull request #18825 from saginadir/patch-1
Update README.md
Amit Patankar [Mon, 30 Apr 2018 15:15:04 +0000 (08:15 -0700)]
Merge pull request #18965 from rmanyari/feature_columns_fix
add missing equality
Dan Moldovan [Mon, 30 Apr 2018 15:04:23 +0000 (11:04 -0400)]
autograph: Update README (#18981)
* Update README.md
ManHyuk [Mon, 30 Apr 2018 15:01:30 +0000 (00:01 +0900)]
fix typo (#18957)
Shanqing Cai [Mon, 30 Apr 2018 14:00:24 +0000 (10:00 -0400)]
Merge commit for internal changes
A. Unique TensorFlower [Mon, 30 Apr 2018 11:21:09 +0000 (04:21 -0700)]
Cleaning up tracing code.
PiperOrigin-RevId:
194768567
Ben [Mon, 30 Apr 2018 05:38:48 +0000 (01:38 -0400)]
Use MKLROOT
Russell Power [Sun, 29 Apr 2018 22:37:12 +0000 (15:37 -0700)]
Keras: Supply `maximum_iterations` to the TF backend when possible.
PiperOrigin-RevId:
194723199
Russell Power [Sun, 29 Apr 2018 22:30:22 +0000 (15:30 -0700)]
Add support for a clean checkpoint and shutdown in response to a termination notice.
PiperOrigin-RevId:
194722985
rmanyari [Sun, 29 Apr 2018 21:23:08 +0000 (17:23 -0400)]
add missing equality
Sherry Moore [Sun, 29 Apr 2018 16:56:16 +0000 (09:56 -0700)]
Added del_hparam(), the counter part of add_hparam.
PiperOrigin-RevId:
194711291
Richard Wei [Sun, 29 Apr 2018 06:51:28 +0000 (23:51 -0700)]
Update the Swift for TensorFlow community page.
PiperOrigin-RevId:
194687897
Dimitris Vardoulakis [Sun, 29 Apr 2018 05:19:22 +0000 (22:19 -0700)]
[TF:XLA]
- Require a module config when creating an HloModule.
- All tests using HloTestBase create a module using CreateNewModule.
PiperOrigin-RevId:
194684585
A. Unique TensorFlower [Sun, 29 Apr 2018 02:47:42 +0000 (19:47 -0700)]
Internally rewrite RevBlock to use @custom_gradient
PiperOrigin-RevId:
194679657
Yong Tang [Sat, 28 Apr 2018 19:06:15 +0000 (12:06 -0700)]
Pass dtype to constructor in LSTMCell (#18178)
* Use float32 in case the dtype is not set in the constructor
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
* Add test case for 16228.
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
* Add test case where dype is passed explicitly.
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
* Fix pylint issue
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
* Replace strings to objects to address review feedback.
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
QingYing Chen [Sat, 28 Apr 2018 19:05:34 +0000 (03:05 +0800)]
Fix functions in CRF when sequence_lengths contains zero (#18487)
* Fix computation of crf_log_norm when sequence length is zero
* fix _single_seq_fn in crf when sequence_lengths contain zero
Rholais Lii [Sat, 28 Apr 2018 19:05:00 +0000 (03:05 +0800)]
Emphasis any `Estimator` (#18793)
Yong Tang [Sat, 28 Apr 2018 19:04:45 +0000 (12:04 -0700)]
Add uint32 and uint64 support with tf.train.batch (#18805)
* Add uint32 and uint64 support with tf.train.batch
This fix tries to address the issue raised in 18586
to have uint32 and uint64 support with tf.train.batch.
This fix add uint32 and uint64 to `CopyElementToSlice`
for the support.
This fix fixes 18586.
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
* Add test case for uint32 with tf.train.batch
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
* Add uint64 test case
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
Nehal J Wani [Sat, 28 Apr 2018 19:03:42 +0000 (14:03 -0500)]
Fix typo in CMakeLists.txt (#18833)
Stefan Schweter [Sat, 28 Apr 2018 19:03:15 +0000 (21:03 +0200)]
Fix link to original LSTM paper (#18876)
Asim Shankar [Sat, 28 Apr 2018 18:31:12 +0000 (11:31 -0700)]
Java: Release 1.8.0
PiperOrigin-RevId:
194663800
Brennan Saeta [Sat, 28 Apr 2018 17:51:32 +0000 (10:51 -0700)]
[tf.data] Use core::ScopedUnref to avoid resource leakage.
If for whatever reason iterator_resource->set_iterator did not return Status::OK(), we would leak a reference on the iterator_resource. With this change, we won't leak the resource.
PiperOrigin-RevId:
194662412
A. Unique TensorFlower [Sat, 28 Apr 2018 17:40:49 +0000 (10:40 -0700)]
Allow not specifying eval_spec when evaluation is not necessarily run.
PiperOrigin-RevId:
194661814
Mingsheng Hong [Sat, 28 Apr 2018 15:55:08 +0000 (08:55 -0700)]
This is Part 1 of Swift<->TF sends/recvs: support sending tensors from TF to
Swift via direct session.
The changes are:
1. Added an experimental TF C API TF_DequeueNamedTensor() to consume the queued
tensors from a dequeue op. One use case is for the Swift host program to consume
tensors sent by TF, where the queue is a Fifo queue managed by TF.
Enqueuing tensors are done by running an enqueue op in a graph. The queued
tensors are not persisted, and will be lost if the process/machine dies. The
queue has a bounded capacity, to prevent producer from being unboundedly ahead
of consumer.
while caller of TF_DequeueNamedTensor() could have run the Fifo dequeue op
directly, the extra level of indirection provided by this API allows us to more
easily switch the queuing impl to another mechanism. If and once we stabilize on
the Fifo queue based impl, we can remove this API.
2. Added a new S4TF runtime API _TFCReceiveTensorHandle() that receives a tensor
via TF_DequeueNamedTensor().
3. To support tensor receives in host program, taught PartitionCloner in
TFPartition to insert SIL code to call _TFCReceiveTensorHandle().
4. To support tensor sends in accelerator program, taught TFGraphLowering in
generate QueueEnqueueV2 nodes in the TF graphs, with appropriate control
dependence to make sure these nodes get executed.
a) The enqueue produces no output tensor, and is executed only for its side
effect. To ensure it is executed properly, control dependence is wired up. The
general design is: before a TF_Function (can be a top level function or the body
function of a while op) produces an output tensor OT, make OT control dependent
on the enqueue op, so that enqueue gets run before the function returns.
b) If tensor send occurs in a while loop body, the body logic currently gets
lowered in 3 places: the while op cond function, the while op body function, and
the ops at the same level as the while op itself (for running the last loop
iteration). In this case, the correct TFGraph lowering is to run the enqueue in
the last 2 out of the 3 places above.
After this CL, the dual versions of the above (dequeuing via an op, and
enqueuing via C API) will be added.
PiperOrigin-RevId:
194658511
Anna R [Sat, 28 Apr 2018 07:13:09 +0000 (00:13 -0700)]
Removing hidden_ops.txt file.
PiperOrigin-RevId:
194637892
A. Unique TensorFlower [Sat, 28 Apr 2018 06:35:42 +0000 (23:35 -0700)]
Fix kernel creation bug, due to constant folding always use CPU.
PiperOrigin-RevId:
194636076
Billy Lamberta [Sat, 28 Apr 2018 06:14:40 +0000 (23:14 -0700)]
Merge pull request #18944 from tensorflow/martinwicke-patch-3
Fix docs rendering in placeholder docs page.
Martin Wicke [Sat, 28 Apr 2018 06:04:58 +0000 (23:04 -0700)]
Fix docs rendering in placeholder docs page.
A. Unique TensorFlower [Sat, 28 Apr 2018 05:57:36 +0000 (22:57 -0700)]
Add test case on compiling dense layer node with XLA.
PiperOrigin-RevId:
194634563
Patrick Nguyen [Sat, 28 Apr 2018 04:58:17 +0000 (21:58 -0700)]
Properly export recurrent in contrib.
The following symbols are available:
- tf.contrib.recurrent.bidirectional_functional_rnn
- tf.contrib.recurrent.functional_rnn
- tf.contrib.recurrent.Recurrent
PiperOrigin-RevId:
194632138
Sanjoy Das [Sat, 28 Apr 2018 03:06:35 +0000 (20:06 -0700)]
HLO profiling for tfcompile.
This CL extends the --xla_hlo_profile knob to tfcompile. tf_library rules can
now set enable_xla_hlo_profiling to True to:
- Have the generated code update per-HLO profile counters as it executes.
- Have tfcompile generate and serialize an instance HloProfilePrinterData with
a compiled model that can be used to pretty-print the collected profile
counters.
PiperOrigin-RevId:
194627272
A. Unique TensorFlower [Sat, 28 Apr 2018 02:23:16 +0000 (19:23 -0700)]
Add internal uint b stats to TfOpStats.
PiperOrigin-RevId:
194625155
Sanjoy Das [Sat, 28 Apr 2018 01:41:27 +0000 (18:41 -0700)]
Split up ElementaIrEmitter::MakeElementGenerator into smaller functions; NFC
PiperOrigin-RevId:
194622198
A. Unique TensorFlower [Sat, 28 Apr 2018 01:24:57 +0000 (18:24 -0700)]
[XLA] Fix bug in ShapeUtil::StripDegenerateDimensions
PiperOrigin-RevId:
194621163
Brennan Saeta [Sat, 28 Apr 2018 00:08:57 +0000 (17:08 -0700)]
Make RetryingFileSystem a template.
PiperOrigin-RevId:
194614877
Sandeep N Gupta [Fri, 27 Apr 2018 23:38:24 +0000 (16:38 -0700)]
Revised roadmap (#18939)
Revised roadmap
Petros Mol [Fri, 27 Apr 2018 23:22:43 +0000 (16:22 -0700)]
Minor fix to SDCAOptimizer documentation.
PiperOrigin-RevId:
194609850
Nick Desaulniers [Fri, 27 Apr 2018 23:14:49 +0000 (16:14 -0700)]
[TF:XLA:INTERPRETER] implement bfloat16 comparisons
PiperOrigin-RevId:
194608854
A. Unique TensorFlower [Fri, 27 Apr 2018 22:28:01 +0000 (15:28 -0700)]
Fix broken ElementWiseFusionTest.
PiperOrigin-RevId:
194602336
Akshay Modi [Fri, 27 Apr 2018 21:47:12 +0000 (14:47 -0700)]
Minor eager service proto clarification.
PiperOrigin-RevId:
194596337
Sanjoy Das [Fri, 27 Apr 2018 21:36:24 +0000 (14:36 -0700)]
[XLA:CPU] Implement fusion for the Gather HLO
PiperOrigin-RevId:
194594759
A. Unique TensorFlower [Fri, 27 Apr 2018 21:28:12 +0000 (14:28 -0700)]
If two identical functions are given different grad func,
they should be named differently. Otherwise, tf.gradients
gets confused.
PiperOrigin-RevId:
194593519
A. Unique TensorFlower [Fri, 27 Apr 2018 21:05:46 +0000 (14:05 -0700)]
Remove whitespaces from tags in saved_model_cli.
This currently causes tags mismatch because a leading whitespace is added
within the saved_model_cli when doing ', '.join(tag_set).
PiperOrigin-RevId:
194590154
Sanjoy Das [Fri, 27 Apr 2018 20:55:35 +0000 (13:55 -0700)]
[TF:XLA] Bump open source llvm revision to r330950
PiperOrigin-RevId:
194588403
gracehoney [Fri, 27 Apr 2018 20:07:16 +0000 (13:07 -0700)]
Roll forward the custom optimizers change (#18742)
Sourabh Bajaj [Fri, 27 Apr 2018 20:01:44 +0000 (13:01 -0700)]
Remove scope name from bfloat16
PiperOrigin-RevId:
194580957
Asim Shankar [Fri, 27 Apr 2018 19:59:20 +0000 (12:59 -0700)]
eager: Improve error message when GradientTape is used incorrectly.
PiperOrigin-RevId:
194580654
A. Unique TensorFlower [Fri, 27 Apr 2018 19:46:46 +0000 (12:46 -0700)]
Replace GrapplerFunctionItem input with a constant.
PiperOrigin-RevId:
194579253
Billy Lamberta [Fri, 27 Apr 2018 19:37:50 +0000 (12:37 -0700)]
Merge pull request #18937 from lamberta/fix-typo
Docs: fix typo
drpngx [Fri, 27 Apr 2018 19:15:59 +0000 (13:15 -0600)]
Merge pull request #18935 from drpngx/branch_194551042
Branch
194551042
Billy Lamberta [Fri, 27 Apr 2018 18:47:42 +0000 (11:47 -0700)]
Docs: fix typo
A. Unique TensorFlower [Fri, 27 Apr 2018 18:41:21 +0000 (11:41 -0700)]
Internally rewrite @recompute_grad to use @custom_gradient
PiperOrigin-RevId:
194571125
gracehoney [Fri, 27 Apr 2018 17:58:43 +0000 (10:58 -0700)]
Add DeviceSet to Cluster (#18838)
* Add DeviceSet to Cluster so we can access memory allocators during grappler optimizations
* Fix review comments
* Add missing dependency to :virtual_cluster and fix clang format.
Patrick Nguyen [Fri, 27 Apr 2018 17:28:50 +0000 (10:28 -0700)]
Fix merge conflict manual merge error.
Patrick Nguyen [Fri, 27 Apr 2018 17:23:02 +0000 (10:23 -0700)]
Merge commit for internal changes
andrehentz [Fri, 27 Apr 2018 17:06:03 +0000 (10:06 -0700)]
Merge pull request #18727 from freedomtan/label_image_tflite_profiling_new
[tflite] add profiling to label_image for tflite
A. Unique TensorFlower [Fri, 27 Apr 2018 16:25:52 +0000 (09:25 -0700)]
Added string conversion operator to tensorflow::StringPiece.
Marked ToString method as deprecated.
This will allow tensorflow::StringPiece to be replaced with absl::string_view (once the deprecated method is removed) as absl::string_view does not contain the ToString method.
PiperOrigin-RevId:
194551042
A. Unique TensorFlower [Fri, 27 Apr 2018 14:21:37 +0000 (07:21 -0700)]
Fix bug in @custom_gradient in Eager mode with numpy inputs
PiperOrigin-RevId:
194538828
Sanjoy Das [Fri, 27 Apr 2018 12:31:38 +0000 (05:31 -0700)]
[TF:XLA] Bump open source llvm revision to r330926
PiperOrigin-RevId:
194530610
Yu-Cheng Ling [Fri, 27 Apr 2018 07:07:07 +0000 (00:07 -0700)]
Automated g4 rollback of changelist
194306629
PiperOrigin-RevId:
194507274
A. Unique TensorFlower [Fri, 27 Apr 2018 05:06:36 +0000 (22:06 -0700)]
Reenable factorization_ops_test on ASAN after adding shard_count = 4. Tests now
finish with these stats: "max = 150.6s, min = 27.4s, avg = 66.3s, dev = 19.5s" over 1000 runs and this runtime distribution should be fairly safe for deadline of 300s.
PiperOrigin-RevId:
194500204
A. Unique TensorFlower [Fri, 27 Apr 2018 02:35:10 +0000 (19:35 -0700)]
Implement floor operator
PiperOrigin-RevId:
194490433
A. Unique TensorFlower [Fri, 27 Apr 2018 01:24:48 +0000 (18:24 -0700)]
Automated g4 rollback of changelist
194442428
PiperOrigin-RevId:
194485227
A. Unique TensorFlower [Fri, 27 Apr 2018 01:03:50 +0000 (18:03 -0700)]
Free scratch memory in ~BaseGPUDevice.
PiperOrigin-RevId:
194483351
A. Unique TensorFlower [Fri, 27 Apr 2018 00:56:08 +0000 (17:56 -0700)]
Implements linear no-offset (aka symmetric) quantizer.
PiperOrigin-RevId:
194482547
Allen Lavoie [Thu, 26 Apr 2018 23:40:16 +0000 (16:40 -0700)]
Split out SaveableObjects into their own file
Pulls a couple build rules out of tensorflow/python:training. I'd like to use a SaveableObject in :checkpointable (for saving some Python state by default), which means the file with SaveableObject has to be essientially dependency-free.
PiperOrigin-RevId:
194473987
Malcolm Reynolds [Thu, 26 Apr 2018 23:24:51 +0000 (16:24 -0700)]
More informative error message when loading a graph_def which uses unknown ops.
Fixes #17014
PiperOrigin-RevId:
194472083
Shashi Shekhar [Thu, 26 Apr 2018 23:11:11 +0000 (16:11 -0700)]
Fix some flakiness in test.
PiperOrigin-RevId:
194470125
A. Unique TensorFlower [Thu, 26 Apr 2018 23:01:00 +0000 (16:01 -0700)]
Internal change.
PiperOrigin-RevId:
194468535
ekelsen [Thu, 26 Apr 2018 22:49:48 +0000 (15:49 -0700)]
Merge pull request #18846 from yongtang/
04252018-FloorDiv-int8
Enable int8 support for FloorDiv
Raghuraman Krishnamoorthi [Thu, 26 Apr 2018 22:40:15 +0000 (15:40 -0700)]
Handle variations in scoping of batch norms for correct unfused batch norm folding.
PiperOrigin-RevId:
194465704
Sanjoy Das [Thu, 26 Apr 2018 22:33:38 +0000 (15:33 -0700)]
Remove the inter-op thread pool
Forgot about this in cl/
194299356. However, when I checked cl/
194299356, I
found that we actually (incorrectly?) used the *intra* op thread pool in the
parallel CPU executable? Does that mean the inter op thread pool was always
unused?
PiperOrigin-RevId:
194464734
ekelsen [Thu, 26 Apr 2018 22:32:44 +0000 (15:32 -0700)]
Merge pull request #18881 from ManHyuk/fix_typo
fix typo
ekelsen [Thu, 26 Apr 2018 22:30:35 +0000 (15:30 -0700)]
Merge pull request #18907 from yongtang/18363-mpi
Fix build error with MPI support
Shivani Agrawal [Thu, 26 Apr 2018 22:24:44 +0000 (15:24 -0700)]
[tf.data] Adds support for adding scalar value to `StatsAggregator`.
PiperOrigin-RevId:
194463407
ekelsen [Thu, 26 Apr 2018 22:27:08 +0000 (15:27 -0700)]
Merge pull request #18896 from KikaTech/fix_lite_topk
Fix some issues in official tf.nn.topk() in lite
Shivani Agrawal [Thu, 26 Apr 2018 22:15:37 +0000 (15:15 -0700)]
[tf.data] Changes description for `bytes_produced_stats` and `latency_stats` in accordance with the breaking changes in cl/
193432590.
PiperOrigin-RevId:
194461964
James Martens [Thu, 26 Apr 2018 22:13:48 +0000 (15:13 -0700)]
- Default values of cov and inv variables are now 0. Zero-debiasing (as in Adam) is used for the cov matrices. Note this this requires that cov variables, then inv variables, are all updated before the first training update is made. All examples have been modified to do this. NOTE: you *may* have to increase the damping value you use at the start of optimization after this change (or throughout, if you are using a constant value).
- Changed the initial default approximation used for generic registrations to "diagonal"
- Convenience properties for ops and thunks have all been removed, along with "make_ops_and_vars". User should only interface with "make_vars_and_create_op_thunks" (or maybe "create_ops_and_vars_thunks").
PiperOrigin-RevId:
194461623
A. Unique TensorFlower [Thu, 26 Apr 2018 22:08:24 +0000 (15:08 -0700)]
Add support for variables in tf.custom_gradient
PiperOrigin-RevId:
194460752
A. Unique TensorFlower [Thu, 26 Apr 2018 22:07:43 +0000 (15:07 -0700)]
Disable triangular_solve_test on ASAN due to flaky time outs.
PiperOrigin-RevId:
194460641
Daniel Zheng [Thu, 26 Apr 2018 22:02:53 +0000 (15:02 -0700)]
Edit tensorflow.org/community/swift page.
PiperOrigin-RevId:
194459754
A. Unique TensorFlower [Thu, 26 Apr 2018 21:59:29 +0000 (14:59 -0700)]
Adds optimization to convert division of sqrt to multiplication of rsqrt
PiperOrigin-RevId:
194459152
A. Unique TensorFlower [Thu, 26 Apr 2018 21:53:19 +0000 (14:53 -0700)]
Disable densenet_test on MSAN due to flaky time outs.
PiperOrigin-RevId:
194458270
Martin Wicke [Thu, 26 Apr 2018 21:12:43 +0000 (14:12 -0700)]
Merge branch 'master' into label_image_tflite_profiling_new
A. Unique TensorFlower [Thu, 26 Apr 2018 20:53:17 +0000 (13:53 -0700)]
Remove unnecessary TF_NEED_GCP from build scripts.
PiperOrigin-RevId:
194448612
Yong Tang [Thu, 26 Apr 2018 20:47:51 +0000 (13:47 -0700)]
Fix cmake build issues with GPU on Linux (#18775)
* Update include file for cmake
While attempting to build tensorflow with cmake for gpu the following
build error happens:
```
/home/ubuntu/tensorflow/tensorflow/core/platform/default/gpu/cupti_wrapper.h:26:45: fatal error: cuda/extras/CUPTI/include/cupti.h: No such file or directory
compilation terminated.
CMakeFiles/tf_core_cpu.dir/build.make:3302: recipe for target 'CMakeFiles/tf_core_cpu.dir/home/ubuntu/tensorflow/tensorflow/core/platform/default/gpu/cupti_wrapper.cc.o' failed
make[2]: *** [CMakeFiles/tf_core_cpu.dir/home/ubuntu/tensorflow/tensorflow/core/platform/default/gpu/cupti_wrapper.cc.o] Error 1
CMakeFiles/Makefile2:2402: recipe for target 'CMakeFiles/tf_core_cpu.dir/all' failed
make[1]: *** [CMakeFiles/tf_core_cpu.dir/all] Error 2
Makefile:127: recipe for target 'all' failed
make: *** [all] Error 2
```
This fix tries to fix the build error above for cmake.
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
* Add xla dependency to stream_executor
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
* Remove nccl files temporarily
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
* Add gomp library to libcusolver.so
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
* Fix cmake issue
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
Martin Wicke [Thu, 26 Apr 2018 20:45:56 +0000 (13:45 -0700)]
Merge pull request #17602 from joeyearsley/patch-1
Update fold_old_batch_norms.cc to accommodate 'NCHW' format.
A. Unique TensorFlower [Thu, 26 Apr 2018 20:35:35 +0000 (13:35 -0700)]
Adding some slightly more exhaustive strided_slice test parameters.
PiperOrigin-RevId:
194446000
Tony Wang [Thu, 26 Apr 2018 20:30:15 +0000 (13:30 -0700)]
[TF:XLA] Add INTEL MKL_DNN Conv2d method to XLA/CPU backend
The INTEL MKL_DNN provides 32-bit Conv2d method. With INTEL_MKL flag set,
XLA backend emits runtime call to MKL_DNN Conv2d instead of Eigen.
PiperOrigin-RevId:
194445212
Benoit Steiner [Thu, 26 Apr 2018 20:19:39 +0000 (13:19 -0700)]
Run 2 passes of rewrites by default
PiperOrigin-RevId:
194443770
Yong Tang [Thu, 26 Apr 2018 20:21:18 +0000 (20:21 +0000)]
Fix build error with MPI support
This fix tries to fix the issue raised in 18363 where
the bazel build with MPI support fails as a header is missing
in the include.
This fix fixes the issue. The fix is verified locally
with MPI+CUDA on Ubuntu 16.04.
This fix fixes 18363.
Signed-off-by: Yong Tang <yong.tang.github@outlook.com>