gracehoney [Tue, 6 Mar 2018 22:59:33 +0000 (14:59 -0800)]
Revert the changes of ScopedActivateExecutorContext, which requires depending on core:lib which is forbidden
gracehoney [Tue, 6 Mar 2018 21:54:33 +0000 (13:54 -0800)]
Fix std::string and unused Status problems
gracehoney [Tue, 6 Mar 2018 21:20:36 +0000 (13:20 -0800)]
Fix the include for cuda_runtime_api.h
Sami Kama [Tue, 6 Mar 2018 00:06:06 +0000 (16:06 -0800)]
Fix enum ints
jjsjann123 [Mon, 5 Mar 2018 23:54:01 +0000 (15:54 -0800)]
removing unused variables
Sami Kama [Mon, 5 Mar 2018 23:10:03 +0000 (15:10 -0800)]
More variable renaming
Sami Kama [Mon, 5 Mar 2018 22:56:20 +0000 (14:56 -0800)]
Change variable naming
Jie [Mon, 5 Mar 2018 22:22:31 +0000 (14:22 -0800)]
remove cudaSetDevice with tensorflow ScopedActivateExecutorContext
Sami Kama [Mon, 5 Mar 2018 22:15:18 +0000 (14:15 -0800)]
Remove debug messages and fix logger
Sami Kama [Mon, 5 Mar 2018 20:37:55 +0000 (12:37 -0800)]
Fixes for PR comments
Sami Kama [Fri, 2 Mar 2018 23:45:25 +0000 (15:45 -0800)]
Int8, FP16 and new ops support
Sami Kama [Fri, 2 Mar 2018 22:38:08 +0000 (14:38 -0800)]
Change consts
Sami Kama [Fri, 2 Mar 2018 21:18:22 +0000 (13:18 -0800)]
Remove debug from config
Sami Kama [Fri, 2 Mar 2018 20:59:52 +0000 (12:59 -0800)]
Merge branch 'tftrt_int8' of github.com:NVIDIA/tftrt into tftrt_int8
Sami Kama [Fri, 2 Mar 2018 20:59:28 +0000 (12:59 -0800)]
Fixes for PR
Jie [Fri, 2 Mar 2018 20:56:58 +0000 (12:56 -0800)]
[removing converter type check]
removing type check, since fp16 conversion will break the type consistency
between TF & TRT
More type check should be removed for now
(and add back once TRT fp16 is fixed)
Sami Kama [Fri, 2 Mar 2018 17:20:17 +0000 (09:20 -0800)]
Uncomment google preprocessor conditionals
Sami Kama [Fri, 2 Mar 2018 05:39:22 +0000 (21:39 -0800)]
Merging upstream
Sami Kama [Fri, 2 Mar 2018 03:55:18 +0000 (19:55 -0800)]
Merge with upstream
Sami Kama [Fri, 2 Mar 2018 03:06:52 +0000 (19:06 -0800)]
Update testing script and README.md
Yifei Feng [Fri, 2 Mar 2018 02:18:30 +0000 (18:18 -0800)]
Merge pull request #17365 from imsheridan/fix_error_link
Fix the error activation function link in custom_estimators.md
Yifei Feng [Fri, 2 Mar 2018 02:17:57 +0000 (18:17 -0800)]
Merge pull request #17335 from giuscri/fix-doc-error
Fix markdown error in documentation.
Sami Kama [Fri, 2 Mar 2018 01:20:54 +0000 (17:20 -0800)]
Scaffolding for int8 calibration in TF-TRT (#17309)
* Scaffolding for int8 calibration
* Add ops/trt_calib_op.cc
* Rename files and replace std::string with string
* Line lengths, variable names, conditionals in BUILD
* mode variable renaming
* More fixes for review
* Run clang-format
* Fix the build failue and replace the macro with a function
* Add TODO(aaroey) for future PRs
* Fix namespace for internal build
* Fix mismatched argument name and unused includes to make internal build happy
* Fix order of dependencies in BUILD file
* Remove dangling #undef
Sami Kama [Fri, 2 Mar 2018 01:03:56 +0000 (17:03 -0800)]
Make segmentation option configurable
Fred Reiss [Thu, 1 Mar 2018 21:25:21 +0000 (13:25 -0800)]
Clean up output formatting of saved_model_cli.py (#17235)
Tatiana Shpeisman [Thu, 1 Mar 2018 20:19:37 +0000 (12:19 -0800)]
Merge pull request #17004 from Intel-tensorflow/relu_bn_fix2
MKL: cifar 10 divergance fix and batchnorm unit test fix
Guozhong Zhuang [Thu, 1 Mar 2018 18:29:11 +0000 (10:29 -0800)]
Fix improper comments such as tf --> TensorFlow
imsheridan [Thu, 1 Mar 2018 17:58:06 +0000 (01:58 +0800)]
Fix the error activation function link in custom_estimators
ImSheridan [Thu, 1 Mar 2018 16:34:36 +0000 (00:34 +0800)]
Fix some minor typos in get started docs to keep consistent (#17357)
4d55397500 [Thu, 1 Mar 2018 01:57:35 +0000 (17:57 -0800)]
Fix return value in sampled_softmax_loss
Giuseppe [Wed, 28 Feb 2018 22:32:19 +0000 (23:32 +0100)]
Fix markdown error in documentation.
Newline in the middle of links
was preventing their rendering.
Loo Rong Jie [Wed, 28 Feb 2018 20:38:30 +0000 (04:38 +0800)]
[Windows] Copy NominalCPUFrequency from Abseil (#16905)
* [Windows] Copy NominalCPUFrequency from Abseil
* Add #include <shlwapi.h>
MandarJKulkarni [Wed, 28 Feb 2018 18:35:09 +0000 (00:05 +0530)]
Fix typos in profiler.h (#16938)
vihanjain [Wed, 28 Feb 2018 00:05:26 +0000 (16:05 -0800)]
Pull request for fixing warm-starting device placement (#17312)
* Update checkpoint_utils.py
Fix device allocation bug for warm-starting op
* Update checkpoint_utils_test.py
Fix test
Sami Kama [Tue, 27 Feb 2018 23:17:39 +0000 (15:17 -0800)]
Set oplib visibility to public
Gunhan Gulsoy [Tue, 27 Feb 2018 18:19:08 +0000 (10:19 -0800)]
Bump the version of CUB in cmake build.
A. Unique TensorFlower [Tue, 27 Feb 2018 14:00:21 +0000 (06:00 -0800)]
Enable dynamic function calls. These are compiled just in time by inserting a call to compile.
PiperOrigin-RevId:
187165096
A. Unique TensorFlower [Tue, 27 Feb 2018 09:02:36 +0000 (01:02 -0800)]
Add documentation to Grappler RewriterConfig to give a short description for each
of the optimizer on what they do.
PiperOrigin-RevId:
187143156
A. Unique TensorFlower [Tue, 27 Feb 2018 05:25:22 +0000 (21:25 -0800)]
Improve error handling in strided_slice_op to fail more gracefully and return an error status instead of crashing.
PiperOrigin-RevId:
187126888
A. Unique TensorFlower [Tue, 27 Feb 2018 05:09:30 +0000 (21:09 -0800)]
Automated g4 rollback of changelist
187092622
PiperOrigin-RevId:
187125995
Brennan Saeta [Tue, 27 Feb 2018 04:21:07 +0000 (20:21 -0800)]
Add helpers to stream data from the GCE VM to a Cloud TPU.
PiperOrigin-RevId:
187122870
Skye Wanderman-Milne [Tue, 27 Feb 2018 03:58:18 +0000 (19:58 -0800)]
Fix bug in deserializing CondContexts.
PiperOrigin-RevId:
187121244
Francois Chollet [Tue, 27 Feb 2018 03:57:42 +0000 (19:57 -0800)]
Fixes and simplification in the Keras training engine.
- Explicitly disallow sample/class weighting in eager (it was never supported)
- Remove tests for it (which were actually ignoring sample/class weights)
- Make sample weight placeholders placeholder_with_default, and do not create all-ones numpy arrays to feed them when no sample weights are provided (this might lead to better performance)
PiperOrigin-RevId:
187121215
A. Unique TensorFlower [Tue, 27 Feb 2018 03:46:27 +0000 (19:46 -0800)]
Make sure rounding and handling of denormals in Grappler is the same as in TensorFlow.
Enable constant folding for more types, particularly on GPUs.
PiperOrigin-RevId:
187120456
A. Unique TensorFlower [Tue, 27 Feb 2018 02:05:59 +0000 (18:05 -0800)]
Remove old implementation of the adaptive shared batcher, the in flight batches implemntation delivers similar performance but is simpler and requires less tuning.
PiperOrigin-RevId:
187111685
Suharsh Sivakumar [Tue, 27 Feb 2018 02:04:55 +0000 (18:04 -0800)]
Modify retrain script to output TFLite compatible quantized models.
-Also fix flaky input name selection introduced by last PR.
-Also rely on tf.contrib.quantize to do graph transformations.
-Also, update retrain script to use new float mobilenet_v1 and quantized mobilenet_v1 models.
PiperOrigin-RevId:
187111533
Brennan Saeta [Tue, 27 Feb 2018 01:56:15 +0000 (17:56 -0800)]
Support configurable stats publishers in the grpc server.
PiperOrigin-RevId:
187110497
Kay Zhu [Tue, 27 Feb 2018 01:55:31 +0000 (17:55 -0800)]
[XLA::Interpreter] Add support for kConditional to HloEvaluator. Also enable
xla/tests/conditional_tests to run on interpreter.
PiperOrigin-RevId:
187110438
A. Unique TensorFlower [Tue, 27 Feb 2018 01:27:20 +0000 (17:27 -0800)]
Fix buffer assignment for conditional instruction.
PiperOrigin-RevId:
187107432
A. Unique TensorFlower [Tue, 27 Feb 2018 01:04:09 +0000 (17:04 -0800)]
Consolidate the builtin function overrides into a single module, and use a generic `dynamic_builtin` function to dispatch between implementations. Use the generic dispatcher in the generated code.
PiperOrigin-RevId:
187104685
Benoit Steiner [Tue, 27 Feb 2018 01:01:24 +0000 (17:01 -0800)]
Only link the swapping code when compiling TensorFlow with CUDA support.
PiperOrigin-RevId:
187104273
Kay Zhu [Tue, 27 Feb 2018 00:24:54 +0000 (16:24 -0800)]
[XLA] In HloEvaluator, fix an issue for HandleAbs to handle complex numbers
more correctly:
- abs([complex numbers]) would yield floats. However since the specilization for
HandleAbs is based on the return type (float), we'd CHECK fail due to float !=
complex when accessing the elements of the operand (complex).
- enable unary_op_test for interpreter.
PiperOrigin-RevId:
187099576
Martin Wicke [Tue, 27 Feb 2018 00:23:46 +0000 (16:23 -0800)]
Deprecate tf.contrib.learn.
RELNOTES: Deprecated tf.contrib.learn. Please check contrib/learn/README.md for instructions on how to convert existing code.
PiperOrigin-RevId:
187099439
A. Unique TensorFlower [Tue, 27 Feb 2018 00:01:04 +0000 (16:01 -0800)]
[XLA] Add more supported dtypes to the local Python client.
PiperOrigin-RevId:
187096144
Bixia Zheng [Mon, 26 Feb 2018 23:42:52 +0000 (15:42 -0800)]
[XLA] Fix #17090 a problem in IrArray::Index::SourceIndexOfTranspose.
Agebraic simplification transforms bitcast-equivalent transpose/reshape
instructions to bitcast instructions before IR emission. As such, we should
skip the checking on whether a transpose/reshape instruction is
bitcast-equivalent or not during IR emission. Remove the call from
IrArray::Index::SourceIndexOfTranspose to ShapeUtil::TransposeIsBitcast. Also
remove the call from IrArray::Index::SourceIndexOfReshape to
ShapeUtil::ReshapeIsBitcast.
Remove the calls to ShapeUtil::TransposeIsBitcast and
ShapeUtil::ReshapeIsBitcast from NotWorthHoistingIndividually
because layout assignment hasn't been done there yet. Instead, returns true
when the input is a transpose or reshape instruction, to prevent it from
being hoisted out of loops.
Add a check to ShapeUtil::TransposeIsBitcast and ShapeUtil::ReshapeIsBitcast
to make sure that both input shape and output shape have layouts.
Add two test cases.
PiperOrigin-RevId:
187093399
Alexandre Passos [Mon, 26 Feb 2018 23:37:40 +0000 (15:37 -0800)]
Uses a thread pool for graph functions in eager mode with inter_op_parallelism_threads.
PiperOrigin-RevId:
187092622
Kay Zhu [Mon, 26 Feb 2018 23:37:27 +0000 (15:37 -0800)]
[XLA::Interpreter] Add support for kCall to HloEvaluator. Also enable
xla/tests/call_test to run on interpreter.
PiperOrigin-RevId:
187092587
Skye Wanderman-Milne [Mon, 26 Feb 2018 22:38:31 +0000 (14:38 -0800)]
Enable de/serialization of nested control flow.
This is a follow-up to the previous commit
(https://github.com/tensorflow/tensorflow/commit/
23851760b7b099214bdd4f1b88156d7ac2bdd2a2).
It adds the new proto schemas, enables the behavior for reading and
writing the new protos, and adds a test for de/serializing nested
while loops.
There's still a bug preventing deserializing conds, which will be addressed
in another change.
PiperOrigin-RevId:
187082713
Sanjoy Das [Mon, 26 Feb 2018 22:32:08 +0000 (14:32 -0800)]
Track DebugOptions in AotCompilationOptions
In particular, I need this for supporting HLO profiling in the AOT backend.
PiperOrigin-RevId:
187081674
Sanjoy Das [Mon, 26 Feb 2018 22:31:29 +0000 (14:31 -0800)]
[TF:XLA] Bump open source llvm revision to r326083
PiperOrigin-RevId:
187081592
A. Unique TensorFlower [Mon, 26 Feb 2018 22:25:37 +0000 (14:25 -0800)]
Add a function that allows to dynamically verify whether a function is white listed for graph mode.
PiperOrigin-RevId:
187080654
A. Unique TensorFlower [Mon, 26 Feb 2018 22:25:30 +0000 (14:25 -0800)]
1st version of sequential feature columns.
PiperOrigin-RevId:
187080635
Michael Kuperstein [Mon, 26 Feb 2018 22:19:56 +0000 (14:19 -0800)]
[XLA] GTE of a certain element of the tuple does not need not keep other elements alive.
This achieves two things:
1. Heap simulation runtime is no longer quadratic in the number of tuple elements (as we don't add each GetTupleElement to the liveset of each buffer defined by the tuple).
2. A reduction in the heap memory footprint.
PiperOrigin-RevId:
187079787
Shivani Agrawal [Mon, 26 Feb 2018 22:11:08 +0000 (14:11 -0800)]
Adding documentation for dataset/iterator checkpointing.
PiperOrigin-RevId:
187078347
Allen Lavoie [Mon, 26 Feb 2018 22:00:07 +0000 (14:00 -0800)]
TFTS: Switch to using core feature columns
This fixes some shape issues that came up when using the tf.contrib.layers
parsing functions.
Adds a string -> embedding column API example to the LSTM example.
PiperOrigin-RevId:
187076400
Akshay Agrawal [Mon, 26 Feb 2018 21:54:02 +0000 (13:54 -0800)]
Update eager uniform replay buffer microbenchmarks to compare against graph functions when possible.
PiperOrigin-RevId:
187075418
Richard Wei [Mon, 26 Feb 2018 21:06:59 +0000 (13:06 -0800)]
Include c_api_experimental in libtensorflow.so's dependencies.
PiperOrigin-RevId:
187068103
A. Unique TensorFlower [Mon, 26 Feb 2018 20:33:17 +0000 (12:33 -0800)]
Maintain a cache of output dtypes of ops in TFE_Context.
PiperOrigin-RevId:
187062992
Jeremy Lau [Mon, 26 Feb 2018 20:23:36 +0000 (12:23 -0800)]
Internal change.
PiperOrigin-RevId:
187061863
Alexandre Passos [Mon, 26 Feb 2018 20:10:01 +0000 (12:10 -0800)]
Fix bug calling gradients_function inside custom_gradient
PiperOrigin-RevId:
187059871
Benoit Steiner [Mon, 26 Feb 2018 19:57:30 +0000 (11:57 -0800)]
Use optimized ops to handle GPU memory swapping: this avoids the need for 2
pairs of extra _send/_recv nodes which speeds things up a bit. This also
ensures that performance doesn't depend on the recv scheduling built in TF,
which isn't always optimal.
PiperOrigin-RevId:
187057831
Anna R [Mon, 26 Feb 2018 19:52:26 +0000 (11:52 -0800)]
Internal change.
PiperOrigin-RevId:
187056963
A. Unique TensorFlower [Mon, 26 Feb 2018 19:50:49 +0000 (11:50 -0800)]
Add the internal module name prefix to the white list.
PiperOrigin-RevId:
187056701
Skye Wanderman-Milne [Mon, 26 Feb 2018 19:43:14 +0000 (11:43 -0800)]
Actually expose smart_cond and smart_constant_value in tf.contrib.framework
Also moves these methods into their own file in python/framework. This avoids further bloating control_flow_ops.py and makes the BUILD deps easier for a future change I'm working on.
PiperOrigin-RevId:
187055501
Yu-Cheng Ling [Mon, 26 Feb 2018 19:22:43 +0000 (11:22 -0800)]
TFLite: Ensures pointers to tensors won't be invalidated unless 16+ tensors are added.
PiperOrigin-RevId:
187052100
Mingsheng Hong [Mon, 26 Feb 2018 19:13:09 +0000 (11:13 -0800)]
Added const to Node* in various parts of the code base.
PiperOrigin-RevId:
187050526
Sanjoy Das [Mon, 26 Feb 2018 19:12:04 +0000 (11:12 -0800)]
Add support for parsing the "gather" HLO
PiperOrigin-RevId:
187050345
Asim Shankar [Mon, 26 Feb 2018 19:10:20 +0000 (11:10 -0800)]
eager/examples/resnet50: Fix breakage.
PiperOrigin-RevId:
187050075
A. Unique TensorFlower [Mon, 26 Feb 2018 19:08:54 +0000 (11:08 -0800)]
Bring in `isbuiltin`.
PiperOrigin-RevId:
187049824
Guangda Lai [Mon, 26 Feb 2018 18:59:54 +0000 (10:59 -0800)]
Automated g4 rollback of changelist
185324160
PiperOrigin-RevId:
187048135
Brennan Saeta [Mon, 26 Feb 2018 18:54:31 +0000 (10:54 -0800)]
Integrate ClusterResolvers with TPUEstimator.
PiperOrigin-RevId:
187047094
Michael Kuperstein [Mon, 26 Feb 2018 18:52:05 +0000 (10:52 -0800)]
[XLA] Do not recompute flattened sets inside layout assignment.
Cache the flattened sets instead of recomputing them. This matters for large graphs, since we may request the flattened set thousands of times on the same instruction, and it may be fairly expensive to construct for large tuples.
PiperOrigin-RevId:
187046642
Yuanzhong Xu [Mon, 26 Feb 2018 18:42:59 +0000 (10:42 -0800)]
[XLA] Add kConvert to EffectiveOperandPrecisionIsOutputPrecision list.
PiperOrigin-RevId:
187044921
Mark Daoust [Mon, 26 Feb 2018 18:41:44 +0000 (10:41 -0800)]
Deleting references to outdated `translate/seq2seq` tutorial.
PiperOrigin-RevId:
187044697
Alexandre Passos [Mon, 26 Feb 2018 18:24:56 +0000 (10:24 -0800)]
Move accumulate_n_v2 to core.
PiperOrigin-RevId:
187042001
A. Unique TensorFlower [Mon, 26 Feb 2018 18:24:08 +0000 (10:24 -0800)]
Arithemtic optimization: Rewite Sub(0, y) => Neg(y)
PiperOrigin-RevId:
187041872
Sanjoy Das [Mon, 26 Feb 2018 18:17:15 +0000 (10:17 -0800)]
Generalize the gather_indices dimension that stores indices
This is now exposed as a index_vector_dim dimension number.
Also fixed an off-by-one error in ValidateGatherDimensionNumbers in the
expression computing output_shape_rank.
PiperOrigin-RevId:
187040748
Yun Peng [Tue, 27 Feb 2018 06:36:01 +0000 (07:36 +0100)]
Fix some breakages in TensorFlow Windows build (#17271)
* Fix configure.py
* Add quantization_utils for building quantize_weights, quantize_nodes, round_weights
Caused by https://github.com/tensorflow/tensorflow/pull/16121
Nick Kreeger [Tue, 27 Feb 2018 05:11:36 +0000 (21:11 -0800)]
Upgrade Jenkins/Docker build scripts to Bazel 0.11.0. (#17280)
The 0.10.0 bazel has problems with static-linking on linux:
https://github.com/bazelbuild/bazel/issues/4474. This PR bumps to the
latest bazel that produces proper binaries w/o the linking issue.
Martin Wicke [Tue, 27 Feb 2018 02:32:36 +0000 (18:32 -0800)]
Fix bad wrong jpeg/nasm mirror (#17277)
Sami Kama [Tue, 27 Feb 2018 01:38:21 +0000 (17:38 -0800)]
Local FP16 conversion to workaround TRT issue
Daniel Trebbien [Mon, 26 Feb 2018 21:42:07 +0000 (13:42 -0800)]
Add missing `override' (#17098)
This fixes a warning produced by clang:
./tensorflow/core/common_runtime/gpu/gpu_device.h:70:10: warning: 'FillContextMap' overrides a member function but is not marked 'override' [-Winconsistent-missing-override]
Status FillContextMap(const Graph* graph,
^
./tensorflow/core/common_runtime/device.h:124:18: note: overridden virtual function is here
virtual Status FillContextMap(const Graph* graph,
Appledore [Mon, 26 Feb 2018 19:31:28 +0000 (11:31 -0800)]
Merge pull request #17222 from kbsriram/pool-gradients
C++ gradients for MaxPool3D, AvgPool and AvgPool3D
Gunhan Gulsoy [Mon, 26 Feb 2018 19:16:30 +0000 (11:16 -0800)]
Merge pull request #17279 from gunan/branch_187038889
Branch
187038889
Gunhan Gulsoy [Mon, 26 Feb 2018 18:20:02 +0000 (10:20 -0800)]
Merge commit for internal changes
A. Unique TensorFlower [Mon, 26 Feb 2018 18:07:09 +0000 (10:07 -0800)]
Fix pip install examples to match text: Use pip and point to Py2 packages
PiperOrigin-RevId:
187038889
Rui Zhao [Mon, 26 Feb 2018 17:32:47 +0000 (09:32 -0800)]
Fix print format error.
PiperOrigin-RevId:
187033623
Justin Lebar [Mon, 26 Feb 2018 17:24:38 +0000 (09:24 -0800)]
[XLA:GPU] Fix HLO profiling when multiple streams are involved.
We were enqueueing the timer on the main stream, but not blocking the
substreams, so the results were nonsensical.
PiperOrigin-RevId:
187032412
A. Unique TensorFlower [Mon, 26 Feb 2018 16:58:48 +0000 (08:58 -0800)]
Add __str__ method to _RefVariableProcessor.
PiperOrigin-RevId:
187029027
A. Unique TensorFlower [Mon, 26 Feb 2018 16:04:09 +0000 (08:04 -0800)]
Drop the getcallargs extension as its logic had to be moved to a higher level into api.py.
PiperOrigin-RevId:
187022717