Sunil Srivastava [Mon, 25 Jul 2016 17:17:06 +0000 (17:17 +0000)]
Support '#pragma once' in headers when using PCH
The '#pragma once' directive was erroneously ignored when encountered
in the header-file specified in generate-PCH-mode. This resulted in
compile-time errors in some cases with legal code, and also a misleading
warning being produced.
Patch by Warren Ristow!
Differential Revision: http://reviews.llvm.org/D19815
llvm-svn: 276653
Jordan Rose [Mon, 25 Jul 2016 17:08:24 +0000 (17:08 +0000)]
StringSwitch cannot be copied or moved.
...but most importantly, it cannot be used well with 'auto', because
the inferred type is StringSwitch rather than the result type. This
is a problem because StringSwitch stores addresses of temporary
values rather than copying or moving the value into its own storage.
Changing this uncovered the bug in PassBuilder, also in this patch.
Clang doesn't seem to have any occurrences of the issue.
llvm-svn: 276652
Alexander Kornienko [Mon, 25 Jul 2016 17:08:18 +0000 (17:08 +0000)]
Revert "MPITypeMismatchCheck for Clang-Tidy"
This reverts commit r276640. Breaks multiple buildbots.
llvm-svn: 276651
Alexander Kornienko [Mon, 25 Jul 2016 17:08:10 +0000 (17:08 +0000)]
Revert "Remove trailing spaces."
This reverts commit r276641. Breaks multiple buildbots.
llvm-svn: 276650
Rui Ueyama [Mon, 25 Jul 2016 16:51:52 +0000 (16:51 +0000)]
Remove trailing whitespaces.
llvm-svn: 276649
Elena Demikhovsky [Mon, 25 Jul 2016 16:51:00 +0000 (16:51 +0000)]
AVX-512: Fixed [US]INT_TO_FP selection for i1 vectors.
It failed with assertion before this patch.
Differential Revision: https://reviews.llvm.org/D22735
llvm-svn: 276648
Douglas Katzman [Mon, 25 Jul 2016 16:36:02 +0000 (16:36 +0000)]
[Myriad]: better compatibility with vendor source
- Accept ma{2100,2150,2150} for -mcpu
- Define more preprocessor macros
- Don't append "le/" to little-endian lib dirs
llvm-svn: 276646
Tobias Grosser [Mon, 25 Jul 2016 16:31:21 +0000 (16:31 +0000)]
GPGPU: Load GPU kernels
We embed the PTX code into the host IR as a global variable and compile it
at run-time into a GPU kernel.
llvm-svn: 276645
Wei Mi [Mon, 25 Jul 2016 16:27:34 +0000 (16:27 +0000)]
Remove useless pass from the pipeline in test/Analysis/Dominators/2007-01-14-BreakCritEdges.ll.
llvm-svn: 276644
Davide Italiano [Mon, 25 Jul 2016 16:10:40 +0000 (16:10 +0000)]
[ELF/LinkerScript] Use correct section name in a test.
llvm-svn: 276642
Alexander Kornienko [Mon, 25 Jul 2016 15:43:22 +0000 (15:43 +0000)]
Remove trailing spaces.
llvm-svn: 276641
Alexander Kornienko [Mon, 25 Jul 2016 15:43:14 +0000 (15:43 +0000)]
MPITypeMismatchCheck for Clang-Tidy
Summary:
This check verifies if buffer type and MPI (Message Passing Interface)
datatype pairs match. All MPI datatypes defined by the MPI standard (3.1)
are verified by this check. User defined typedefs, custom MPI datatypes and
null pointer constants are skipped, in the course of verification.
Instructions on how to apply the check can be found at: https://github.com/0ax1/MPI-Checker/tree/master/examples
Reviewers: alexfh
Subscribers: cfe-commits
Projects: #clang-tools-extra
Patch by Alexander Droste!
Differential Revision: https://reviews.llvm.org/D21962
llvm-svn: 276640
Alexander Kornienko [Mon, 25 Jul 2016 15:27:16 +0000 (15:27 +0000)]
MPI-Checker: move MPIFunctionClassifier.h
Summary:
This patch moves the MPIFunctionClassifier header to `clang/include/clang/StaticAnalyzer/Checkers`,
in order to make it accessible in other parts of the architecture.
Reviewers: dcoughlin, zaks.anna
Subscribers: alexfh, cfe-commits
Patch by Alexander Droste!
Differential Revision: https://reviews.llvm.org/D22671
llvm-svn: 276639
Krzysztof Parzyszek [Mon, 25 Jul 2016 14:42:11 +0000 (14:42 +0000)]
[Hexagon] Add target feature to generate long calls
llvm-svn: 276638
Johannes Doerfert [Mon, 25 Jul 2016 12:48:45 +0000 (12:48 +0000)]
[GSoC] Add PolyhedralInfo pass - new interface to polly analysis
Adding a new pass PolyhedralInfo. This pass will be the interface to Polly.
Initially, we will provide the following interface:
- #IsParallel(Loop *L) - return a bool depending on whether the loop is
parallel or not for the given program order.
Patch by Utpal Bora <cs14mtech11017@iith.ac.in>
Differential Revision: https://reviews.llvm.org/D21486
llvm-svn: 276637
Tobias Grosser [Mon, 25 Jul 2016 12:47:39 +0000 (12:47 +0000)]
GPGPU: Emit data-transfer code
Also factor out getArraySize() to avoid code dupliciation and reorder some
function arguments to indicate the direction into which data is transferred.
llvm-svn: 276636
Tobias Grosser [Mon, 25 Jul 2016 12:47:33 +0000 (12:47 +0000)]
GPGPU: Complete code to allocate and free device arrays
At the beginning of each SCoP, we allocate device arrays for all arrays
used on the GPU and we free such arrays after the SCoP has been executed.
llvm-svn: 276635
Tobias Grosser [Mon, 25 Jul 2016 12:47:28 +0000 (12:47 +0000)]
GPURuntime: Add missing debug output
llvm-svn: 276634
Tobias Grosser [Mon, 25 Jul 2016 12:47:25 +0000 (12:47 +0000)]
GPURuntime: Fix typo in docu
llvm-svn: 276633
Tobias Grosser [Mon, 25 Jul 2016 12:47:22 +0000 (12:47 +0000)]
GPURuntime: Drop polly_cleanupGPGPUResources
This function is currently unused and won't be used in this form again. Instead
of freeing many unrelated items at the same time, we will instead explicitly
call free function from the host-IR we generate for each object we want to free.
These specific free functions will be added together with the corresponding
host-IR generation code.
llvm-svn: 276632
Johannes Doerfert [Mon, 25 Jul 2016 12:40:59 +0000 (12:40 +0000)]
[GSoC] Do not process SCoPs with infeasible runtime context
Do not process SCoPs with infeasible runtime context in the new
ScopInfoWrapperPass. Do not compute dependences for such SCoPs in the new
DependenceInfoWrapperPass.
Patch by Utpal Bora <cs14mtech11017@iith.ac.in>
Differential Revision: https://reviews.llvm.org/D22402
llvm-svn: 276631
Eugene Leviant [Mon, 25 Jul 2016 10:59:20 +0000 (10:59 +0000)]
[ELF] Test case for correct input section order, when sections are listed in linker script
llvm-svn: 276630
Sam Parker [Mon, 25 Jul 2016 10:11:00 +0000 (10:11 +0000)]
[ARM] Improve longMAC codegen test
Added thumb targets and dataflow checks to the longMAC test.
Differential Revision: https://reviews.llvm.org/D22684
llvm-svn: 276629
Simon Dardis [Mon, 25 Jul 2016 09:57:28 +0000 (09:57 +0000)]
[mips] Optimize materialization of i64 constants
Avoid MipsAnalyzeImmediate usage if the constant fits in an 32-bit
integer. This allows us to generate the same instructions for the
materialization of the same constants regardless the width of their
type.
Patch by: Vasileios Kalintiris
Contributions by: Simon Dardis
Reviewers: Daniel Sanders
Differential Review: https://reviews.llvm.org/D21689
llvm-svn: 276628
Roman Gareev [Mon, 25 Jul 2016 09:42:53 +0000 (09:42 +0000)]
Apply all necessary tilings and interchangings to get a macro-kernel
This is the second patch to apply the BLIS matmul optimization pattern
on matmul kernels
(http://www.cs.utexas.edu/users/flame/pubs/TOMS-BLIS-Analytical.pdf).
BLIS implements gemm as three nested loops around a macro-kernel, plus
two packing routines. The macro-kernel is implemented in terms
of two additional loops around a micro-kernel. The micro-kernel
is a loop around a rank-1 (i.e., outer product) update. In this change
we create the BLIS macro-kernel by applying a combination of tiling
and interchanging. In subsequent changes we will implement the packing
transformation.
Reviewed-by: Tobias Grosser <tobias@grosser.es>
Differential Revision: http://reviews.llvm.org/D21491
llvm-svn: 276627
Sam Parker [Mon, 25 Jul 2016 09:29:24 +0000 (09:29 +0000)]
[ARM] Small refactor of Thumb2 SMLA insts
Follow up to r276624. Changes bits 22-20 to be parameters to
instruction class.
Differential Revision: https://reviews.llvm.org/D22562
llvm-svn: 276626
Oliver Stannard [Mon, 25 Jul 2016 09:21:56 +0000 (09:21 +0000)]
[libunwind][ARM] Add support for Thumb1 targets
The Thumb1 version of the code for saving and restoring the unwind
context has a few bugs which prevent it from working:
* It uses the STM instruction without writeback, which is not valid for Thumb1
(It was introduced in Thumb2).
* It only saves/restores the low 8 registers, the sp and the lr, so if a
program uses r8-r12 they will not be correctly restored when throwing an
exception.
There aren't currently any Thumb1 build-bots to test this, but we have
been successfully running the libc++abi and libc++ test suites on
Cortex-M0 models, as well as some other test suites that use C++
exceptions on a downstream version of libunwind with this patch applied.
Differential Revision: https://reviews.llvm.org/D22292
llvm-svn: 276625
Sam Parker [Mon, 25 Jul 2016 09:20:20 +0000 (09:20 +0000)]
[ARM] Enable ISel of SMMLS for ARM and Thumb2
Use ISelDAGToDAG to recognise the SMMLS instruction pattern.
Differential Revision: https://reviews.llvm.org/D22562
llvm-svn: 276624
Tobias Grosser [Mon, 25 Jul 2016 09:16:01 +0000 (09:16 +0000)]
GPGPU: initialize GPU context and simplify the corresponding GPURuntime interface.
There is no need to expose the selected device at the moment. We also pass back
pointers as return values, as this simplifies the interface.
llvm-svn: 276623
Tobias Grosser [Mon, 25 Jul 2016 09:15:57 +0000 (09:15 +0000)]
IslNodeBuilder: Make finalize() virtual
This allows the finalization routine of the IslNodeBuilder to be overwritten
by derived classes. Being here, we also drop the unnecessary 'Scop' postfix
and the unnecessary 'Scop' parameter.
llvm-svn: 276622
Tobias Grosser [Mon, 25 Jul 2016 09:15:53 +0000 (09:15 +0000)]
GPURuntime: Check for debug-mode early on
Before this change, the debug statements in polly_initDevice would all be
skipped, as debug-mode would only be enabled _after_ they have already been run.
llvm-svn: 276621
Kirill Bobyrev [Mon, 25 Jul 2016 08:56:11 +0000 (08:56 +0000)]
[clang-rename] fix typos in tests
TemplateFunctionFindBy{Declaration|Use}.cpp contained typos and therefore were
failing. clang-rename passes these tests after typos are fixed.
llvm-svn: 276620
George Rimar [Mon, 25 Jul 2016 08:29:46 +0000 (08:29 +0000)]
[ELF] - Linkerscript: implemented output section [address] attribute.
Output section description in SECTIONS looks like that:
section [address] [(type)] :
...
{
...
}
Patch implements support of address atribute.
Differential revision: https://reviews.llvm.org/D22689
llvm-svn: 276619
Jonas Hahnfeld [Mon, 25 Jul 2016 08:04:26 +0000 (08:04 +0000)]
Update description for CLANG_DEFAULT_CXX_STDLIB and add comment. NFC
We agreed to call it "platform default" instead of "architecture default".
(see D17286)
llvm-svn: 276618
Tobias Grosser [Mon, 25 Jul 2016 08:03:08 +0000 (08:03 +0000)]
GPURuntime: Drop timing functionality (some leftover II)
llvm-svn: 276617
Roman Gareev [Mon, 25 Jul 2016 07:27:59 +0000 (07:27 +0000)]
[NFC] Refactor creation of the BLIS mirco-kernel and improve documentation
Reviewed-by: Tobias Grosser <tobias@grosser.es>
llvm-svn: 276616
Craig Topper [Mon, 25 Jul 2016 07:20:35 +0000 (07:20 +0000)]
[AVX512] Add load folding support for the unmasked forms of the FMA instructions.
llvm-svn: 276615
Craig Topper [Mon, 25 Jul 2016 07:20:31 +0000 (07:20 +0000)]
[AVX512] Add some additional patterns so that we can fold broadcast loads in the first argument of an FMADD/FMSUB/FNMADD/FNMSUB/FMADDSUB/FMSUBADD node. Also add patterns to support all combinations of the broadcast input and the preserved input for masked versions.
llvm-svn: 276614
Craig Topper [Mon, 25 Jul 2016 07:20:28 +0000 (07:20 +0000)]
[AVX512] Cleanup FMA operand order in patterns to match the VEX versions and to really be 213, 231, and 132.
llvm-svn: 276613
Tobias Grosser [Mon, 25 Jul 2016 07:11:49 +0000 (07:11 +0000)]
GPURuntime: Drop timing functionality (some leftover)
llvm-svn: 276612
Tobias Grosser [Mon, 25 Jul 2016 07:10:45 +0000 (07:10 +0000)]
GPURuntime: Drop timing functionality
This functionality won't be used in the current iteration. Drop it for now to
reduce the surface of the library. We can always add it back in when we need
it again.
llvm-svn: 276611
Sean Silva [Mon, 25 Jul 2016 05:57:59 +0000 (05:57 +0000)]
Cleanup : Reformat PartialInliner.cpp to have current LLVM style conventions
Modify the variable names and code style to be that of modern LLVM.
Patch by River Riddle!
Differential Revision: https://reviews.llvm.org/D22743
llvm-svn: 276610
Sean Silva [Mon, 25 Jul 2016 05:00:00 +0000 (05:00 +0000)]
Fix : Partial Inliner requires AssumptionCacheTracker
The public InlineFunction utility assumes that the passed in
InlineFunctionInfo has a valid AssumptionCacheTracker.
Patch by River Riddle!
Differential Revision: https://reviews.llvm.org/D22706
llvm-svn: 276609
Eric Fiselier [Mon, 25 Jul 2016 04:56:32 +0000 (04:56 +0000)]
Remove use of C++1z static assert in C++11 test
llvm-svn: 276608
David Majnemer [Mon, 25 Jul 2016 04:47:45 +0000 (04:47 +0000)]
Update test to account for r276604
llvm-svn: 276607
David Majnemer [Mon, 25 Jul 2016 04:47:44 +0000 (04:47 +0000)]
[MSVCToolChain] Guard hardcoded Windows paths with LLVM_ON_WIN32
Paths like C:/foo will never work on UNIX platforms, don't bother
implicitly adding them to the search path.
llvm-svn: 276606
Eric Fiselier [Mon, 25 Jul 2016 04:32:07 +0000 (04:32 +0000)]
Implement the std::pair parts of "Improving pair and tuple". Completes N4387.
llvm-svn: 276605
David Majnemer [Mon, 25 Jul 2016 03:39:21 +0000 (03:39 +0000)]
[InstSimplify] Fold trunc([zs]ext(%V)) -> %V
Truncates can completely cancel out a zext or sext instruction.
llvm-svn: 276604
Eric Fiselier [Mon, 25 Jul 2016 02:36:42 +0000 (02:36 +0000)]
Recommit r276548 - Make pair/tuples assignment operators SFINAE properly.
I think I've solved issues with is_assignable and references to incomplete
types. The updated patch adds tests for this case.
llvm-svn: 276603
David Majnemer [Mon, 25 Jul 2016 02:21:25 +0000 (02:21 +0000)]
[GVNHoist] Merge metadata on hoisted instructions less conservatively
We can combine metadata from multiple instructions intelligently for
certain metadata nodes.
llvm-svn: 276602
David Majnemer [Mon, 25 Jul 2016 02:21:23 +0000 (02:21 +0000)]
[GVNHoist] Properly merge alignments when hoisting
If we two loads of two different alignments, we must use the minimum of
the two alignments when hoisting. Same deal for stores.
For allocas, use the maximum of the two allocas.
llvm-svn: 276601
David Majnemer [Mon, 25 Jul 2016 02:21:19 +0000 (02:21 +0000)]
[Utils] Simplify combineMetadata
Use a range-based for loop, no functional change is intended.
llvm-svn: 276600
Eric Fiselier [Mon, 25 Jul 2016 02:08:55 +0000 (02:08 +0000)]
Make std::is_assignable tolerate references to incomplete types.
llvm-svn: 276599
Eric Fiselier [Mon, 25 Jul 2016 01:45:07 +0000 (01:45 +0000)]
Revert r276548 - Make pair/tuples assignment operators SFINAE properly.
This is a breaking change. The SFINAE required is instantiated the second
the class is instantiated, and this can cause hard SFINAE errors
when applied to references to incomplete types. Ex.
struct IncompleteType;
extern IncompleteType it;
std::tuple<IncompleteType&> t(it); // SFINAE will blow up.
llvm-svn: 276598
NAKAMURA Takumi [Mon, 25 Jul 2016 00:59:51 +0000 (00:59 +0000)]
Untabify.
llvm-svn: 276597
NAKAMURA Takumi [Mon, 25 Jul 2016 00:59:46 +0000 (00:59 +0000)]
Trailing whitespace.
llvm-svn: 276596
Eric Fiselier [Mon, 25 Jul 2016 00:50:32 +0000 (00:50 +0000)]
Make dtor_noexcept.pass.cpp tests more portable. Patch from STL@microsoft.com
llvm-svn: 276595
Eric Fiselier [Mon, 25 Jul 2016 00:48:36 +0000 (00:48 +0000)]
Don't SFINAE pair's copy assignment operator in C++03 mode.
In C++03 mode evaluating the SFINAE can cause a hard error due to
access control violations. This is a problem because the SFINAE
is evaluated as soon as the class is instantiated, and not later.
llvm-svn: 276594
Eric Fiselier [Mon, 25 Jul 2016 00:26:41 +0000 (00:26 +0000)]
Mark bucket_count() assertions as non-portable. Patch from STL@microsoft.com
llvm-svn: 276593
Davide Italiano [Mon, 25 Jul 2016 00:25:18 +0000 (00:25 +0000)]
[ELF] Fix the semantic of PROVIDE in linker scripts.
PROVIDE request us to define a symbol only if it is referenced and is
not defined by any object included in the link. We created the
symbol in the symbol table no matter what.
Differential Revision: https://reviews.llvm.org/D22739
llvm-svn: 276592
Eric Fiselier [Mon, 25 Jul 2016 00:18:12 +0000 (00:18 +0000)]
Make move_assign_noexcept.pass.cpp tests more portable. Patch from STL@microsoft.com
llvm-svn: 276591
Eric Fiselier [Mon, 25 Jul 2016 00:15:29 +0000 (00:15 +0000)]
Make swap_noexcept.pass.cpp tests more portable. Patch from STL@microsoft.com.
See D21820 for more information (https://reviews.llvm.org/D21820).
llvm-svn: 276590
Eric Fiselier [Mon, 25 Jul 2016 00:02:23 +0000 (00:02 +0000)]
Work around MSVC's non-standard ABI for enums. Patch from STL@microsoft.com
llvm-svn: 276589
Eric Fiselier [Sun, 24 Jul 2016 23:49:42 +0000 (23:49 +0000)]
Fix a non-standard allocator in vector tests. Patch from STL@microsoft.com
llvm-svn: 276588
Eric Fiselier [Sun, 24 Jul 2016 23:48:26 +0000 (23:48 +0000)]
Fix unique_ptr.runtime tests for null inputs. Patch from STL@microsoft.com
llvm-svn: 276587
Rui Ueyama [Sun, 24 Jul 2016 23:47:31 +0000 (23:47 +0000)]
Simplify. NFC.
llvm-svn: 276586
Eric Fiselier [Sun, 24 Jul 2016 23:36:18 +0000 (23:36 +0000)]
Fix portability issues in <random> tests. Patch from STL@microsoft.com
llvm-svn: 276585
Eric Fiselier [Sun, 24 Jul 2016 23:34:18 +0000 (23:34 +0000)]
Mark bucket() assertions as non-portable. Patch from STL@microsoft.com
llvm-svn: 276584
Eric Fiselier [Sun, 24 Jul 2016 23:32:48 +0000 (23:32 +0000)]
Fix MSVC unreferenced parameter warning. Patch from STL@microsoft.com
llvm-svn: 276583
Davide Italiano [Sun, 24 Jul 2016 23:29:18 +0000 (23:29 +0000)]
[ELF] Move linker script tests to a subdirectory. NFCI.
llvm-svn: 276582
Eric Fiselier [Sun, 24 Jul 2016 23:19:51 +0000 (23:19 +0000)]
Make move_noexcept.pass.cpp tests more portable. Patch from STL@microsoft.com
llvm-svn: 276581
Eric Fiselier [Sun, 24 Jul 2016 23:16:37 +0000 (23:16 +0000)]
Make bucket_count() greater-equal assertions portable. Patch from STL@microsoft.com
llvm-svn: 276580
Davide Italiano [Sun, 24 Jul 2016 23:13:48 +0000 (23:13 +0000)]
[ELF] Support PROVIDE/PROVIDE_HIDDEN inside output sections description.
Differential Revision: https://reviews.llvm.org/D22738
llvm-svn: 276579
Eric Fiselier [Sun, 24 Jul 2016 23:13:36 +0000 (23:13 +0000)]
Mark bucket_size() assertions as non-portible. Patch from STL@microsoft.com
llvm-svn: 276578
George Burgess IV [Sun, 24 Jul 2016 23:12:40 +0000 (23:12 +0000)]
[Sema] Replace mem_fn with lambdas. NFC.
I'm told that some optimizers like lambdas a lot more than mem_fn.
Given that the readability difference is basically nil, and we seem to
use lambdas basically everywhere else, it seems sensible to just use
lambdas.
llvm-svn: 276577
Eric Fiselier [Sun, 24 Jul 2016 23:08:21 +0000 (23:08 +0000)]
Guard libc++ specific tests SFINAE on std::bind's call operator. Patch from STL@microsoft.com
llvm-svn: 276576
Rui Ueyama [Sun, 24 Jul 2016 23:05:57 +0000 (23:05 +0000)]
Merge readSymbolAssignment with readAssignment. NFC.
llvm-svn: 276575
Rui Ueyama [Sun, 24 Jul 2016 18:19:40 +0000 (18:19 +0000)]
Make readExpr return an Expr object instead of a vector of tokens.
Previously, we handled an expression as a vector of tokens. In other
words, an expression was a vector of uncooked raw StringRefs.
When we need a value of an expression, we used ExprParser to run
the expression.
The separation was needed essentially because parse time is too
early to evaluate an expression. In order to evaluate an expression,
we need to finalize section sizes. Because linker script parsing
is done at very early stage of the linking process, we can't
evaluate expressions while parsing.
The above mechanism worked fairly well, but there were a few
drawbacks.
One thing is that we sometimes have to parse the same expression
more than once in order to find the end of the expression.
In some contexts, linker script expressions have no clear end marker.
So, we needed to recognize balanced expressions and ternary operators.
The other is poor error reporting. Since expressions are parsed
basically twice, and some information that is available at the first
stage is lost in the second stage, it was hard to print out
apprpriate error messages.
This patch fixes the issues with a new approach.
Now the expression parsing is integrated into ScriptParser.
ExprParser class is removed. Expressions are represented as lambdas
instead of vectors of tokens. Lambdas captures information they
need to run themselves when they are created.
In this way, ends of expressions are naturally detected, and
errors are handled in the usual way. This patch also reduces
the amount of code.
Differential Revision: https://reviews.llvm.org/D22728
llvm-svn: 276574
David Majnemer [Sun, 24 Jul 2016 17:44:03 +0000 (17:44 +0000)]
[Driver] Switch some getenv calls to llvm::sys::Process::GetEnv
No functional change is intended.
llvm-svn: 276573
David Majnemer [Sun, 24 Jul 2016 17:19:59 +0000 (17:19 +0000)]
[CommandLine] Use Process::GetEnv instead of _wgetenv
Process::GetEnv does the right thing across our platforms.
CommandLine.cpp had, more or less, the same logic. Let's remove the
duplication.
No functional change is intended.
llvm-svn: 276572
Simon Pilgrim [Sun, 24 Jul 2016 16:39:50 +0000 (16:39 +0000)]
[X86][SSE] Added PR27854 tests
llvm-svn: 276571
Simon Pilgrim [Sun, 24 Jul 2016 16:10:21 +0000 (16:10 +0000)]
[X86] Add shift double tests for PR14593
llvm-svn: 276570
Simon Pilgrim [Sun, 24 Jul 2016 16:00:53 +0000 (16:00 +0000)]
[X86] Add 'FeatureSlowSHLD' to cpu 'bdver4'
As with all AMD CPUs, excavator has poor SHLD/SHRD performance. Also added bdver3 to the test as it was missing.
llvm-svn: 276569
Simon Pilgrim [Sun, 24 Jul 2016 15:47:44 +0000 (15:47 +0000)]
[X86] Add SHRD shift combine tests
llvm-svn: 276568
Simon Pilgrim [Sun, 24 Jul 2016 15:38:51 +0000 (15:38 +0000)]
[X86] Regenerate shift by parts tests
llvm-svn: 276567
Simon Pilgrim [Sun, 24 Jul 2016 15:25:36 +0000 (15:25 +0000)]
[X86][SSE] Regenerate shifts tests
llvm-svn: 276566
Simon Pilgrim [Sun, 24 Jul 2016 15:17:50 +0000 (15:17 +0000)]
[X86][SSE] Regenerate SSE copysign tests
llvm-svn: 276565
Craig Topper [Sun, 24 Jul 2016 14:58:06 +0000 (14:58 +0000)]
[X86] Block pbroadcastq instructions on 32-bit targets instead of pbroadcastb.
Thanks to Simon Pilgrim for catching the mistake.
llvm-svn: 276564
Simon Pilgrim [Sun, 24 Jul 2016 13:01:51 +0000 (13:01 +0000)]
[X86][AVX512VL] Added AVX512VL half2float vector conversions tests to demonstrate PR23941
llvm-svn: 276563
Craig Topper [Sun, 24 Jul 2016 08:26:38 +0000 (08:26 +0000)]
[X86] Make the FMA3 instruction names consistent between VEX and EVEX encoded versions.
This places the 132/213/231 form number in front of the SS/SD/PS/PD. Move the Y for 256-bit versions to be after the PS/PD. Change the AVX512 scalar forms to include a Z in the their name. This new format should be consistent with the general naming of instructions.
llvm-svn: 276559
Eric Fiselier [Sun, 24 Jul 2016 08:16:37 +0000 (08:16 +0000)]
commit test missing from r276556
llvm-svn: 276558
Artem Dergachev [Sun, 24 Jul 2016 08:15:58 +0000 (08:15 +0000)]
[analyzer] Pring LocationContext in ExplodedGraph dumps.
Remove some FIXMEs in the surrounding code,
which have been addressed long time ago
by introducing checker-specific tags.
Differential Revision: https://reviews.llvm.org/D22622
llvm-svn: 276557
Eric Fiselier [Sun, 24 Jul 2016 07:42:13 +0000 (07:42 +0000)]
Add __is_inplace_type metafunction helper
llvm-svn: 276556
Craig Topper [Sun, 24 Jul 2016 07:32:45 +0000 (07:32 +0000)]
[X86] Replace CodeGenOnly VPSRAVW/D/Q_Int instructions with patterns since the operand types exactly match the normal VPSRAVW/D/Q instructions.
llvm-svn: 276555
Elena Demikhovsky [Sun, 24 Jul 2016 07:24:54 +0000 (07:24 +0000)]
[Loop Vectorizer] Handling loops FP induction variables.
Allowed loop vectorization with secondary FP IVs. Like this:
float *A;
float x = init;
for (int i=0; i < N; ++i) {
A[i] = x;
x -= fp_inc;
}
The auto-vectorization is possible when the induction binary operator is "fast" or the function has "unsafe" attribute.
Differential Revision: https://reviews.llvm.org/D21330
llvm-svn: 276554
George Burgess IV [Sun, 24 Jul 2016 07:03:49 +0000 (07:03 +0000)]
[MSSA] Make EXPENSIVE_CHECKS check more.
checkClobberSanity will now be run for all results of `ClobberWalk`,
instead of just the crazy phi-optimized ones. This can help us catch
cases where our cache is being wonky.
llvm-svn: 276553
Eric Fiselier [Sun, 24 Jul 2016 06:51:55 +0000 (06:51 +0000)]
Start adding benchmarks for vector
llvm-svn: 276552
Tobias Grosser [Sun, 24 Jul 2016 06:43:21 +0000 (06:43 +0000)]
GPGPU: Optimize kernel IR before generating assembly code
We optimize the kernel _after_ dumping the IR we generate to make the IR we
dump easier readable and independent of possible changes in the general
purpose LLVM optimizers.
llvm-svn: 276551
Tobias Grosser [Sun, 24 Jul 2016 06:43:17 +0000 (06:43 +0000)]
GPGPU: Verify kernel IR before generating assembly
llvm-svn: 276550
Eric Fiselier [Sun, 24 Jul 2016 06:22:25 +0000 (06:22 +0000)]
Skip chash computation in insert/emplace if the unconstrained hash matches.
llvm-svn: 276549