Nikolay Haustov [Wed, 2 Mar 2016 10:54:21 +0000 (10:54 +0000)]
Revert "[AMDGPU] Using table-driven amd_kernel_code_t field parser in assembler."
Build failure with clang.
llvm-svn: 262475
Nikolay Haustov [Wed, 2 Mar 2016 10:36:30 +0000 (10:36 +0000)]
[AMDGPU] Using table-driven amd_kernel_code_t field parser in assembler.
complementary patch to table-driven amd_kernel_code_t field parser/printer utility. lit tests passed.
Patch by: Valery Pykhtin
Differential Revision: http://reviews.llvm.org/D17151
llvm-svn: 262474
Nikolay Haustov [Wed, 2 Mar 2016 10:36:25 +0000 (10:36 +0000)]
[AMDGPU] table-driven parser/printer for amd_kernel_code_t structure fields
This is going to be used in .hsatext disassembler and can be used
in current assembler parser (lit tests passed on parsing).
Code using this helpers isn't included in this patch.
Benefits:
unified approach
fast field name lookup on parsing
Later I would like to enhance some of the field naming/syntax using this code.
Patch by: Valery Pykhtin
Differential Revision: http://reviews.llvm.org/D17150
llvm-svn: 262473
Dmitry Vyukov [Wed, 2 Mar 2016 09:54:40 +0000 (09:54 +0000)]
libfuzzer: fix compiler warnings
- unused sigaction/setitimer result (used in assert)
- unchecked fscanf return value
- signed/unsigned comparison
llvm-svn: 262472
Michael Zuckerman [Wed, 2 Mar 2016 09:05:46 +0000 (09:05 +0000)]
[CLANG] [AVX512] [BUILTIN] Adding PSRAV
Differential Revision: http://reviews.llvm.org/D17699
llvm-svn: 262471
Haojian Wu [Wed, 2 Mar 2016 09:01:25 +0000 (09:01 +0000)]
[clang-tidy] Make 'modernize-pass-by-value' fix work on header files.
Reviewers: alexfh
Subscribers: jbcoe, cfe-commits
Differential Revision: http://reviews.llvm.org/D17756
llvm-svn: 262470
Filipe Cabecinhas [Wed, 2 Mar 2016 08:30:40 +0000 (08:30 +0000)]
[test/vptr-non-unique-typeinfo] Address Samsonov's post-commit review
Reviewers: samsonov
Subscribers: llvm-commits
Differential Revision: http://reviews.llvm.org/D17783
llvm-svn: 262469
Craig Topper [Wed, 2 Mar 2016 07:32:45 +0000 (07:32 +0000)]
[X86] Remove unnecessary call to isReg from emitter's DestMem handling for VEX prefix. The operand is always a register. NFC
llvm-svn: 262468
Craig Topper [Wed, 2 Mar 2016 07:32:43 +0000 (07:32 +0000)]
[X86] Make X86MCCodeEmitter::DetermineREXPrefix locate operands more like how VEX prefix handling does.
llvm-svn: 262467
David Majnemer [Wed, 2 Mar 2016 06:48:47 +0000 (06:48 +0000)]
[Sema] PR26444 fix crash when alignment value is >= 2**16
Sema allows max values up to 2**28, use unsigned instead of unsiged
short to hold values that large.
Differential Revision: http://reviews.llvm.org/D17248
Patch by Don Hinton!
llvm-svn: 262466
David Majnemer [Wed, 2 Mar 2016 06:46:52 +0000 (06:46 +0000)]
[X86] Permit reading of the FLAGS register without it being previously defined
We modeled the RDFLAGS{32,64} operations as "using" {E,R}FLAGS.
While technically correct, this is not be desirable for folks who want
to examine aspects of the FLAGS register which are not related to
computation like whether or not CPUID is a valid instruction.
Differential Revision: http://reviews.llvm.org/D17782
llvm-svn: 262465
Craig Topper [Wed, 2 Mar 2016 06:35:22 +0000 (06:35 +0000)]
[X86] Remove assertion I accidentally left in.
llvm-svn: 262464
Davide Italiano [Wed, 2 Mar 2016 06:09:18 +0000 (06:09 +0000)]
[modules] addHeaderInclude() can't fail.
Differential Revision: http://reviews.llvm.org/D17794
llvm-svn: 262463
Craig Topper [Wed, 2 Mar 2016 06:06:18 +0000 (06:06 +0000)]
[X86] Be more structured about how we capture the register number when it is encoded in bits 7:4 of the immediate.
For some instructions the register is not the last operand and the immediate handling had to detect this and hardcode the index to find it. It also required CurOp to be pointing at the last operand handled in the Form switch whereas for any instruction it would be pointing at the next operand.
Now we just capture the value in the Form switch when we know exactly where it is and the CurOp pointer can behave normally.
llvm-svn: 262462
Simon Atanasyan [Wed, 2 Mar 2016 05:38:42 +0000 (05:38 +0000)]
[ELF] Fix reading of PC values of FDEs
The patch fixes two related problems:
- If CIE augmentation string has 'L' token the CIE contains a byte
defines LSDA encoding. We should skip this byte in `getFdeEncoding`
routine. Before this fix we do not skip it and if the next token
is 'R' treat this byte as FDE encoding.
- FDE encoding format has separate flags e.g. DW_EH_PE_pcrel for
definition of relative pointers. We should add .eh_frame address to
the PC value iif the DW_EH_PE_pcrel is specified.
http://www.airs.com/blog/archives/460
There is one more not fixed problem in this code. If PC value is encoded
using signed relative format e.g. DW_EH_PE_sdata4 | DW_EH_PE_pcrel we
should sign extend result of read32 to perform calculation correctly.
I am going to fix that in a separate patch.
Differential Revision: http://reviews.llvm.org/D17733
llvm-svn: 262461
Alexey Bataev [Wed, 2 Mar 2016 04:57:40 +0000 (04:57 +0000)]
[OPENMP 4.5] Codegen for data members in 'reduction' clause.
OpenMP 4.5 allows to privatize non-static data members of current class
in non-static member functions. Patch supports codegen for non-static
data members in 'reduction' clauses.
llvm-svn: 262460
Sanjoy Das [Wed, 2 Mar 2016 04:52:22 +0000 (04:52 +0000)]
[SCEV] Minor naming, braces cleanup; NFC
llvm-svn: 262459
Craig Topper [Wed, 2 Mar 2016 04:42:31 +0000 (04:42 +0000)]
[X86] Use MCPhysReg and uint16_t for static arrays of registers and opcodes respectively should reduce size tiny bit. NFC
llvm-svn: 262458
Matt Arsenault [Wed, 2 Mar 2016 04:12:39 +0000 (04:12 +0000)]
AMDGPU: Fix bug 26659.
Fix checking the same instruction twice instead of the
second branch that uses vccz. I don't think this matters
currently because s_branch_vccnz is always used currently.
llvm-svn: 262457
Matt Arsenault [Wed, 2 Mar 2016 04:05:14 +0000 (04:05 +0000)]
AMDGPU: Cleanup suggested in bug 23960
llvm-svn: 262456
Matt Arsenault [Wed, 2 Mar 2016 03:33:55 +0000 (03:33 +0000)]
Bug 20810: Use report_fatal_error instead of unreachable
llvm-svn: 262455
Sanjoy Das [Wed, 2 Mar 2016 02:56:29 +0000 (02:56 +0000)]
Add a comment with a rational for the unusual code structure
llvm-svn: 262454
Sanjoy Das [Wed, 2 Mar 2016 02:44:08 +0000 (02:44 +0000)]
Qualify getRangeForAffineAR with this-> for MSVC
llvm-svn: 262453
George Burgess IV [Wed, 2 Mar 2016 02:35:04 +0000 (02:35 +0000)]
Attempt to fix ASAN failure in a MemorySSA test.
llvm-svn: 262452
Sanjoy Das [Wed, 2 Mar 2016 02:34:20 +0000 (02:34 +0000)]
Perturb code in an attempt to appease MSVC
For some reason MSVC seems to think I'm calling getConstant() from a
static context. Try to avoid this issue by explicitly specifying
'this->' (though I'm not confident that this will actually work).
llvm-svn: 262451
Eugene Zelenko [Wed, 2 Mar 2016 02:18:18 +0000 (02:18 +0000)]
Fix Clang-tidy modernize-use-nullptr warnings in some files in source/Core; other minor fixes.
llvm-svn: 262450
Sanjoy Das [Wed, 2 Mar 2016 02:15:42 +0000 (02:15 +0000)]
More code permutation to appease MSVC
llvm-svn: 262449
Sanjoy Das [Wed, 2 Mar 2016 01:59:37 +0000 (01:59 +0000)]
Remove "auto" to appease the MSVC bots
llvm-svn: 262448
Rui Ueyama [Wed, 2 Mar 2016 01:49:27 +0000 (01:49 +0000)]
Use C++11 initializers for data members.
llvm-svn: 262447
Matt Arsenault [Wed, 2 Mar 2016 01:36:51 +0000 (01:36 +0000)]
DAGCombiner: Make sure an integer is being truncated
llvm-svn: 262446
Rui Ueyama [Wed, 2 Mar 2016 01:34:36 +0000 (01:34 +0000)]
Remove more dead code.
llvm-svn: 262445
Rui Ueyama [Wed, 2 Mar 2016 01:34:34 +0000 (01:34 +0000)]
Remove default implementations that are always overridden by subclasses.
llvm-svn: 262444
Rui Ueyama [Wed, 2 Mar 2016 01:23:29 +0000 (01:23 +0000)]
Remove more dead code.
llvm-svn: 262443
Reid Kleckner [Wed, 2 Mar 2016 01:21:48 +0000 (01:21 +0000)]
Add a few dllimport/dllexport tests. NFC
Summary:
This change just adds tests for some corner cases of dllimport/dllexport,
primarily for some static methods.
We plan to enable dllimport/dllexport support for the PS4, and these
additional tests are for points we previously were testing internally.
-Warren Ristow
SN Systems - Sony Computer Entertainment Group
Reviewers: rnk
Subscribers: silvas
Differential Revision: http://reviews.llvm.org/D17775
llvm-svn: 262442
Eugene Zelenko [Wed, 2 Mar 2016 01:09:03 +0000 (01:09 +0000)]
Fix Clang-tidy modernize-use-nullptr warnings in some files in source/Core; other minor fixes.
llvm-svn: 262441
Sanjay Patel [Wed, 2 Mar 2016 01:04:09 +0000 (01:04 +0000)]
revert r262424 because there's a *clang test* for AArch64 that checks -O3 asm output
that is broken by this change
llvm-svn: 262440
Daniel Berlin [Wed, 2 Mar 2016 00:58:48 +0000 (00:58 +0000)]
Fix SHARED_LIBS build
llvm-svn: 262439
Sanjoy Das [Wed, 2 Mar 2016 00:57:54 +0000 (00:57 +0000)]
[SCEV] Make getRange smarter around selects
Have ScalarEvolution::getRange re-consider cases like "{C?A:B,+,C?P:Q}"
by factoring out "C" and computing RangeOf{A,+,P} union RangeOf({B,+,Q})
instead.
The latter can be easier to compute precisely in cases like
"{C?0:N,+,C?1:-1}" N is the backedge taken count of the loop; since in
such cases the latter form simplifies to [0,N+1) union [0,N+1).
llvm-svn: 262438
Sanjoy Das [Wed, 2 Mar 2016 00:57:39 +0000 (00:57 +0000)]
[SCEV] Extract out a getRangeForAffineAR; NFC
Pure code-motion change. Will be used later in making getRange more clever.
llvm-svn: 262437
Rui Ueyama [Wed, 2 Mar 2016 00:43:37 +0000 (00:43 +0000)]
Remove dead code.
llvm-svn: 262436
Rui Ueyama [Wed, 2 Mar 2016 00:37:50 +0000 (00:37 +0000)]
Remove remaining files of Core.
llvm-svn: 262435
Rui Ueyama [Wed, 2 Mar 2016 00:31:34 +0000 (00:31 +0000)]
Tidy up CMakefiles.
llvm-svn: 262434
Rui Ueyama [Wed, 2 Mar 2016 00:28:35 +0000 (00:28 +0000)]
Remove unused #include's.
llvm-svn: 262431
Chris Bieneman [Wed, 2 Mar 2016 00:27:15 +0000 (00:27 +0000)]
[CMake] Add convenience target clang-test-depends to build test dependencies.
This is useful when paired with the distribution targets to build prerequisites for running tests.
llvm-svn: 262429
Chris Bieneman [Wed, 2 Mar 2016 00:27:14 +0000 (00:27 +0000)]
[CMake] Add convenience target llvm-test-depends to build test dependencies.
This is useful when paired with the distribution targets to build prerequisites for running tests.
llvm-svn: 262428
Chris Bieneman [Wed, 2 Mar 2016 00:27:12 +0000 (00:27 +0000)]
[CMake] Add distribution target that is the "just-build" side of install-distribution
This is just a convenience target to allow limiting what you build.
llvm-svn: 262427
Sanjay Patel [Tue, 1 Mar 2016 23:55:18 +0000 (23:55 +0000)]
[InstCombine] convert 'isPositive' and 'isNegative' vector comparisons to shifts (PR26701)
As noted in the code comment, I don't think we can do the same transform that we do for
*scalar* integers comparisons to *vector* integers comparisons because it might pessimize
the general case.
Exhibit A for an incomplete integer comparison ISA remains x86 SSE/AVX: it only has EQ and GT
for integer vectors.
But we should now recognize all the variants of this construct and produce the optimal code
for the cases shown in:
https://llvm.org/bugs/show_bug.cgi?id=26701
llvm-svn: 262424
Rui Ueyama [Tue, 1 Mar 2016 23:52:04 +0000 (23:52 +0000)]
Make a few utility functions file-scoped.
makeErrorFile and parseMemberFiles are now used only in DarwinLdDriver.cpp.
This patch moves them to that file.
llvm-svn: 262423
Rui Ueyama [Tue, 1 Mar 2016 23:48:03 +0000 (23:48 +0000)]
Remove dead code.
llvm-svn: 262422
Rui Ueyama [Tue, 1 Mar 2016 23:44:05 +0000 (23:44 +0000)]
Remove CoreDriver.
CoreDriver implements a driver for a hypothetical platform.
It is intended to be used in unittests. However, it is actually
redundant because the features are tested using the real driver
for the real platforms. So we can remove this.
http://reviews.llvm.org/D17698
llvm-svn: 262421
Nico Weber [Tue, 1 Mar 2016 23:16:44 +0000 (23:16 +0000)]
clang-cl: Implement initial limited support for precompiled headers.
In the gcc precompiled header model, one explicitly runs clang with `-x
c++-header` on a .h file to produce a gch file, and then includes the header
with `-include foo.h` and if a .gch file exists for that header it gets used.
This is documented at
http://clang.llvm.org/docs/UsersManual.html#precompiled-headers
cl.exe's model is fairly different, and controlled by the two flags /Yc and
/Yu. A pch file is generated as a side effect of a regular compilation when
/Ycheader.h is passed. While the compilation is running, the compiler keeps
track of #include lines in the main translation unit and writes everything up
to an `#include "header.h"` line into a pch file. Conversely, /Yuheader.h tells
the compiler to skip all code in the main TU up to and including `#include
"header.h"` and instead load header.pch. (It's also possible to use /Yc and /Yu
without an argument, in that case a `#pragma hrdstop` takes the role of
controlling the point where pch ends and real code begins.)
This patch implements limited support for this in that it requires the pch
header to be passed as a /FI force include flag – with this restriction,
it can be implemented almost completely in the driver with fairly small amounts
of code. For /Yu, this is trivial, and for /Yc a separate pch action is added
that runs before the actual compilation. After r261774, the first failing
command makes a compilation stop – this means if the pch fails to build the
main compilation won't run, which is what we want. However, in /fallback builds
we need to run the main compilation even if the pch build fails so that the
main compilation's fallback can run. To achieve this, add a ForceSuccessCommand
that pretends that the pch build always succeeded in /fallback builds (the main
compilation will then fail to open the pch and run the fallback cl.exe
invocation).
If /Yc /Yu are used in a setup that clang-cl doesn't implement yet, clang-cl
will now emit a "not implemented yet; flag ignored" warning that can be
disabled using -Wno-clang-cl-pch.
Since clang-cl doesn't yet serialize some important things (most notably
`pragma comment(lib, ...)`, this feature is disabled by default and only
enabled by an internal driver flag. Once it's more stable, this internal flag
will disappear.
(The default stdafx.h setup passes stdafx.h as explicit argument to /Yc but not
as /FI – instead every single TU has to `#include <stdafx.h>` as first thing it
does. Implementing support for this should be possible with the approach in
this patch with minimal frontend changes by passing a --stop-at / --start-at
flag from the driver to the frontend. This is left for a follow-up. I don't
think we ever want to support `#pragma hdrstop`, and supporting it with this
approach isn't easy: This approach relies on the driver knowing the pch
filename in advance, and `#pragma hdrstop(out.pch)` can set the output
filename, so the driver can't know about it in advance.)
clang-cl now also honors /Fp and puts pch files in the same spot that cl.exe
would put them, but the pch file format is of course incompatible. This has
ramifications on /fallback, so /Yc /Yu aren't passed through to cl.exe in
/fallback builds.
http://reviews.llvm.org/D17695
llvm-svn: 262420
Dehao Chen [Tue, 1 Mar 2016 22:53:02 +0000 (22:53 +0000)]
Perform InstructioinCombiningPass before SampleProfile pass.
Summary: SampleProfile pass needs to be performed after InstructionCombiningPass, which helps eliminate un-inlinable function calls.
Reviewers: davidxl, dnovillo
Subscribers: llvm-commits
Differential Revision: http://reviews.llvm.org/D17742
llvm-svn: 262419
Simon Pilgrim [Tue, 1 Mar 2016 22:38:33 +0000 (22:38 +0000)]
Updated SSE41 builtin tests to more closely match the llvm fast-isel equivalent tests
llvm-svn: 262418
Kostya Serebryany [Tue, 1 Mar 2016 22:33:14 +0000 (22:33 +0000)]
[libFuzzer] deprecate exit_on_first flag
llvm-svn: 262417
David Blaikie [Tue, 1 Mar 2016 22:29:00 +0000 (22:29 +0000)]
llvm-dwp: Add missing copyright notice to llvm-dwp.cpp
Addressing feedback on IRC by Sean Silva.
llvm-svn: 262416
Kostya Serebryany [Tue, 1 Mar 2016 22:19:21 +0000 (22:19 +0000)]
[libFuzzer] add generic signal handlers so that libFuzzer can report at least something if ASan is not handlig the signals for us. Remove abort_on_timeout flag.
llvm-svn: 262415
John McCall [Tue, 1 Mar 2016 22:18:03 +0000 (22:18 +0000)]
Mangle extended qualifiers in the proper order and mangle the
ARC ownership-convention function type modifications.
According to the Itanium ABI, vendor extended qualifiers are
supposed to be mangled in reverse-alphabetical order before
any CVR qualifiers. The ARC function type conventions are
plausibly order-significant (they are associated with the
function type), which permits us to ignore the need to correctly
inter-order them with any other vendor qualifiers on the parameter
and return types.
Implementing these rules correctly is technically an ABI break.
Apple is comfortable with the risk of incompatibility here for
the ARC features, and I believe that address-space qualification
is still uncommon enough to allow us to adopt the conforming
rule without serious risk. Still, targets which make heavy
use of address space qualification may want to revert to the
non-conforming order.
llvm-svn: 262414
Rui Ueyama [Tue, 1 Mar 2016 22:17:09 +0000 (22:17 +0000)]
Fix --help message.
llvm-svn: 262413
Simon Pilgrim [Tue, 1 Mar 2016 22:05:05 +0000 (22:05 +0000)]
[X86][SSE41] Added missing fast-isel intrinsics tests
Match IR generated in clang/test/CodeGen/sse41-builtins.c
llvm-svn: 262412
Colin LeMahieu [Tue, 1 Mar 2016 22:05:03 +0000 (22:05 +0000)]
[NFC] Convert tabs to spaces.
llvm-svn: 262411
Simon Pilgrim [Tue, 1 Mar 2016 21:58:50 +0000 (21:58 +0000)]
[X86][XOP] Regenerated intrinsics tests
llvm-svn: 262410
Daniel Sanders [Tue, 1 Mar 2016 21:57:22 +0000 (21:57 +0000)]
Explicitly select IAS on one more RUN line in new embed-bitcode.c test.
This should fix clang-cmake-mips builder since MIPS does not have IAS enabled
by default (yet).
llvm-svn: 262409
Matthias Braun [Tue, 1 Mar 2016 21:55:35 +0000 (21:55 +0000)]
AArch64: Reenable CompleteModel for A53, A57 and Kryo models
The fixes in r262393 completed them as well.
llvm-svn: 262408
Ted Woodward [Tue, 1 Mar 2016 21:53:26 +0000 (21:53 +0000)]
Fix bug with function resolution when using IR Interpreter
Summary: Recent changes to the expression parser broke function name resolution when using the IR interpreter instead of JIT. This patch changes the IRMemoryMap ivar in InterpreterStackFrame to an IRExecutionUnitSP (which is a subclass), allowing InterpreterStackFrame::ResolveConstantValue() to call FindSymbol() on the name of the Value when it's a FunctionVal. It also changes IRExecutionUnit::FindInSymbols() to call GetFileAddress() on the symball if ResolveCallableAddress() fails and there is no valid Process.
Reviewers: spyffe
Subscribers: lldb-commits
Differential Revision: http://reviews.llvm.org/D17745
llvm-svn: 262407
Simon Pilgrim [Tue, 1 Mar 2016 21:53:12 +0000 (21:53 +0000)]
[X86][AVX2] Regenerated 256-bit vector / 64-bit element permute tests
llvm-svn: 262406
Tim Northover [Tue, 1 Mar 2016 21:45:22 +0000 (21:45 +0000)]
Fix typo. NFC.
llvm-svn: 262405
Michael Kruse [Tue, 1 Mar 2016 21:44:06 +0000 (21:44 +0000)]
Fix non-synthesizable loop exit values.
Polly recognizes affine loops that ScalarEvolution does not, in
particular those with loop conditions that depend on hoisted invariant
loads. Check for SCEVAddRec dependencies on such loops and do not
consider their exit values as synthesizable because SCEVExpander would
generate them as expressions that depend on the original induction
variables. These are not available in generated code.
llvm-svn: 262404
Simon Pilgrim [Tue, 1 Mar 2016 21:43:55 +0000 (21:43 +0000)]
[X86][AVX2] Regenerated horizontal add/sub tests
llvm-svn: 262403
Daniel Jasper [Tue, 1 Mar 2016 21:41:58 +0000 (21:41 +0000)]
[clang-format] Detect constructor initializers preceded by `noexcept`.
Patch by Erik Kessler, thank you.
llvm-svn: 262402
Simon Pilgrim [Tue, 1 Mar 2016 21:38:41 +0000 (21:38 +0000)]
[X86][AVX2] Regenerated intrinsics tests
llvm-svn: 262401
Colin LeMahieu [Tue, 1 Mar 2016 21:37:41 +0000 (21:37 +0000)]
[Hexagon] Modifying r262258 to only be in effect in the hand assembler path, not the integrated assembler.
llvm-svn: 262400
Matthias Braun [Tue, 1 Mar 2016 21:36:12 +0000 (21:36 +0000)]
TableGen: Display helpfull message for incomplete models.
llvm-svn: 262399
Simon Pilgrim [Tue, 1 Mar 2016 21:33:08 +0000 (21:33 +0000)]
[X86][AVX] Fixed triple/arch clash in test case
We were specifying a x64 triple and then overriding with a x86 arch.
llvm-svn: 262398
Matt Arsenault [Tue, 1 Mar 2016 21:31:53 +0000 (21:31 +0000)]
DAGCombiner: Turn truncate of a bitcasted vector to an extract
On AMDGPU where operations i64 operations are often bitcasted to v2i32
and back, this pattern shows up regularly where it breaks some
expected combines on i64, such as load width reducing.
This fixes some test failures in a future commit when i64 loads
are changed to promote.
llvm-svn: 262397
Rafael Espindola [Tue, 1 Mar 2016 21:29:33 +0000 (21:29 +0000)]
Add LLVMBuild for ObjectYAML.
Should fix the DBUILD_SHARED_LIBS bots.
llvm-svn: 262396
David Blaikie [Tue, 1 Mar 2016 21:24:04 +0000 (21:24 +0000)]
Revert "llvm-dwp: Keep ObjectFiles alive until object emission their contents can be referenced directly rather than copied"
Accidentally committed.
This reverts commit r262389.
llvm-svn: 262395
Jacques Pienaar [Tue, 1 Mar 2016 21:21:42 +0000 (21:21 +0000)]
[lanai] Add ELF enum value and relocations.
Add ELF enum value and relocations for Lanai backed.
General Lanai backend discussion on llvm-dev thread "[RFC] Lanai backend" (http://lists.llvm.org/pipermail/llvm-dev/2016-February/095118.html).
Differential Revision: http://reviews.llvm.org/D17008
llvm-svn: 262394
Matthias Braun [Tue, 1 Mar 2016 21:20:31 +0000 (21:20 +0000)]
AArch64: Add missing schedinfo, check completeness for cyclone
This adds some missing generic schedule info definitions, enables
completeness checking for cyclone and fixes a typo uncovered by that.
Differential Revision: http://reviews.llvm.org/D17748
llvm-svn: 262393
Kit Barton [Tue, 1 Mar 2016 20:51:57 +0000 (20:51 +0000)]
[Power9] Implement new vector compare, extract, insert instructions
This change implements the following vector operations:
- Vector Compare Not Equal
- vcmpneb(.) vcmpneh(.) vcmpnew(.)
- vcmpnezb(.) vcmpnezh(.) vcmpnezw(.)
- Vector Extract Unsigned
- vextractub vextractuh vextractuw vextractd
- vextublx vextubrx vextuhlx vextuhrx vextuwlx vextuwrx
- Vector Insert
- vinsertb vinserth vinsertw vinsertd
26 instructions.
Phabricator: http://reviews.llvm.org/D15916
llvm-svn: 262392
Sanjay Patel [Tue, 1 Mar 2016 20:47:02 +0000 (20:47 +0000)]
[x86] use getBitcast()
This isn't quite NFC because some of the SDLocs may change which could
cause scheduling differences. But no regression tests are affected and
there is no functional change intended.
llvm-svn: 262391
David Blaikie [Tue, 1 Mar 2016 20:41:17 +0000 (20:41 +0000)]
Fix some warnings a bit harder/different
This is an alternate fix to 262378 and a fix to a pessimizing-move
warning.
llvm-svn: 262390
David Blaikie [Tue, 1 Mar 2016 20:40:02 +0000 (20:40 +0000)]
llvm-dwp: Keep ObjectFiles alive until object emission their contents can be referenced directly rather than copied
llvm-svn: 262389
Geoff Berry [Tue, 1 Mar 2016 20:28:52 +0000 (20:28 +0000)]
Revert "[AArch64] Fix isLegalAddImmediate() to return true for valid negative values."
Revert r262248 in an attempt to fix the clang-native-aarch64-full
bot and to investigate a performance regression in
SingleSource/Benchmarks/CoyoteBench/huffbench
llvm-svn: 262388
Vasileios Kalintiris [Tue, 1 Mar 2016 20:25:43 +0000 (20:25 +0000)]
Revert "[mips] Promote the result of SETCC nodes to GPR width."
This reverts commit r262316.
It seems that my change breaks an out-of-tree chromium buildbot, so
I'm reverting this in order to investigate the situation further.
llvm-svn: 262387
Kit Barton [Tue, 1 Mar 2016 20:19:43 +0000 (20:19 +0000)]
New file to track implementation status of new POWER9 instructions
llvm-svn: 262386
Ekaterina Romanova [Tue, 1 Mar 2016 20:04:57 +0000 (20:04 +0000)]
This patch adds doxygen comments for the intrinsincs in the header file popcntintrin.h.
The doxygen comments are automatically generated based on Sony's intrinsics documentation.
Differential Revision: http://reviews.llvm.org/D17550
llvm-svn: 262385
Matthias Braun [Tue, 1 Mar 2016 20:03:21 +0000 (20:03 +0000)]
TableGen: Check scheduling models for completeness
TableGen checks at compiletime that for scheduling models with
"CompleteModel = 1" one of the following holds:
- Is marked with the hasNoSchedulingInfo flag
- The instruction is a subclass of Sched
- There are InstRW definitions in the scheduling model
Typical steps necessary to complete a model:
- Ensure all pseudo instructions that are expanded before machine
scheduling (usually everything handled with EmitYYY() functions in
XXXTargetLowering).
- If a CPU does not support some instructions mark the corresponding
resource unsupported: "WriteRes<WriteXXX, []> { let Unsupported = 1; }".
- Add missing scheduling information.
Differential Revision: http://reviews.llvm.org/D17747
llvm-svn: 262384
Matthias Braun [Tue, 1 Mar 2016 20:03:11 +0000 (20:03 +0000)]
TableGen: Add hasNoSchedulingInfo to instructions
This introduces a new flag that indicates that a specific instruction
will never be present when the MachineScheduler runs and therefore needs
no scheduling information.
This is in preparation for an upcoming commit which checks completeness
of a scheduling model when tablegen runs.
Differential Revision: http://reviews.llvm.org/D17728
llvm-svn: 262383
Reid Kleckner [Tue, 1 Mar 2016 19:51:48 +0000 (19:51 +0000)]
Reword a misleading comment discussing landingpads and SEH
SEH doesn't use landingpads anymore.
llvm-svn: 262382
Justin Lebar [Tue, 1 Mar 2016 19:44:22 +0000 (19:44 +0000)]
[NVPTX] Annotate param loads/stores as mayLoad/mayStore.
Summary:
Tablegen was unable to determine that param loads/stores were actually
reading or writing from memory. I think this isn't a problem in
practice for param stores, because those occur in a block right before
we make our call. But param loads don't have to at the very beginning
of a function, so should be annotated as mayLoad so we don't incorrectly
optimize them.
Reviewers: jholewinski
Subscribers: jholewinski, llvm-commits
Differential Revision: http://reviews.llvm.org/D17471
llvm-svn: 262381
Justin Lebar [Tue, 1 Mar 2016 19:44:20 +0000 (19:44 +0000)]
[NVPTX] Remove workaround for tablegen crash in NVPTXInstrInfo.td.
Summary: Looks like this was caused by a typo.
Reviewers: jholewinski
Subscribers: jholewinski, llvm-commits, tra
Differential Revision: http://reviews.llvm.org/D17357
llvm-svn: 262380
David Majnemer [Tue, 1 Mar 2016 19:42:53 +0000 (19:42 +0000)]
[MSVC Compat] Correctly handle finallys nested within finallys
We'd lose track of the parent CodeGenFunction, leading us to get
confused with regard to which function a nested finally belonged to.
Differential Revision: http://reviews.llvm.org/D17752
llvm-svn: 262379
Reid Kleckner [Tue, 1 Mar 2016 19:39:54 +0000 (19:39 +0000)]
Fix -Wnon-virtual-dtor warnings
llvm-svn: 262378
George Rimar [Tue, 1 Mar 2016 19:38:51 +0000 (19:38 +0000)]
[ELF] - handle of command line options incompatible with -r
Patch handles the command line options
incompatible with relocatable output.
Differential revision: http://reviews.llvm.org/D17758
llvm-svn: 262377
Owen Anderson [Tue, 1 Mar 2016 19:35:52 +0000 (19:35 +0000)]
Fix an issue where fast math flags were dropped during scalarization.
Most portions of InstCombine properly propagate fast math flags, but
apparently the vector scalarization section was overlooked.
llvm-svn: 262376
Michael Kruse [Tue, 1 Mar 2016 19:30:54 +0000 (19:30 +0000)]
[SCEVValidator] Remove redundant visit.
SCEVAddRecExpr::getStart() is synonymous to SCEVAddRecExpr::getOperand(0)
which will be visited in the following loop anyway.
llvm-svn: 262375
Sanjoy Das [Tue, 1 Mar 2016 19:28:01 +0000 (19:28 +0000)]
[SCEV] Minor cleanup: rename method, C++11'ify; NFC
llvm-svn: 262374
Justin Lebar [Tue, 1 Mar 2016 19:24:03 +0000 (19:24 +0000)]
[NVPTX] Use different, convergent MIs for convergent calls.
Summary:
Calls sometimes need to be convergent. This is already handled at the
LLVM IR level, but it also needs to be handled at the MI level.
Ideally we'd propagate convergence from instructions, down through the
selection DAG, and into MIs. But this is Hard, and would affect
optimizations in the SDNs -- right now only SDNs with two operands have
any flags at all.
Instead, here's a much simpler hack: Add new opcodes for NVPTX for
convergent calls, and generate these when lowering convergent LLVM
calls.
Reviewers: jholewinski
Subscribers: jholewinski, chandlerc, joker.eph, jhen, tra, llvm-commits
Differential Revision: http://reviews.llvm.org/D17423
llvm-svn: 262373
Justin Lebar [Tue, 1 Mar 2016 19:24:00 +0000 (19:24 +0000)]
[NVPTX] Nix hack used to emit '{' and '}' for NVPTX calls.
Summary: Tablegen understands backslash as an escape char; that's sufficient.
Reviewers: jholewinski
Subscribers: llvm-commits, tra, jholewinski
Differential Revision: http://reviews.llvm.org/D17432
llvm-svn: 262372
Justin Lebar [Tue, 1 Mar 2016 19:23:30 +0000 (19:23 +0000)]
[NVPTX] Reformat NVPTXInstrInfo.td, and add additional comments.
Summary:
Also simplify some of the embedded C++ logic.
No functional changes.
Reviewers: jholewinski
Subscribers: llvm-commits, tra, jholewinski
Differential Revision: http://reviews.llvm.org/D17354
llvm-svn: 262371