platform/upstream/llvm.git
4 years ago[MachineCombine] add a hook for resource length limit
Chen Zheng [Wed, 27 May 2020 02:39:37 +0000 (22:39 -0400)]
[MachineCombine] add a hook for resource length limit

4 years ago[PowerPC] Exploit vabsd on P9
Li Rong Yi [Mon, 1 Jun 2020 02:29:18 +0000 (02:29 +0000)]
[PowerPC] Exploit vabsd on P9

Summary: Exploit vabsd* for for absolute difference of vectors on P9,
for example:
void foo (char *restrict p, char *restrict q, char *restrict t)
{
  for (int i = 0; i < 16; i++)
     t[i] = abs (p[i] - q[i]);
}
this case should be matched to the HW instruction vabsdub.

Reviewed By: steven.zhang

Differential Revision: https://reviews.llvm.org/D80271

4 years ago[gn build] (semi-manually) port a8ca0ec2670
Nico Weber [Mon, 1 Jun 2020 02:04:35 +0000 (22:04 -0400)]
[gn build] (semi-manually) port a8ca0ec2670

4 years agoAMDGPU/GlobalISel: Add stub reg-bank aware combiner pass
Matt Arsenault [Sun, 31 May 2020 23:58:55 +0000 (19:58 -0400)]
AMDGPU/GlobalISel: Add stub reg-bank aware combiner pass

4 years ago[Driver] Simplify Linux::addProfileRTLibs
Fangrui Song [Mon, 1 Jun 2020 00:15:14 +0000 (17:15 -0700)]
[Driver] Simplify Linux::addProfileRTLibs

4 years ago[analyzer] Add dumps to CheckerRegistry
Kirstóf Umann [Sun, 31 May 2020 19:22:35 +0000 (21:22 +0200)]
[analyzer] Add dumps to CheckerRegistry

4 years ago[Driver] NFC: Use Twine temp to replace std::string local
Hubert Tong [Sun, 31 May 2020 20:38:10 +0000 (16:38 -0400)]
[Driver] NFC: Use Twine temp to replace std::string local

This patch replaces a `std::string` local used for a concatentation with
a `Twine` where the string was being passed into call.

4 years ago[test][compiler-rt] Avoid LD_PRELOAD for "outer" dynamic linkers
Hubert Tong [Sun, 31 May 2020 20:33:42 +0000 (16:33 -0400)]
[test][compiler-rt] Avoid LD_PRELOAD for "outer" dynamic linkers

Summary:
This patch moves the setting of `LD_PRELOAD` "inwards" to avoid issues
where the built library needs to be loaded with the dynamic linker that
was configured with the build (and cannot, for example, be loaded by the
dynamic linker associated with the `env` utility).

Reviewed By: vitalybuka, nemanjai, jsji

Differential Revision: https://reviews.llvm.org/D79695

4 years ago[X86] Rewrite how X86PartialReduction finds candidates to consider optimizing.
Craig Topper [Sun, 31 May 2020 19:39:14 +0000 (12:39 -0700)]
[X86] Rewrite how X86PartialReduction finds candidates to consider optimizing.

Previously we walked the users of any vector binop looking for
more binops with the same opcode or phis that eventually ended up
in a reduction. While this is simple it also means visiting the
same nodes many times since we'll do a forward walk for each
BinaryOperator in the chain. It was also far more general than what
we have tests for or expect to see.

This patch replaces the algorithm with a new method that starts at
extract elements looking for a horizontal reduction. Once we find
a reduction we walk through backwards through phis and adds to
collect leaves that we can consider for rewriting.

We only consider single use adds and phis. Except for a special
case if the Add is used by a phi that forms a loop back to the
Add. Including other single use Adds to support unrolled loops.

Ultimately, I want to narrow the Adds, Phis, and final reduction
based on the partial reduction we're doing. I still haven't
figured out exactly what that looks like yet. But restricting
the types of graphs we expect to handle seemed like a good first
step. As does having all the leaves and the reduction at once.

Differential Revision: https://reviews.llvm.org/D79971

4 years ago[X86][AVX] Reduce unary target shuffles width if the upper elements aren't demanded.
Simon Pilgrim [Sun, 31 May 2020 19:19:24 +0000 (20:19 +0100)]
[X86][AVX] Reduce unary target shuffles width if the upper elements aren't demanded.

4 years ago[X86][AVX] combineX86ShufflesRecursively - peekThroughOneUseBitcasts subvector before...
Simon Pilgrim [Sun, 31 May 2020 18:58:15 +0000 (19:58 +0100)]
[X86][AVX] combineX86ShufflesRecursively - peekThroughOneUseBitcasts subvector before widening.

This matches what we do for the full sized vector ops at the start of combineX86ShufflesRecursively, and helps getFauxShuffleMask extract more INSERT_SUBVECTOR patterns.

4 years agoChange some extraneous /// comments to // comments inside methods. NFC.
Chris Lattner [Sun, 31 May 2020 18:43:54 +0000 (11:43 -0700)]
Change some extraneous /// comments to // comments inside methods. NFC.

4 years agoclang-tidy and clang-query wont crash with invalid command line options
Nathan James [Sun, 31 May 2020 16:41:29 +0000 (17:41 +0100)]
clang-tidy and clang-query wont crash with invalid command line options

Motivated by [[ https://bugs.llvm.org/show_bug.cgi?id=46141 | clang-tidy crashed for unknown command line argument. ]]

Reviewed By: aaron.ballman, thakis

Differential Revision: https://reviews.llvm.org/D80879

4 years agoRevert "clang-tidy and clang-query wont crash with invalid command line options"
Nathan James [Sun, 31 May 2020 15:40:09 +0000 (16:40 +0100)]
Revert "clang-tidy and clang-query wont crash with invalid command line options"

This reverts commit f23ddbe3c3ae5f40b99ba272afc3d16b800ba8b9.

4 years ago[utils] update expected strings in tests; NFC
Sanjay Patel [Sun, 31 May 2020 15:06:32 +0000 (11:06 -0400)]
[utils] update expected strings in tests; NFC

The script was changes with:
https://github.com/llvm/llvm-project/commit/bfdc2552664d6f0bb332a9c6a115877020f3c1df

4 years agoclang-tidy and clang-query wont crash with invalid command line options
Nathan James [Sat, 30 May 2020 21:03:50 +0000 (22:03 +0100)]
clang-tidy and clang-query wont crash with invalid command line options

Summary: Motivated by [[ https://bugs.llvm.org/show_bug.cgi?id=46141 | clang-tidy crashed for unknown command line argument. ]]

Reviewers: aaron.ballman, alexfh

Reviewed By: aaron.ballman

Subscribers: cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D80879

4 years agoAArch64/GlobalISel: Fix incorrect ptrmask usage for alignment
Matt Arsenault [Sat, 30 May 2020 20:03:16 +0000 (16:03 -0400)]
AArch64/GlobalISel: Fix incorrect ptrmask usage for alignment

I inverted the mask when I ported to the new form of G_PTRMASK in
8bc03d2168241f7b12265e9cd7e4eb7655709f34.

I don't think this really broke anything, since G_VASTART isn't
handled for types with an alignment higher than the stack alignment.

4 years ago[utils] change update_test_checks.py use of 'TMP' value names
Sanjay Patel [Sun, 31 May 2020 14:46:11 +0000 (10:46 -0400)]
[utils] change update_test_checks.py use of 'TMP' value names

As discussed in PR45951:
https://bugs.llvm.org/show_bug.cgi?id=45951

There's a potential name collision between update_test_checks.py and -instnamer
and/or manually-generated IR test files because all of them try to use the
variable name that should never be used: "tmp".

This patch proposes to reduce the odds of collision and adds a warning if we
detect the problem. This will cause regression test churn when regenerating
CHECK lines on existing files.

Differential Revision: https://reviews.llvm.org/D80584

4 years ago[PhaseOrdering] add test for hoisting/CSE (PR46115); NFC
Sanjay Patel [Sun, 31 May 2020 14:10:28 +0000 (10:10 -0400)]
[PhaseOrdering] add test for hoisting/CSE (PR46115); NFC

4 years ago[X86][AVX] Add SimplifyMultipleUseDemandedBits VBROADCAST handling to SimplifyDemande...
Simon Pilgrim [Sun, 31 May 2020 13:20:00 +0000 (14:20 +0100)]
[X86][AVX] Add SimplifyMultipleUseDemandedBits VBROADCAST handling to SimplifyDemandedVectorElts.

As suggested on D79987.

4 years ago[VectorCombine] add tests for scalarizing binop-with-constant; NFC
Sanjay Patel [Sun, 31 May 2020 13:05:48 +0000 (09:05 -0400)]
[VectorCombine] add tests for scalarizing binop-with-constant; NFC

Goes with proposal in D80885.

This is adapted from the InstCombine tests that were added for
D50992

But these should be adjusted further to provide more interesting
scenarios for x86-specific codegen. Eg, vector types/sizes will
have different costs depending on ISA attributes.

We also need to add tests that include a load of the scalar
variable and add tests that include extra uses of the insert
to further exercise the cost model.

4 years ago[X86] getFauxShuffleMask/getTargetShuffleInputs - make SelectionDAG const (PR45974).
Simon Pilgrim [Sun, 31 May 2020 12:50:40 +0000 (13:50 +0100)]
[X86] getFauxShuffleMask/getTargetShuffleInputs - make SelectionDAG const (PR45974).

Try to prevent future node creation issues (as detailed in PR45974) by making the SelectionDAG reference const, so it can still be used for analysis, but not node creation.

4 years ago[X86][AVX] Add test case described in D79987
Simon Pilgrim [Sun, 31 May 2020 12:46:46 +0000 (13:46 +0100)]
[X86][AVX] Add test case described in D79987

4 years ago[PhaseOrdering] add scalarization test for PR42174; NFC
Sanjay Patel [Sun, 31 May 2020 12:41:09 +0000 (08:41 -0400)]
[PhaseOrdering] add scalarization test for PR42174; NFC

Motivating test for vector-combine enhancement in D80885.
Make sure that vectorization and canonicalization are
working together as expected.

4 years ago[X86][AVX] getFauxShuffleMask - don't widen shuffle inputs from INSERT_SUBVECTOR...
Simon Pilgrim [Sun, 31 May 2020 12:19:18 +0000 (13:19 +0100)]
[X86][AVX] getFauxShuffleMask - don't widen shuffle inputs from INSERT_SUBVECTOR(X,SHUFFLE(Y,Z))

Don't create nodes on the fly when decoding INSERT_SUBVECTOR as faux shuffles.

4 years ago[X86][AVX] Pad small shuffle inputs in combineX86ShufflesRecursively
Simon Pilgrim [Sun, 31 May 2020 10:43:27 +0000 (11:43 +0100)]
[X86][AVX] Pad small shuffle inputs in combineX86ShufflesRecursively

As detailed on PR45974 and D79987, getFauxShuffleMask is creating nodes on the fly to create shuffles with inputs the same size as the result, causing problems for hasOneUse() checks in later simplification stages.

Currently only combineX86ShufflesRecursively benefits from these widened inputs so I've begun moving the functionality there, and out of getFauxShuffleMask. This allows us to remove the widening from VBROADCAST and *EXTEND* faux shuffle cases.

This just leaves the INSERT_SUBVECTOR case in getFauxShuffleMask still creating nodes, which will require more extensive refactoring.

4 years ago[ScheduleDAG] Avoid unnecessary recomputation of topological order.
Florian Hahn [Sun, 31 May 2020 10:04:35 +0000 (11:04 +0100)]
[ScheduleDAG] Avoid unnecessary recomputation of topological order.

In some cases ScheduleDAGRRList has to add new nodes to resolve problems
with interfering physical registers. When new nodes are added, it
completely re-computes the topological order, which can take a long
time, but is unnecessary. We only add nodes one by one, and initially
they do not have any predecessors. So we can just insert them at the end
of the vector. Later we add predecessors, but the helper function
properly updates the topological order much more efficiently. With this
change, the compile time for the program below drops from 300s to 30s on
my machine.

    define i11129 @test1() {
      %L1 = load i11129, i11129* undef
      %B30 = ashr i11129 %L1, %L1
      store i11129 %B30, i11129* undef
      ret i11129 %L1
    }

This should be generally beneficial, as we can skip a large amount of
work. Theoretically there are some scenarios where we might not safe
much, e.g. when we add a dependency between the first and last node.
Then we would have to shift all nodes. But we still do not have to spend
the time re-computing the initial order.

Reviewers: MatzeB, atrick, efriedma, niravd, paquette

Reviewed By: paquette

Differential Revision: https://reviews.llvm.org/D59722

4 years agoRevert "[NFC][PowerPC] Add a new case to test phi-node-elimination pass"
Kang Zhang [Sun, 31 May 2020 09:24:21 +0000 (09:24 +0000)]
Revert "[NFC][PowerPC] Add a new case to test phi-node-elimination pass"
This case wll be failed on some machines which enable expensive-checks.

This reverts commit af3abbf7bd2213003a133c361c212ac6efb1bd2b.

4 years ago[NFC][PowerPC] Add a new case to test phi-node-elimination pass
Kang Zhang [Sun, 31 May 2020 08:05:27 +0000 (08:05 +0000)]
[NFC][PowerPC] Add a new case to test phi-node-elimination pass

4 years ago[AMDGPU] Propagate fast-math flags when lowering FSIN and FCOS
Jay Foad [Fri, 29 May 2020 16:02:13 +0000 (17:02 +0100)]
[AMDGPU] Propagate fast-math flags when lowering FSIN and FCOS

Differential Revision: https://reviews.llvm.org/D80813

4 years ago[AMDGPU] Precommit tests for D80813
Jay Foad [Fri, 29 May 2020 15:52:43 +0000 (16:52 +0100)]
[AMDGPU] Precommit tests for D80813

4 years agoAMDGPU: Add setTruncStoreAction for vector i64 types made legal recently
Changpeng Fang [Sun, 31 May 2020 03:45:27 +0000 (20:45 -0700)]
AMDGPU: Add setTruncStoreAction for vector i64 types made legal recently

Reviewers:
  rampitec, arsenm

Differential Revision:
  https://reviews.llvm.org/D80853

4 years ago[X86] Remove unneeded bitconverts from isel patterns. NFC
Craig Topper [Sun, 31 May 2020 03:24:51 +0000 (20:24 -0700)]
[X86] Remove unneeded bitconverts from isel patterns. NFC

The types already match so TableGen is removing the bitconvert.

4 years ago[X86] Add DAG combine to turn (v2i64 (scalar_to_vector (i64 (bitconvert (mmx))))...
Craig Topper [Sun, 31 May 2020 01:51:20 +0000 (18:51 -0700)]
[X86] Add DAG combine to turn (v2i64 (scalar_to_vector (i64 (bitconvert (mmx))))) to MOVQ2DQ. Remove unneeded isel patterns.

We already had a DAG combine for (mmx (bitconvert (i64 (extractelement v2i64))))
to MOVDQ2Q.

Remove patterns for MMX_MOVQ2DQrr/MMX_MOVDQ2Qrr that use
scalar_to_vector/extractelement involving i64 scalar type with
v2i64 and x86mmx.

4 years ago[DAGCombiner] Move debug message and statistic update into CommitTargetLoweringOpt.
Craig Topper [Sun, 31 May 2020 01:40:17 +0000 (18:40 -0700)]
[DAGCombiner] Move debug message and statistic update into CommitTargetLoweringOpt.

This code was repeated in two callers of CommitTargetLoweringOpt.
But CommitTargetLoweringOpt is also called from TargetLowering.
We should print a message for those calls to. So sink the
repeated code into CommitTargetLoweringOpt to catch those calls.

4 years ago[X86] Teach computeKnownBitsForTargetNode that the upper half of X86ISD::MOVQ2DQ...
Craig Topper [Sun, 31 May 2020 00:04:56 +0000 (17:04 -0700)]
[X86] Teach computeKnownBitsForTargetNode that the upper half of X86ISD::MOVQ2DQ is all zero.

4 years ago[X86] Fix a place where we created MOVQ2DQ with a DstVT other than v2i64.
Craig Topper [Sun, 31 May 2020 00:03:53 +0000 (17:03 -0700)]
[X86] Fix a place where we created MOVQ2DQ with a DstVT other than v2i64.

The type profile and isel pattern have this type declared as
being MVT::v2i64. But isel skips the explicit type check due to
the type profile.

4 years ago[X86] Autogenerate complete checks. NFC
Craig Topper [Sat, 30 May 2020 23:27:26 +0000 (16:27 -0700)]
[X86] Autogenerate complete checks. NFC

4 years ago[X86] Move MMX_SET0 pattern into the instruction definition. NFC
Craig Topper [Sat, 30 May 2020 23:12:44 +0000 (16:12 -0700)]
[X86] Move MMX_SET0 pattern into the instruction definition. NFC

4 years ago[ELF][docs] Update supported targets
Fangrui Song [Sun, 31 May 2020 02:33:50 +0000 (19:33 -0700)]
[ELF][docs] Update supported targets

PowerPC, PowerPC64 and x86-32 have production quality.
Mention Hexagon, RISC-V and SPARC V9.

4 years ago[llvm-objdump] Delete unneeeded namespace llvm {}
Fangrui Song [Sun, 31 May 2020 01:03:43 +0000 (18:03 -0700)]
[llvm-objdump] Delete unneeeded namespace llvm {}

4 years ago[llvm-objdump] Move llvm:: to llvm::objdump:: and qualifying definitions with objdump::
Fangrui Song [Sun, 31 May 2020 01:00:14 +0000 (18:00 -0700)]
[llvm-objdump] Move llvm:: to llvm::objdump:: and qualifying definitions with objdump::

Or adding `static`.

Qualifying definitions with `objdump::` comforms to the coding standards
https://llvm.org/docs/CodingStandards.html#use-namespace-qualifiers-to-implement-previously-declared-functions

4 years ago[llvm-objdump] Simplify reportError() and prepend outs().flush()
Fangrui Song [Sun, 31 May 2020 00:25:18 +0000 (17:25 -0700)]
[llvm-objdump] Simplify reportError() and prepend outs().flush()

As noticed by dblaikie.

I don't know what code paths using reportError can cause stdout output
to be interleaved with stderr, so no test is added now.

Also drop an unneeded use of errs().fflush() in reportWarning().
I requested this in D64165.

4 years ago[X86] Add pseudo instructions to use MULX with a single destination when the low...
Craig Topper [Sat, 30 May 2020 22:51:56 +0000 (15:51 -0700)]
[X86] Add pseudo instructions to use MULX with a single destination when the low result isn't used.

The instruction is defined to only produce high result if both
destinations are the same. We can exploit this to avoid
unnecessarily clobbering a register.

In order to hide this from register allocation we use a pseudo
instruction and expand the result during MCInst creation.

Differential Revision: https://reviews.llvm.org/D80500

4 years ago[Driver] Fix BooleanFFlag identifiers to use 'f' 'fno_' prefixes instead of suffixes
Fangrui Song [Sat, 30 May 2020 22:41:09 +0000 (15:41 -0700)]
[Driver] Fix BooleanFFlag identifiers to use 'f' 'fno_' prefixes instead of suffixes

4 years agoRemove some non-determinism from the `Darwin/duplicate_os_log_reports.cpp` test.
Dan Liew [Sat, 30 May 2020 22:17:41 +0000 (15:17 -0700)]
Remove some non-determinism from the `Darwin/duplicate_os_log_reports.cpp` test.

The test read from an uninitialized buffer which could cause the output
to be unpredictable.

The test is currently disabled so this won't actually change anything
until the test is re-enabled.

4 years ago[X86] Minor cleanups to addShuffleComments in X86MCInstPrinter.cpp. NFCI
Craig Topper [Sat, 30 May 2020 19:53:56 +0000 (12:53 -0700)]
[X86] Minor cleanups to addShuffleComments in X86MCInstPrinter.cpp. NFCI

-Replace some ifs that should be impossible with asserts.
-Use X86::AddrDisp and X86::AddrNumOperands to make code more readable
-Use X86II::isKMasked/isKMergeMasked to do some operand skipping to remove or simplify switches

4 years ago[X86] Factor constant pool comment printing out of the switch in X86AsmPrinter::emitI...
Craig Topper [Sat, 30 May 2020 07:31:21 +0000 (00:31 -0700)]
[X86] Factor constant pool comment printing out of the switch in X86AsmPrinter::emitInstruction. NFC

Pull the verbose asm check out of the cases and move it up to
the call of the new function.

4 years ago[LoopUnroll] Add a test case for rG7873376bb36b.
Whitney Tsang [Sat, 30 May 2020 20:24:10 +0000 (20:24 +0000)]
[LoopUnroll] Add a test case for rG7873376bb36b.

rG7873376bb36b fixes a build failure for allyesconfig.

The problem happened when the single exiting block doesn't dominate the
loop latch, then the immediate dominator of the exit block should not be
the exiting block after unrolling. As the exiting block of
different unrolled iteration can branch to the exit block, and the ith
exiting block doesn't dominate (i+1)th exiting block, the immediate
dominator of the exit block should not the nearest common dominator of
the exiting block and the loop latch of the same iteration.

Differential Revision: https://reviews.llvm.org/D80477

4 years ago[Tests] Convert last statepoint lowering tests to bundle format
Philip Reames [Sat, 30 May 2020 19:58:41 +0000 (12:58 -0700)]
[Tests] Convert last statepoint lowering tests to bundle format

4 years ago[clang-tidy] RenamerClangTidyChecks ignore builtin and command line macros
Nathan James [Wed, 27 May 2020 14:01:17 +0000 (15:01 +0100)]
[clang-tidy] RenamerClangTidyChecks ignore builtin and command line macros

Summary: Fixes [[ https://bugs.llvm.org/show_bug.cgi?id=42635 | readability-identifier-naming option MacroDefinitionCase should ignore macros passed as parameters. ]]

Reviewers: aaron.ballman, alexfh, gribozavr2, hokein

Reviewed By: aaron.ballman

Subscribers: xazax.hun, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D80631

4 years ago[LoopUnroll] Fix build failure for allyesconfig.
Whitney Tsang [Sat, 30 May 2020 18:20:23 +0000 (18:20 +0000)]
[LoopUnroll] Fix build failure for allyesconfig.

Differential Revision: https://reviews.llvm.org/D80477.

4 years ago[DSE] Remove noop stores in MSSA.
zoecarver [Sat, 30 May 2020 16:56:04 +0000 (09:56 -0700)]
[DSE] Remove noop stores in MSSA.

Adds a simple fast-path check for the pattern:
v = load ptr
store v to ptr

I took the tests from the bugzilla post, I can add more if needed (but I think these should be sufficent).

Refs: https://bugs.llvm.org/show_bug.cgi?id=45795

Differential Revision: https://reviews.llvm.org/D79391

4 years ago[lldb] Pass -fPIC flag even when DYLIB_ONLY is set
Raphael Isemann [Fri, 29 May 2020 15:08:28 +0000 (17:08 +0200)]
[lldb] Pass -fPIC flag even when DYLIB_ONLY is set

Summary:
It seems that when we rewrite a few rules to only build a dylib (i.e., when DYLIB_ONLY is set),
the rule for setting the CFLAGS for the dylib's object file compilation will no longer work. From what I can
see this is because in DYLIB_ONLY mode we pretend to compile the main executable so
the DYLIB_OBJECTS scope is actually never used.

This patch makes `-fPIC` unstopped if DYLIB_ONLY is set so that -fPIC actually ends up in the
CFLAGS for the dylib object file compilation.

The test for this is D80798 which only compiles on Linux with this patch.

Reviewers: friss, labath

Reviewed By: friss

Subscribers: JDevlieghere

Differential Revision: https://reviews.llvm.org/D80789

4 years ago[BasicAA] Use known lower bounds for index values for size based check.
Florian Hahn [Sat, 30 May 2020 15:20:42 +0000 (16:20 +0100)]
[BasicAA] Use known lower bounds for index values for size based check.

Currently, BasicAA does not exploit information about value ranges of
indexes. For example, consider the 2 pointers %a = %base and
%b = %base + %stride below, assuming they are used to access 4 elements.

If we know that %stride >= 4, we know the accesses do not alias. If
%stride is a constant, BasicAA currently gets that. But if the >= 4
constraint is encoded using an assume, it misses the NoAlias.

This patch extends DecomposedGEP to include an additional MinOtherOffset
field, which tracks the constant offset similar to the existing
OtherOffset, which the difference that it also includes non-negative
lower bounds on the range of the index value. When checking if the
distance between 2 accesses exceeds the access size, we can use this
improved bound.

For now this is limited to using non-negative lower bounds for indices,
as this conveniently skips cases where we do not have a useful lower
bound (because it is not constrained). We potential miss out in cases
where the lower bound is constrained but negative, but that can be
exploited in the future.

Reviewers: sanjoy, hfinkel, reames, asbirlea

Reviewed By: asbirlea

Differential Revision: https://reviews.llvm.org/D76194

4 years agoSafeStackColoring.h - reduce Instructions.h include to forward declaration. NFC.
Simon Pilgrim [Sat, 30 May 2020 13:38:02 +0000 (14:38 +0100)]
SafeStackColoring.h - reduce Instructions.h include to forward declaration. NFC.

SafeStackColoring.cpp - remove includes directly defined in SafeStackColoring.h header. NFC.

4 years agoCriticalAntiDepBreaker.cpp - remove includes directly defined in CriticalAntiDepBreak...
Simon Pilgrim [Sat, 30 May 2020 13:32:36 +0000 (14:32 +0100)]
CriticalAntiDepBreaker.cpp - remove includes directly defined in CriticalAntiDepBreaker.h header. NFC.

4 years agoSafeStackLayout.cpp - remove includes directly defined in SafeStackLayout.h module...
Simon Pilgrim [Sat, 30 May 2020 13:30:19 +0000 (14:30 +0100)]
SafeStackLayout.cpp - remove includes directly defined in SafeStackLayout.h module header. NFC.

4 years ago[TargetLowering] SimplifyDemandedBits - remove shift amount clamps from getValidShift...
Simon Pilgrim [Sat, 30 May 2020 13:04:55 +0000 (14:04 +0100)]
[TargetLowering] SimplifyDemandedBits - remove shift amount clamps from getValidShiftAmountConstant calls. NFC.

getValidShiftAmountConstant only returns a value if the shift amount is in range, so we don't need to check it again.

4 years ago[SelectionDAG] ComputeNumSignBits - use Valid Min/Max shift amount helpers directly...
Simon Pilgrim [Sat, 30 May 2020 13:02:14 +0000 (14:02 +0100)]
[SelectionDAG] ComputeNumSignBits - use Valid Min/Max shift amount helpers directly. NFCI.

We are calling getValidShiftAmountConstant first followed by getValidMinimumShiftAmountConstant/getValidMaximumShiftAmountConstant if that failed. But both are used in the same way in ComputeNumSignBits and the Min/Max variants call getValidShiftAmountConstant internally anyhow.

4 years agoPackedVersion.h - reduce includes to forward declarations. NFC.
Simon Pilgrim [Sat, 30 May 2020 12:17:19 +0000 (13:17 +0100)]
PackedVersion.h - reduce includes to forward declarations. NFC.

4 years ago[clang-format] [PR46130] When editing a file with unbalance {} the namespace comment...
mydeveloperday [Sat, 30 May 2020 12:00:35 +0000 (13:00 +0100)]
[clang-format] [PR46130] When editing a file with unbalance {} the namespace comment fixer can incorrectly comment the wrong closing brace

Summary:
https://bugs.llvm.org/show_bug.cgi?id=46130   from Twitter https://twitter.com/ikautak/status/1265998988232159232

I have seen this myself many times.. if you have format on save and you work in an editor where you are constantly saving (:w muscle memory)

If you are in the middle of editing and somehow you've missed a { or } in your code, somewhere, often way below where you are at the bottom of your file the namespace comment fixer will have put the namespace on the previous closing brace.

This leads to you having to fix up the bottom of the file.

This revision prevents that happening by performing an initial pass of the tokens and simply counting the number of `{` and `}`  and ensuring they balance.

If they don't balance we don't do any namespace fixing as it will likely be unstable and incorrect.

Reviewed By: curdeius

Subscribers: cfe-commits

Tags: #clang, #clang-format

Differential Revision: https://reviews.llvm.org/D80830

4 years agoTBEHandler.h - remove unnecessary VersionTuple forward declaration. NFC.
Simon Pilgrim [Sat, 30 May 2020 12:07:57 +0000 (13:07 +0100)]
TBEHandler.h - remove unnecessary VersionTuple forward declaration. NFC.

We already have to include VersionTuple.h

4 years agoArchitectureSet.h - add missing <tuple> include.
Simon Pilgrim [Sat, 30 May 2020 11:48:46 +0000 (12:48 +0100)]
ArchitectureSet.h - add missing <tuple> include.

MSVC seems to implicitly include this from <utility> but other toolchains don't

4 years agoArchitectureSet.h - reduce raw_ostream.h include to forward declaration. NFC.
Simon Pilgrim [Sat, 30 May 2020 11:36:16 +0000 (12:36 +0100)]
ArchitectureSet.h - reduce raw_ostream.h include to forward declaration. NFC.

Move raw_ostream.h include to ArchitectureSet.cpp.

4 years agoArchitecture.h - reduce includes to forward declarations. NFC.
Simon Pilgrim [Sat, 30 May 2020 11:17:13 +0000 (12:17 +0100)]
Architecture.h - reduce includes to forward declarations. NFC.

Move includes to Architecture.cpp.

4 years agoIPDBRawSymbol.h - remove already declared forward declarations. NFC.
Simon Pilgrim [Sat, 30 May 2020 11:00:17 +0000 (12:00 +0100)]
IPDBRawSymbol.h - remove already declared forward declarations. NFC.

PDBTypes.h holds most PDB forward declarations already, move IPDBSession in there as well.

4 years agoIPDBRawSymbol.h - reduce StringRef.h include to forward declaration. NFC.
Simon Pilgrim [Sat, 30 May 2020 10:29:17 +0000 (11:29 +0100)]
IPDBRawSymbol.h - reduce StringRef.h include to forward declaration. NFC.

4 years ago[libcxx testing] Stop using arbitrary timeouts in one test
David Zarzycki [Sat, 30 May 2020 10:06:43 +0000 (06:06 -0400)]
[libcxx testing] Stop using arbitrary timeouts in one test

On a busy and/or slow system, 100ms might not be long enough. Instead,
we now use atomic variables to communicate between threads.

4 years ago[OpenMP][SYCL] Do not crash on attempt to diagnose unsupported type use
Mariya Podchishchaeva [Sat, 30 May 2020 09:27:47 +0000 (12:27 +0300)]
[OpenMP][SYCL] Do not crash on attempt to diagnose unsupported type use

Summary:
Do not ask size of type if it is dependent. ASTContext doesn't seem expecting
this.

Reviewers: jdoerfert, ABataev, bader

Reviewed By: ABataev

Subscribers: yaxunl, guansong, ebevhan, Anastasia, sstefan1, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D80829

4 years ago[SelectionDAG] Remove repeated getOperand() call. NFC.
Simon Pilgrim [Sat, 30 May 2020 09:21:36 +0000 (10:21 +0100)]
[SelectionDAG] Remove repeated getOperand() call. NFC.

4 years ago[lldb/CMake] Fix typo that prevented regenerating the bindings
Jonas Devlieghere [Sat, 30 May 2020 06:50:26 +0000 (23:50 -0700)]
[lldb/CMake] Fix typo that prevented regenerating the bindings

A typo in the GLOB patter prevented us from detecting changes in the
interface files and trigger SWIG to regenerate the bindings.

4 years ago[X86] Autogenerate complete checks. NFC
Craig Topper [Sat, 30 May 2020 05:46:57 +0000 (22:46 -0700)]
[X86] Autogenerate complete checks. NFC

4 years ago[test] Regenerate checks in aarch64_win64cc_vararg.ll with update_llc_test_checks...
Martin Storsjö [Fri, 29 May 2020 22:20:14 +0000 (01:20 +0300)]
[test] Regenerate checks in aarch64_win64cc_vararg.ll with update_llc_test_checks.py. NFC.

4 years ago[AArch64] Treat x18 as callee-saved in functions with windows calling convention...
Martin Storsjö [Tue, 30 Apr 2019 08:50:09 +0000 (11:50 +0300)]
[AArch64] Treat x18 as callee-saved in functions with windows calling convention on non-windows OSes

Treat it as callee-saved, and always back it up. When windows code calls
entry points in unix code, marked with the windows calling convention,
that unix code can call other functions that isn't compiled with
-ffixed-x18 which may clobber x18 freely. By backing it up and restoring
it on return, we preserve the register across the function call,
fulfilling this part of the windows calling convention on another OS.

This isn't enough for making sure that x18 is preseved when non-windows
code does a callback to windows code, but is a clear improvement over
the current status quo. Additionally, wine is nowadays building many
modules as PE DLLs, which avoids the callback issue altogether for those
DLLs.

Differential Revision: https://reviews.llvm.org/D61892

4 years ago[lldb/Test] use GetLoadAddress from scripted thread plan
Jonas Devlieghere [Sat, 30 May 2020 06:01:09 +0000 (23:01 -0700)]
[lldb/Test] use GetLoadAddress from scripted thread plan

Commit 0800529fe605 adds a runtime error which triggers when using
SBAddress properties that use the current process/target from a
non-interactive session. TestThreadPlanCommands.py was doing exactly
this and this patch fixes that by use GetLoadAddress instead.

4 years ago[DWARF5] Added support for emission of .debug_macro.dwo section
Sourabh Singh Tomar [Sat, 30 May 2020 05:41:09 +0000 (11:11 +0530)]
[DWARF5] Added support for emission of .debug_macro.dwo section

This patch adds support for emission of following DWARFv5 macro
forms in .debug_macro.dwo section:

- DW_MACRO_start_file
- DW_MACRO_end_file
- DW_MACRO_define_strx
- DW_MACRO_undef_strx

Reviewed By: dblaikie

Differential Revision: https://reviews.llvm.org/D78866

4 years ago[lldb/Bindings] Raise exception when using properties that rely on lldb.target
Jonas Devlieghere [Sat, 30 May 2020 05:10:05 +0000 (22:10 -0700)]
[lldb/Bindings] Raise exception when using properties that rely on lldb.target

Several SBAddress properties use the lldb.target or lldb.process
convenience variables which are only set under the interactive script
interpreter. Unfortunately, users have been using these properties in
Python script and commands. This patch raises a Python exception to
force users to use GetLoadAddress instead.

Differential revision: https://reviews.llvm.org/D80848

4 years agoUse .empty() instead of .size() == 0 (NFC)
Mehdi Amini [Sat, 30 May 2020 03:36:22 +0000 (03:36 +0000)]
Use .empty() instead of .size() == 0 (NFC)

Cleanup / Fix a clang-tidy warning

4 years agoNFC: Simplify O1 pass pipeline construction.
Eric Christopher [Tue, 5 May 2020 01:33:49 +0000 (18:33 -0700)]
NFC: Simplify O1 pass pipeline construction.

Pull O1 pass pipeline out into a separate function and simplify
buildFunctionSimplificationPipeline accordingly.

4 years agoFix full unrolling with new pass manager.
Eric Christopher [Mon, 4 May 2020 20:48:56 +0000 (13:48 -0700)]
Fix full unrolling with new pass manager.

Last we looked at this and couldn't come up with a reason to change
it, but with a pragma for full loop unrolling we bypass every other
loop unroll and then fail to fully unroll a loop when the pragma is set.

Move the OnlyWhenForced out of the check and into the initialization
of the full unroll pass in the new pass manager. This doesn't show up
with the old pass manager.

Add a new option to opt so that we can turn off loop unrolling
manually since this is a difference between clang and opt.

Tested with check-clang and check-llvm.

4 years ago[ValueLattice] Fix uninitialized-value after D79036
Fangrui Song [Sat, 30 May 2020 02:48:33 +0000 (19:48 -0700)]
[ValueLattice] Fix uninitialized-value after D79036

Many check-clang-codegen tests failed.

4 years ago[AMDGPU] Remove assertion on S1024 SGPR to VGPR spill
Carl Ritson [Sat, 30 May 2020 02:15:39 +0000 (11:15 +0900)]
[AMDGPU] Remove assertion on S1024 SGPR to VGPR spill

Summary:
Replace an assertion that blocks S1024 SGPR to VGPR spill.
The assertion pre-dates S1024 and is not wave size dependent.

Reviewers: arsenm, sameerds, rampitec

Reviewed By: arsenm

Subscribers: qcolombet, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D80783

4 years agoAMDGPU: Optimize s_setreg_b32 to s_denorm_mode/s_round_mode
Matt Arsenault [Fri, 29 May 2020 00:55:45 +0000 (20:55 -0400)]
AMDGPU: Optimize s_setreg_b32 to s_denorm_mode/s_round_mode

This is a custom inserter because it was less work than teaching
tablegen a way to indicate that it is sometimes OK to have a no side
effect instruction in the output of a side effecting pattern.

The asm is needed to look like a read of the mode register to prevent
it from being deleted. However, there seems to be a bug where the mode
register def instructions are moved across the asm sideeffect by the
post-RA scheduler.

Another oddity is the immediate is formatted differently between
s_denorm_mode and s_round_mode.

4 years agoAMDGPU: Add new baseline tests for setreg handling
Matt Arsenault [Fri, 29 May 2020 20:11:58 +0000 (16:11 -0400)]
AMDGPU: Add new baseline tests for setreg handling

Most of these should be identical and use a common prefix, but
update_llc_test_checks is failing to generate shared checks for some
reason.

4 years agoAMDGPU: Move MIMG MMO check to verifier
Matt Arsenault [Fri, 29 May 2020 00:38:16 +0000 (20:38 -0400)]
AMDGPU: Move MIMG MMO check to verifier

4 years ago[SVE] Eliminate calls to default-false VectorType::get() from AMDGPU
Christopher Tetreault [Sat, 30 May 2020 00:44:51 +0000 (17:44 -0700)]
[SVE] Eliminate calls to default-false VectorType::get() from AMDGPU

Reviewers: efriedma, david-arm, fpetrogalli, arsenm

Reviewed By: david-arm

Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, tschuett, hiraditya, rkruppe, psnobl, kerbowa, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D80328

4 years ago[jitlink] R_X86_64_PC32 support for the elf x86 jitlinker
Jared Wyles [Fri, 29 May 2020 23:11:42 +0000 (09:11 +1000)]
[jitlink] R_X86_64_PC32 support for the elf x86 jitlinker

Summary:

Adding in our first relocation type, and all the required plumbing to support the rest in following patches

Differential Revision: https://reviews.llvm.org/D80613

Reviewer: lhames

4 years ago[SVE] Eliminate calls to default-false VectorType::get() from Linker
Christopher Tetreault [Fri, 29 May 2020 23:17:40 +0000 (16:17 -0700)]
[SVE] Eliminate calls to default-false VectorType::get() from Linker

Reviewers: efriedma, tejohnson, sdesmalen, c-rhodes

Reviewed By: efriedma

Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D80326

4 years ago[gn build] Port cf6cc662eee
LLVM GN Syncbot [Fri, 29 May 2020 23:53:23 +0000 (23:53 +0000)]
[gn build] Port cf6cc662eee

4 years ago[gn build] Port 34cfed24ebd
LLVM GN Syncbot [Fri, 29 May 2020 23:53:22 +0000 (23:53 +0000)]
[gn build] Port 34cfed24ebd

4 years ago[AMDGPU] Add loaded code object path URI definition to AMDGPUUsage
Tony [Fri, 29 May 2020 23:03:50 +0000 (19:03 -0400)]
[AMDGPU] Add loaded code object path URI definition to AMDGPUUsage

Differential Revision: https://reviews.llvm.org/D80407

4 years ago[gn build] (manually) port 0e265e31578
Nico Weber [Fri, 29 May 2020 23:49:11 +0000 (19:49 -0400)]
[gn build] (manually) port 0e265e31578

4 years ago[flang][NFC] Remove link-time dependency of Evaluate on Semantics
Tim Keith [Fri, 29 May 2020 23:39:13 +0000 (16:39 -0700)]
[flang][NFC] Remove link-time dependency of Evaluate on Semantics

Summary:
Some Symbol-related functions used in Evaluate were moved to
Evaluate/tools.h. This includes changing some member functions that were
replaced by non-member functions `IsDummy`, `GetUsedModule`, and
`CountLenParameters`.

Some member functions were made inline in `Scope`, `Symbol`,
`ArraySpec`, and `DeclTypeSpec`. The definitions were preceded by a
comment explaining why they are inline.

`IsConstantShape` was expanded inline in `IsDescriptor` because it isn't
used anywhere else

After this change, at least when compiling with clang on macos,
`libFortranEvaluate.a` has no undefined symbols that are satisfied by
`libFortranSemantics.a`.

Reviewers: klausler, PeteSteinfeld, sscalpone, jdoerfert, DavidTruby

Reviewed By: PeteSteinfeld

Subscribers: llvm-commits

Tags: #flang, #llvm

Differential Revision: https://reviews.llvm.org/D80762

4 years ago[mlir][spirv] Clean up coop matrix assembly declaration.
Thomas Raoux [Fri, 29 May 2020 23:34:56 +0000 (16:34 -0700)]
[mlir][spirv] Clean up coop matrix assembly declaration.

Address code review feedback and use declarative assembly format.

Differential Revision: https://reviews.llvm.org/D80687

4 years ago[libc++] Fix issues with the triviality of std::array
Louis Dionne [Fri, 29 May 2020 23:32:55 +0000 (16:32 -0700)]
[libc++] Fix issues with the triviality of std::array

The Standard is currently unimplementable. We have to pick between:

1. Not implementing constexpr support properly in std::array<T, 0>
2. Making std::array<T, 0> non-trivial even when T is trivial
3. Returning nullptr from std::array<T, 0>::begin()

Libc++ initially picked (1). In 77b9abfc8e89, we started implementing constexpr properly, but lost the guarantee of triviality. Since it seems like both (1) and (2) are really important, it seems like (3) is the only viable option for libc++, after all. This is also what other implementations are doing.

This patch moves libc++ from (1) to (3).

It also:
- Improves the test coverage for the various ways of initializing std::array
- Adds tests for the triviality of std::array
- Adds tests for the aggregate-ness of std::array

Reviewed By: #libc, miscco, EricWF, zoecarver

Differential Revision: https://reviews.llvm.org/D80821

4 years ago[diagtool] Install diagtool when LLVM_INSTALL_TOOLCHAIN_ONLY is ON.
Volodymyr Sapsai [Fri, 29 May 2020 00:06:33 +0000 (17:06 -0700)]
[diagtool] Install diagtool when LLVM_INSTALL_TOOLCHAIN_ONLY is ON.

Not sure about other platforms but `install-xcode-toolchain` was already
including diagtool in the toolchain. This change makes it possible to
install diagtool during Apple's 2-stage build.

Instead of dropping `if (NOT LLVM_INSTALL_TOOLCHAIN_ONLY)` conditional
I've switched to `add_clang_tool` which handles install targets. Also a
few other clang tools like clang-format, clang-scan-deps are using this
macro, so it is good to be consistent.

rdar://problem/15386909

Reviewed By: JDevlieghere

Differential Revision: https://reviews.llvm.org/D80770

4 years ago[SVE] Eliminate calls to default-false VectorType::get() from X86
Christopher Tetreault [Fri, 29 May 2020 22:52:33 +0000 (15:52 -0700)]
[SVE] Eliminate calls to default-false VectorType::get() from X86

Reviewers: efriedma, sdesmalen, c-rhodes, craig.topper

Reviewed By: craig.topper

Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D80331

4 years ago[libFuzzer] Fixed description of fuzzer merge control file.
Adrian Herrera [Fri, 29 May 2020 22:59:58 +0000 (15:59 -0700)]
[libFuzzer] Fixed description of fuzzer merge control file.

Summary:
The description of the fuzzer merge control file appears to be incorrect/out of date.
No "DONE" line appears in the control file. Rather, FT and COV are the markers that appear
following the STARTED line.

Reviewers: metzman, kcc

Reviewed By: kcc

Subscribers: #sanitizers

Tags: #sanitizers

Differential Revision: https://reviews.llvm.org/D80788

4 years ago[mlir] Fix Windows build
Nicolas Vasilache [Fri, 29 May 2020 21:29:35 +0000 (17:29 -0400)]
[mlir] Fix Windows build

Summary:
MSVC does not seem to like certain forward declarations.

https://reviews.llvm.org/D80728 introduces an error where
seemingly unrelated .cpp files that include the .h
(but do not otherwise use the class that depends on the forward declaration).

Instead of forward declaration, include the full vector ops definition.

Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D80841