platform/upstream/llvm.git
3 years ago[libcxx/variant] Implement workaround for GCC bug.
Michael Park [Sun, 30 Aug 2020 16:42:35 +0000 (12:42 -0400)]
[libcxx/variant] Implement workaround for GCC bug.

A parameter pack is deemed to be uncaptured, which is bogus... but it seems to
be because it's within an expression that involves `decltype` of an uncaptured
pack or something: https://godbolt.org/z/b8z3sh

Drive-by fix for uglified name.

Differential Revision: https://reviews.llvm.org/D86827

3 years ago[FastISel] update to use intrinsic's isCommutative(); NFC
Sanjay Patel [Sun, 30 Aug 2020 15:36:41 +0000 (11:36 -0400)]
[FastISel] update to use intrinsic's isCommutative(); NFC

This requires adding a missing 'const' to the definition because
the callers are using const args, but there should be no change
in behavior.

The intrinsic method was added with D86798 / rG096527214033

3 years ago[DAGCombiner] skip reciprocal divisor optimization for x/sqrt(x)
Sanjay Patel [Sun, 30 Aug 2020 14:51:28 +0000 (10:51 -0400)]
[DAGCombiner] skip reciprocal divisor optimization for x/sqrt(x)

In general, we probably want to try the multi-use reciprocal
transform before sqrt transforms, but x/sqrt(x) is a special-case
because that will always reduce to plain sqrt(x) or an estimate.

The AArch64 tests show that the transform is limited by TLI
hook to patterns where there are 3 or more uses of the divisor.
So this change can result in an extra division compared to
what we had, but that's the intended behvior based on the
current setting of that hook.

3 years ago[AArch64] add tests for multi-use fast sqrt/recip; NFC
Sanjay Patel [Sun, 30 Aug 2020 12:51:17 +0000 (08:51 -0400)]
[AArch64] add tests for multi-use fast sqrt/recip; NFC

3 years ago[x86] add tests for multi-use fast sqrt/recip; NFC
Sanjay Patel [Sun, 30 Aug 2020 12:40:27 +0000 (08:40 -0400)]
[x86] add tests for multi-use fast sqrt/recip; NFC

3 years ago[SLP] make commutative check apply only to binops; NFC
Sanjay Patel [Sun, 30 Aug 2020 12:32:13 +0000 (08:32 -0400)]
[SLP] make commutative check apply only to binops; NFC

As discussed in D86798, it's not clear if the caller code
works with a more liberal definition of "commutative" that
includes intrinsics like min/max. This makes the binop
restriction (current functionality is unchanged) explicit
until the code is audited/tested.

3 years ago[CVP] Regenerate test checks (NFC)
Nikita Popov [Sun, 30 Aug 2020 14:23:59 +0000 (16:23 +0200)]
[CVP] Regenerate test checks (NFC)

3 years ago[NFC][compiler-rt] Factor out __div[sdt]i3 and __mod[dt]i3 implementations
Anatoly Trosinenko [Sun, 30 Aug 2020 13:12:44 +0000 (16:12 +0300)]
[NFC][compiler-rt] Factor out __div[sdt]i3 and __mod[dt]i3 implementations

Reviewed By: MaskRay

Differential Revision: https://reviews.llvm.org/D86400

3 years ago[Hexagon] Fix perfect shuffle generation for single vectors
Krzysztof Parzyszek [Sat, 29 Aug 2020 22:07:48 +0000 (17:07 -0500)]
[Hexagon] Fix perfect shuffle generation for single vectors

Perfect shuffle instruction (vdealvdd/vshuffvdd) work on vector
pairs. When given a single input vector, half of it first needs
to be transposed into the other vector before the generated
shuffles can take effect. Also the first transpose needs to be
undone at the end (this last step was missing).

3 years ago[LV] Add some const to RecurrenceDescriptor. NFC
David Green [Sun, 30 Aug 2020 11:27:51 +0000 (12:27 +0100)]
[LV] Add some const to RecurrenceDescriptor. NFC

3 years ago[llvm-reduce] Function body reduction: don't forget to unset comdat
Tyker [Sun, 30 Aug 2020 10:17:21 +0000 (12:17 +0200)]
[llvm-reduce] Function body reduction: don't forget to unset comdat

althought the interstingness test should usually fail when the module is invalid
this changes reduces the frequency at which llvm-reduce generate invalid IR.

Reviewed By: lebedev.ri

Differential Revision: https://reviews.llvm.org/D86404

3 years agoReland [OpenMPOpt] ICV tracking for calls
sstefan1 [Sun, 30 Aug 2020 09:27:48 +0000 (11:27 +0200)]
Reland [OpenMPOpt] ICV tracking for calls

The problem with module slice has been addressed in D86319

Introduce two new AAs. AAICVTrackerFunctionReturned which checks if a
function can have a unique ICV value after it is finished, and
AAICVCallSiteReturned which checks AAICVTrackerFunctionReturned for a
call site. This enables us to check the value of a call and if it
changes the ICV. This also changes the approach in
`getReplacementValues()` to a worklist-based approach so we can explore
all relevant BBs.

Differential Revision: https://reviews.llvm.org/D85544

3 years ago[Attributor] Introduce module slice.
sstefan1 [Sun, 30 Aug 2020 08:29:07 +0000 (10:29 +0200)]
[Attributor] Introduce module slice.

Summary:
The module slice describes which functions we can analyze and transform
while working on an SCC as part of the Attributor-CGSCC pass. So far we
simply restricted it to the SCC.

Reviewers: jdoerfert

Differential Revision: https://reviews.llvm.org/D86319

3 years agoImprove doc comments for several methods returning bools
Kamlesh Kumar [Sun, 30 Aug 2020 07:59:53 +0000 (13:29 +0530)]
Improve doc comments for several methods returning bools

Differential Revision: https://reviews.llvm.org/D86848

3 years ago[OpenMPOpt][NFC] add reproducer for problem found in D85544
sstefan1 [Sun, 30 Aug 2020 07:59:25 +0000 (09:59 +0200)]
[OpenMPOpt][NFC] add reproducer for problem found in D85544

3 years ago[LangRef] Apply a missing comment from D86189
Juneyoung Lee [Sun, 30 Aug 2020 05:56:10 +0000 (14:56 +0900)]
[LangRef] Apply a missing comment from D86189

3 years ago[LangRef] State that storing an aggregate fills padding with undef
Juneyoung Lee [Wed, 19 Aug 2020 02:21:05 +0000 (11:21 +0900)]
[LangRef] State that storing an aggregate fills padding with undef

This patch makes LangRef be explicit about the value of padding when storing an aggregate.
It states that when an aggregate is stored into memory, padding is filled with undef.

Here is a clue that supports this change (edited to reflect the discussion from llvm-dev):

- IPSCCP ignores padding and directly stores a constant aggregate if possible. It loses the data stored in the padding. https://godbolt.org/z/xzenYs Memcpyopt ignores (the preexisting value of) padding when copying an aggregate or storing a constant: https://godbolt.org/z/hY6ndd / https://godbolt.org/z/3WMP5a

The two items below are not relevant with this patch because Clang lowers load/store of individual field of struct into load/stores of the corresponding pointer with a primitive type. Also, when copy is needed, it uses memcpy instead of load/store of an aggregate, as discussed in the llvm-dev. However, this patch is still valid (as discussed) because it is needed to explain the two optimizations above.

- According to C17, the value of padding bytes when storing values in structures or unions is unspecified.

- I updated Alive2 and it did not find any problematic transformation from LLVM unit tests and while running translation validation of a few C programs.

Reviewed By: nikic

Differential Revision: https://reviews.llvm.org/D86189

3 years ago[Attributor] Fix callsite check in AAUndefinedBehavior
Shinji Okumura [Sun, 30 Aug 2020 04:17:02 +0000 (13:17 +0900)]
[Attributor] Fix callsite check in AAUndefinedBehavior

This is the next patch of D86842
When we check `noundef` attribute violation at callsites, we do not have to require `nonnull` in the following two cases.
1. An argument is known to be simplified to undef
2. An argument is known to be dead

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D86845

3 years ago[Attributor][NFC] Fix dependency type in AAUndefinedBehaviorImpl::updateImpl
Shinji Okumura [Sun, 30 Aug 2020 03:34:50 +0000 (12:34 +0900)]
[Attributor][NFC] Fix dependency type in AAUndefinedBehaviorImpl::updateImpl

This patch fixes wrong dependency type in AAUB.

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D86842

3 years agoSet alignment of .llvmbc and .llvmcmd to 1
Fangrui Song [Sun, 30 Aug 2020 01:27:34 +0000 (18:27 -0700)]
Set alignment of .llvmbc and .llvmcmd to 1

Otherwise their alignment is dependent on the size of the section.  If the size
is large than 16, the alignment will be 16.

16 is a bad choice for both .llvmbc and .llvmcmd because the padding between two
contributions from input sections is of a variable size.

A bitstream is actually guaranteed to be 4-byte aligned, but consumers don't
need this property.

3 years agoRemove OpenBSD/sparc support
Brad Smith [Sun, 30 Aug 2020 00:47:18 +0000 (20:47 -0400)]
Remove OpenBSD/sparc support

3 years ago[ORC] Add getDFSLinkOrder / getReverseDFSLinkOrder methods to JITDylib.
Lang Hames [Sat, 29 Aug 2020 21:05:04 +0000 (14:05 -0700)]
[ORC] Add getDFSLinkOrder / getReverseDFSLinkOrder methods to JITDylib.

DFS and Reverse-DFS linkage orders are used to order execution of
deinitializers and initializers respectively.

This patch replaces uses of special purpose DFS order functions in
MachOPlatform and LLJIT with uses of the new methods.

3 years ago[libc++] Temporarily force-set the LIBCXX_TEST_CONFIG cache value
Louis Dionne [Sat, 29 Aug 2020 22:07:26 +0000 (18:07 -0400)]
[libc++] Temporarily force-set the LIBCXX_TEST_CONFIG cache value

This ensures that existing CMake build trees will start using the new
default without having to nuke their build directories.

3 years ago[libc++] Move the default site config template alongside other config files
Louis Dionne [Sat, 29 Aug 2020 21:13:02 +0000 (17:13 -0400)]
[libc++] Move the default site config template alongside other config files

3 years ago[libc++] Add from-scratch configuration files for the test suite
Louis Dionne [Fri, 28 Aug 2020 21:56:24 +0000 (17:56 -0400)]
[libc++] Add from-scratch configuration files for the test suite

This commit adds the first from-scratch configuration files for running
the libc++ test suite without using the old configuration:

- libcxx-trunk-shared.cfg.py:
   Runs the test suite against a trunk libc++ shared library.
- libcxx-trunk-static.cfg.py:
   Runs the test suite against a trunk libc++ static library.

Differential Revision: https://reviews.llvm.org/D81866

3 years ago[Attributor] Fix AANoUndef identification
Shinji Okumura [Sat, 29 Aug 2020 20:34:24 +0000 (05:34 +0900)]
[Attributor] Fix AANoUndef identification

Even though `noundef` IR attribute might be attached to non-void type values, AANoUndef is mistakenly identified for pointer type values only.
This patch fixes that.

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D86737

3 years ago[InstSimplify] Reduce code duplication in simplifySelectWithICmpCond (NFC)
Nikita Popov [Sat, 29 Aug 2020 20:31:06 +0000 (22:31 +0200)]
[InstSimplify] Reduce code duplication in simplifySelectWithICmpCond (NFC)

Canonicalize icmp ne to icmp eq and implement all the folds only once.

3 years ago[InstSimplify] Protect against more poison in SimplifyWithOpReplaced (PR47322)
Nikita Popov [Sat, 29 Aug 2020 08:55:55 +0000 (10:55 +0200)]
[InstSimplify] Protect against more poison in SimplifyWithOpReplaced (PR47322)

Replace the check for poison-producing instructions in
SimplifyWithOpReplaced() with the generic helper canCreatePoison()
that properly handles poisonous shifts and thus avoids the problem
from PR47322.

This additionally fixes a bug in IIQ.UseInstrInfo=false mode, which
previously could have caused this code to ignore poison flags.
Setting UseInstrInfo=false should reduce the possible optimizations,
not increase them.

This is not a full solution to the problem, as poison could be
introduced more indirectly. This is just a minimal, easy to backport
fix.

Differential Revision: https://reviews.llvm.org/D86834

3 years ago[LV] Check opt-for-size before expanding runtime checks.
Florian Hahn [Sat, 29 Aug 2020 19:21:19 +0000 (20:21 +0100)]
[LV] Check opt-for-size before expanding runtime checks.

Move bail out when optimizing for size before runtime check generation.
In that case, we do not use the result of the expansion, the expanded
instruction will be dead and cleaned up later.

By doing the check before expanding the runtime-checks, we can save a
bit of unnecessary work.

3 years ago[LVI] Remove unnecessary lambda capture (NFC)
Nikita Popov [Sat, 29 Aug 2020 19:33:19 +0000 (21:33 +0200)]
[LVI] Remove unnecessary lambda capture (NFC)

3 years agoReapply [LVI] Normalize pointer behavior
Nikita Popov [Wed, 4 Dec 2019 19:51:31 +0000 (20:51 +0100)]
Reapply [LVI] Normalize pointer behavior

This got reverted because a dependency was reverted. It has since
been reapplied, so reapply this as well.

-----

Related to D69686. As noted there, LVI currently behaves differently
for integer and pointer values: For integers, the block value is always
valid inside the basic block, while for pointers it is only valid at
the end of the basic block. I believe the integer behavior is the
correct one, and CVP relies on it via its getConstantRange() uses.

The reason for the special pointer behavior is that LVI checks whether
a pointer is dereferenced in a given basic block and marks it as
non-null in that case. Of course, this information is valid only after
the dereferencing instruction, or in conservative approximation,
at the end of the block.

This patch changes the treatment of dereferencability: Instead of
including it inside the block value, we instead treat it as something
similar to an assume (it essentially is a non-nullness assume) and
incorporate this information in intersectAssumeOrGuardBlockValueConstantRange()
if the context instruction is the terminator of the basic block.
This happens either when determining an edge-value internally in LVI,
or when a terminator was explicitly passed to getValueAt(). The latter
case makes this more powerful than the previous implementation as
a side-effect, and this does actually seem benefitial in practice.

Of course, we do not want to recompute dereferencability on each
intersectAssume call, so we need a new cache for this. The
dereferencability analysis requires walking the entire basic block
and computing underlying objects of all memory operands. This was
previously done separately for each queried pointer value. In the
new implementation (both because this makes the caching simpler,
and because it is faster), I instead only walk the full BB once and
cache all the dereferenced pointers. So the traversal is now performed
only once per BB, instead of once per queried pointer value.

I think the overall model now makes more sense than before, and there
will be no more pitfalls due to differing integer/pointer behavior.

Differential Revision: https://reviews.llvm.org/D69914

3 years ago[NFC][Local] EliminateDuplicatePHINodes(): add STATISTIC()
Roman Lebedev [Sat, 29 Aug 2020 19:02:35 +0000 (22:02 +0300)]
[NFC][Local] EliminateDuplicatePHINodes(): add STATISTIC()

3 years ago[NFCI][Local] Rewrite EliminateDuplicatePHINodes to optionally check hashing invariants
Roman Lebedev [Sat, 29 Aug 2020 17:18:25 +0000 (20:18 +0300)]
[NFCI][Local] Rewrite EliminateDuplicatePHINodes to optionally check hashing invariants

EarlyCSE has a mode to verify the invariant that hash equality equals
key equality, but EliminateDuplicatePHINodes() doesn't.

I've verified that this would have caught the stage2-stage3 mismatches
5ec2b757cc7d37ff0d03b36ee863b0962fe78108 revert has fixed,
that were introduced last time in 3e69871ab5a66fb55913a2a2f5e7f5b42899a4c9.

3 years ago[Attributor][NFC] Do not manifest noundef for positions to be changed to undef
Shinji Okumura [Sat, 29 Aug 2020 18:23:41 +0000 (03:23 +0900)]
[Attributor][NFC] Do not manifest noundef for positions to be changed to undef

This patch fixes AANoUndef manifestation.
We should not manifest noundef for positions that will be changed to undef.

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D86835

3 years ago[Attributor][NFC] rerun update_test_checks without --scrub-attributes
sstefan1 [Sat, 29 Aug 2020 17:31:13 +0000 (19:31 +0200)]
[Attributor][NFC] rerun update_test_checks without --scrub-attributes

3 years ago[DSE,MemorySSA] Return early when hitting a MemoryPhi.
Florian Hahn [Mon, 24 Aug 2020 11:12:15 +0000 (12:12 +0100)]
[DSE,MemorySSA] Return early when hitting a MemoryPhi.

A MemoryPhi can never be eliminated. If we hit one, return the Phi, so
the caller can continue traversing the incoming accesses.

This saves some unnecessary read clobber checks and improves
compile-time
http://llvm-compile-time-tracker.com/compare.php?from=1ffc58b6d098ce8fa71f3a80fe75b990f633f921&to=d0fa8d1982380b57d7b6067528104bc373dbe07a&stat=instructions

3 years ago[IR] Inline AttrBuilder::addAttribute. It just sets 1 bit. NFC.
Benjamin Kramer [Sat, 29 Aug 2020 17:08:17 +0000 (19:08 +0200)]
[IR] Inline AttrBuilder::addAttribute. It just sets 1 bit. NFC.

3 years ago[Sema] Simplify ShouldDiagnoseUnusedDecl, NFC
Aaron Puchert [Sat, 29 Aug 2020 16:10:16 +0000 (18:10 +0200)]
[Sema] Simplify ShouldDiagnoseUnusedDecl, NFC

Instead of writing to a flag and then returning based on that flag we
can also return directly. The flag name also doesn't provide additional
information, it just reflects the name of the function (isReferenced).

3 years ago[Sema] ICK_Function_Conversion is a third kind conversion
Aaron Puchert [Sat, 29 Aug 2020 16:13:56 +0000 (18:13 +0200)]
[Sema] ICK_Function_Conversion is a third kind conversion

Not sure if this has any effect, but it was inconsistent before.

Reviewed By: rsmith

Differential Revision: https://reviews.llvm.org/D67113

3 years ago[Instruction] Speculatively undo isIdenticalToWhenDefined() PHI handling changes
Roman Lebedev [Sat, 29 Aug 2020 16:38:33 +0000 (19:38 +0300)]
[Instruction] Speculatively undo isIdenticalToWhenDefined() PHI handling changes

The stage2-stage3 differences persist even without instcombine-based
PHI CSE, so this is the only possible reason.

3 years ago[EarlyCSE] fold commutable intrinsics
Sanjay Patel [Sat, 29 Aug 2020 16:01:53 +0000 (12:01 -0400)]
[EarlyCSE] fold commutable intrinsics

Handling the new min/max intrinsics is the motivation, but it
turns out that we have a bunch of other intrinsics with this
missing bit of analysis too.

The FP min/max tests show that we are intersecting FMF,
so that part should be safe too.

As noted in https://llvm.org/PR46897 , there is a commutative
property specifier for intrinsics, but no corresponding function
attribute, and so apparently no uses of that bit. We may want to
remove that next.

Follow-up patches should wire up the Instruction::isCommutative()
to this IntrinsicInst specialization. That requires updating
callers to be aware of the more general commutative property
(not just binops).

Differential Revision: https://reviews.llvm.org/D86798

3 years ago[EarlyCSE] add tests for commutative intrinsics; NFC
Sanjay Patel [Thu, 27 Aug 2020 13:34:45 +0000 (09:34 -0400)]
[EarlyCSE] add tests for commutative intrinsics; NFC

3 years ago[TargetLowering] Strip tailing whitespace (NFC)
Nikita Popov [Sat, 29 Aug 2020 16:08:03 +0000 (18:08 +0200)]
[TargetLowering] Strip tailing whitespace (NFC)

3 years ago[InstCombine] Take 3: Perform trivial PHI CSE
Roman Lebedev [Sat, 29 Aug 2020 13:31:06 +0000 (16:31 +0300)]
[InstCombine] Take 3: Perform trivial PHI CSE

The original take 1 was 6102310d814ad73eab60a88b21dd70874f7a056f,
which taught InstSimplify to do that, which seemed better at time,
since we got EarlyCSE support for free.

However, it was proven that we can not do that there,
the simplified-to PHI would not be reachable from the original PHI,
and that is not something InstSimplify is allowed to do,
as noted in the commit ed90f15efb40d26b5d3ead3bb8e9e284218e0186
that reverted it:
> It appears to cause compilation non-determinism and caused stage3 mismatches.

Then there was take 2 3e69871ab5a66fb55913a2a2f5e7f5b42899a4c9,
which was InstCombine-specific, but it again showed stage2-stage3 differences,
and reverted in bdaa3f86a040b138c58de41d73d35b76fdec1380.
This is quite alarming.

Here, let's try to change how we find existing PHI candidate:
due to the worklist order, and the way PHI nodes are inserted
(it may be inserted as the first one, or maybe not), let's look at *all*
PHI nodes in the block.

Effects on vanilla llvm test-suite + RawSpeed:
```
| statistic name                                     | baseline  | proposed  |      Î” |        % |    \|%\| |
|----------------------------------------------------|-----------|-----------|-------:|---------:|---------:|
| asm-printer.EmittedInsts                           | 7942329   | 7942457   |    128 |    0.00% |    0.00% |
| assembler.ObjectBytes                              | 254295632 | 254312480 |  16848 |    0.01% |    0.01% |
| correlated-value-propagation.NumPhis               | 18412     | 18347     |    -65 |   -0.35% |    0.35% |
| early-cse.NumCSE                                   | 2183283   | 2183267   |    -16 |    0.00% |    0.00% |
| early-cse.NumSimplify                              | 550105    | 541842    |  -8263 |   -1.50% |    1.50% |
| instcombine.NumAggregateReconstructionsSimplified  | 73        | 4506      |   4433 | 6072.60% | 6072.60% |
| instcombine.NumCombined                            | 3640311   | 3644419   |   4108 |    0.11% |    0.11% |
| instcombine.NumDeadInst                            | 1778204   | 1783205   |   5001 |    0.28% |    0.28% |
| instcombine.NumPHICSEs                             | 0         | 22490     |  22490 |    0.00% |    0.00% |
| instcombine.NumWorklistIterations                  | 2023272   | 2024400   |   1128 |    0.06% |    0.06% |
| instcount.NumCallInst                              | 1758395   | 1758802   |    407 |    0.02% |    0.02% |
| instcount.NumInvokeInst                            | 59478     | 59502     |     24 |    0.04% |    0.04% |
| instcount.NumPHIInst                               | 330557    | 330545    |    -12 |    0.00% |    0.00% |
| instcount.TotalBlocks                              | 1077138   | 1077220   |     82 |    0.01% |    0.01% |
| instcount.TotalFuncs                               | 101442    | 101441    |     -1 |    0.00% |    0.00% |
| instcount.TotalInsts                               | 8831946   | 8832606   |    660 |    0.01% |    0.01% |
| simplifycfg.NumHoistCommonCode                     | 24186     | 24187     |      1 |    0.00% |    0.00% |
| simplifycfg.NumInvokes                             | 4300      | 4410      |    110 |    2.56% |    2.56% |
| simplifycfg.NumSimpl                               | 1019813   | 999767    | -20046 |   -1.97% |    1.97% |
```
So it fires 22490 times, which is less than ~24k the take 1 did,
but more than what take 2 did (22228 times)
.
It allows foldAggregateConstructionIntoAggregateReuse() to actually work
after PHI-of-extractvalue folds did their thing. Previously SimplifyCFG
would have done this PHI CSE, of all places. Additionally, allows some
more `invoke`->`call` folds to happen (+110, +2.56%).

All in all, expectedly, this catches less things overall,
but all the motivational cases are still caught, so all good.

3 years ago[UpdateTestChecks] Don't skip attributes when comparing functions
sstefan1 [Sat, 29 Aug 2020 13:35:14 +0000 (15:35 +0200)]
[UpdateTestChecks] Don't skip attributes when comparing functions

3 years agoRevert "[InstCombine] Take 2: Perform trivial PHI CSE"
Roman Lebedev [Sat, 29 Aug 2020 13:00:07 +0000 (16:00 +0300)]
Revert "[InstCombine] Take 2: Perform trivial PHI CSE"

While the original variant with doing this in InstSimplify (rightfully)
caused questions and ultimately was detected to be a culprit
of stage2-stage3 mismatch, it was expected that
InstCombine-based implementation would be fine.

But apparently it's not, as
http://lab.llvm.org:8011/builders/clang-with-thin-lto-ubuntu/builds/24095/steps/compare-compilers/logs/stdio
suggests.

Which suggests that somewhere in InstCombine there is a loop
over nondeterministically sorted container, which causes
different worklist ordering.

This reverts commit 3e69871ab5a66fb55913a2a2f5e7f5b42899a4c9.

3 years ago[InstCombine] Return replaceInstUsesWith() result (NFC)
Nikita Popov [Sat, 29 Aug 2020 12:49:57 +0000 (14:49 +0200)]
[InstCombine] Return replaceInstUsesWith() result (NFC)

Follow the usual usage pattern for this function and return the
result.

3 years ago[AArch64] Generate and parse SEH assembly directives
Martin Storsjö [Fri, 7 Aug 2020 09:44:48 +0000 (12:44 +0300)]
[AArch64] Generate and parse SEH assembly directives

This ensures that you get the same output regardless if generating
code directly to an object file or if generating assembly and
assembling that.

Add implementations of the EmitARM64WinCFI*() methods in
AArch64TargetAsmStreamer, and fill in one blank in MCAsmStreamer.

Add corresponding directive handlers in AArch64AsmParser and
COFFAsmParser.

Some SEH directive names have been picked to match the prior art
for SEH assembly directives for x86_64, e.g. the spelling of
".seh_startepilogue" matching the preexisting ".seh_endprologue".

For the directives for saving registers, the exact spelling
from the arm64 documentation is picked, e.g. ".seh_save_reg" (to follow
that naming for all the other ones, e.g. ".seh_save_fregp_x"), while
the corresponding one for x86_64 is plain ".seh_savereg" without the
second underscore.

Directives in the epilogues have the same names as in prologues,
e.g. .seh_savereg, even though the registers are restored, not
saved, at that point.

Differential Revision: https://reviews.llvm.org/D86529

3 years ago[MC] [Win64EH] Fill in FuncletOrFuncEnd if missing
Martin Storsjö [Thu, 20 Aug 2020 08:39:25 +0000 (11:39 +0300)]
[MC] [Win64EH] Fill in FuncletOrFuncEnd if missing

This can happen e.g. for code that declare .seh_proc/.seh_endproc
in assembly, or for code that use .seh_handlerdata (which triggers
the unwind info to be emitted before the end of the function).

The TextSection field must be made non-const to be able to use it
with Streamer.SwitchSection().

Differential Revision: https://reviews.llvm.org/D86528

3 years ago[InstCombine] foldAggregateConstructionIntoAggregateReuse(): use InstCombiner::replac...
Roman Lebedev [Sat, 29 Aug 2020 12:04:07 +0000 (15:04 +0300)]
[InstCombine] foldAggregateConstructionIntoAggregateReuse(): use InstCombiner::replaceInstUsesWith() instead of RAUW

We really shouldn't use RAUW in InstCombine
because we should consistently update Worklist to avoid extra iterations.

3 years ago[InstCombine] canonicalizeICmpPredicate(): use InstCombiner::replaceInstUsesWith...
Roman Lebedev [Sat, 29 Aug 2020 11:59:09 +0000 (14:59 +0300)]
[InstCombine] canonicalizeICmpPredicate(): use InstCombiner::replaceInstUsesWith() instead of RAUW

We really shouldn't use RAUW in InstCombine
because we should consistently update Worklist to avoid extra iterations.

3 years ago[NFC][InstCombine] Fix some comments: the code already uses IC::replaceInstUsesWith()
Roman Lebedev [Sat, 29 Aug 2020 11:54:08 +0000 (14:54 +0300)]
[NFC][InstCombine] Fix some comments: the code already uses IC::replaceInstUsesWith()

3 years ago[NFC] Instruction::isIdenticalToWhenDefined(): s/nessesairly/necessarily/
Roman Lebedev [Sat, 29 Aug 2020 11:53:15 +0000 (14:53 +0300)]
[NFC] Instruction::isIdenticalToWhenDefined(): s/nessesairly/necessarily/

3 years ago[NFC][InstCombine] Add STATISTIC() for how many iterations we did
Roman Lebedev [Sat, 29 Aug 2020 11:52:15 +0000 (14:52 +0300)]
[NFC][InstCombine] Add STATISTIC() for how many iterations we did

As we've established, if it takes more than two iterations
(one to perform folding and one to ensure that no folding opportunities
remain) per function, then there are worklist management issues.
So it may be interesting to keep track of it.

3 years ago[NFC][InstCombine] select.ll: remove outdated TODO comment
Roman Lebedev [Sat, 29 Aug 2020 11:45:48 +0000 (14:45 +0300)]
[NFC][InstCombine] select.ll: remove outdated TODO comment

Fixed by 3e69871ab5a66fb55913a2a2f5e7f5b42899a4c9

3 years ago[InstCombine] visitPHINode(): use InstCombiner::replaceInstUsesWith() instead of...
Roman Lebedev [Sat, 29 Aug 2020 11:44:16 +0000 (14:44 +0300)]
[InstCombine] visitPHINode(): use InstCombiner::replaceInstUsesWith() instead of RAUW

As noted in post-commit review, we really shouldn't use RAUW in InstCombine
because we should consistently update Worklist to avoid extra iterations.

3 years ago[InstCombine] Take 2: Perform trivial PHI CSE
Roman Lebedev [Sat, 29 Aug 2020 07:42:38 +0000 (10:42 +0300)]
[InstCombine] Take 2: Perform trivial PHI CSE

The original take was 6102310d814ad73eab60a88b21dd70874f7a056f,
which taught InstSimplify to do that, which seemed better at time,
since we got EarlyCSE support for free.

However, it was proven that we can not do that there,
the simplified-to PHI would not be reachable from the original PHI,
and that is not something InstSimplify is allowed to do,
as noted in the commit ed90f15efb40d26b5d3ead3bb8e9e284218e0186
that reverted it :
> It appears to cause compilation non-determinism and caused stage3 mismatches.

However InstCombine already does many different optimizations,
so it should be a safe place to do it here.

Note that we still can't just compare incoming values ranges,
because there is no guarantee that these PHI's we'd simplify to
were already re-visited and sorted.
However coming up with a test is problematic.

Effects on vanilla llvm test-suite + RawSpeed:
```
| statistic name                                     | baseline  | proposed  |      Î” |        % |      |%| |
|----------------------------------------------------|-----------|-----------|-------:|---------:|---------:|
| instcombine.NumPHICSEs                             | 0         | 22228     |  22228 |    0.00% |    0.00% |
| asm-printer.EmittedInsts                           | 7942329   | 7942456   |    127 |    0.00% |    0.00% |
| assembler.ObjectBytes                              | 254295632 | 254313792 |  18160 |    0.01% |    0.01% |
| early-cse.NumCSE                                   | 2183283   | 2183272   |    -11 |    0.00% |    0.00% |
| early-cse.NumSimplify                              | 550105    | 541842    |  -8263 |   -1.50% |    1.50% |
| instcombine.NumAggregateReconstructionsSimplified  | 73        | 4506      |   4433 | 6072.60% | 6072.60% |
| instcombine.NumCombined                            | 3640311   | 3666911   |  26600 |    0.73% |    0.73% |
| instcombine.NumDeadInst                            | 1778204   | 1783318   |   5114 |    0.29% |    0.29% |
| instcount.NumCallInst                              | 1758395   | 1758804   |    409 |    0.02% |    0.02% |
| instcount.NumInvokeInst                            | 59478     | 59502     |     24 |    0.04% |    0.04% |
| instcount.NumPHIInst                               | 330557    | 330549    |     -8 |    0.00% |    0.00% |
| instcount.TotalBlocks                              | 1077138   | 1077221   |     83 |    0.01% |    0.01% |
| instcount.TotalFuncs                               | 101442    | 101441    |     -1 |    0.00% |    0.00% |
| instcount.TotalInsts                               | 8831946   | 8832611   |    665 |    0.01% |    0.01% |
| simplifycfg.NumInvokes                             | 4300      | 4410      |    110 |    2.56% |    2.56% |
| simplifycfg.NumSimpl                               | 1019813   | 999740    | -20073 |   -1.97% |    1.97% |
```
So it fires ~22k times, which is less than ~24k the take 1 did.
It allows foldAggregateConstructionIntoAggregateReuse() to actually work
after PHI-of-extractvalue folds did their thing. Previously SimplifyCFG
would have done this PHI CSE, of all places. Additionally, allows some
more `invoke`->`call` folds to happen (+110, +2.56%).

All in all, expectedly, this catches less things overall,
but all the motivational cases are still caught, so all good.

3 years ago[NFC][InstSimplify] Add a note to PHI CSE tests that they are all negative tests
Roman Lebedev [Sat, 29 Aug 2020 09:12:53 +0000 (12:12 +0300)]
[NFC][InstSimplify] Add a note to PHI CSE tests that they are all negative tests

As discussed in https://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20200824/824235.html
even though it seems worthwhile doing so in InstSimplify,
we really can't do that there, because the other PHI wouldn't be
def-reachable from the original PHI.

3 years ago[NFC][InstCombine] Add tests for PHI CSE
Roman Lebedev [Sat, 29 Aug 2020 07:38:28 +0000 (10:38 +0300)]
[NFC][InstCombine] Add tests for PHI CSE

As discussed in https://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20200824/824235.html
even though it seems worthwhile doing so in InstSimplify,
we really can't do that there, because the other PHI wouldn't be
def-reachable from the original PHI.

So ignoring whether or not EarlyCSE should also know to do this,
InstCombine is the place.

3 years ago[PPC] Fix platform definitions when compiling FreeBSD powerpc64 as LE
Dimitry Andric [Sat, 29 Aug 2020 10:02:51 +0000 (12:02 +0200)]
[PPC] Fix platform definitions when compiling FreeBSD powerpc64 as LE

As a prerequisite to doing experimental buids of pieces of FreeBSD PowerPC64 as little-endian, allow actually targeting it.

This is needed so basic platform definitions are pulled in. Without it, the compiler will only run freestanding.

Reviewed By: MaskRay

Differential Revision: https://reviews.llvm.org/D73425

3 years ago[InstCombine] Fix typo in comment (NFC)
Nikita Popov [Sat, 29 Aug 2020 08:17:17 +0000 (10:17 +0200)]
[InstCombine] Fix typo in comment (NFC)

As pointed out in post-commit review of D63060.

3 years ago[Target][AArch64] Allow for char as int8_t in AArch64AsmParser.cpp
Rainer Orth [Sat, 29 Aug 2020 08:01:04 +0000 (10:01 +0200)]
[Target][AArch64] Allow for char as int8_t in AArch64AsmParser.cpp

A couple of AArch64 tests were failing on Solaris, both sparc and x86:

  LLVM :: MC/AArch64/SVE/add-diagnostics.s
  LLVM :: MC/AArch64/SVE/cpy-diagnostics.s
  LLVM :: MC/AArch64/SVE/cpy.s
  LLVM :: MC/AArch64/SVE/dup-diagnostics.s
  LLVM :: MC/AArch64/SVE/dup.s
  LLVM :: MC/AArch64/SVE/mov-diagnostics.s
  LLVM :: MC/AArch64/SVE/mov.s
  LLVM :: MC/AArch64/SVE/sqadd-diagnostics.s
  LLVM :: MC/AArch64/SVE/sqsub-diagnostics.s
  LLVM :: MC/AArch64/SVE/sub-diagnostics.s
  LLVM :: MC/AArch64/SVE/subr-diagnostics.s
  LLVM :: MC/AArch64/SVE/uqadd-diagnostics.s
  LLVM :: MC/AArch64/SVE/uqsub-diagnostics.s

For example, reduced from `MC/AArch64/SVE/add-diagnostics.s`:

  add     z0.b, z0.b, #0, lsl #8

missed the expected diagnostics

  $ ./bin/llvm-mc -triple=aarch64 -show-encoding -mattr=+sve add.s
  add.s:1:21: error: immediate must be an integer in range [0, 255] with a shift amount of 0
  add     z0.b, z0.b, #0, lsl #8
                      ^

The message is `Match_InvalidSVEAddSubImm8`, emitted in the generated
`lib/Target/AArch64/AArch64GenAsmMatcher.inc` for `MCK_SVEAddSubImm8`.
When comparing the call to `::AArch64Operand::isSVEAddSubImm<char>` on both
Linux/x86_64 and Solaris, I find

  875     bool IsByte = std::is_same<int8_t, std::make_signed_t<T>>::value;

is `false` on Solaris, unlike Linux.

The problem boils down to the fact that `int8_t` is plain `char` on
Solaris: both the sparc and i386 psABIs have `char` as signed.  However,
with

  9887     DiagnosticPredicate DP(Operand.isSVEAddSubImm<int8_t>());

in `lib/Target/AArch64/AArch64GenAsmMatcher.inc`, `std::make_signed_t<int8_t>`
above yieds `signed char`, so `std::is_same<int8_t, signed char>` is `false`.

This can easily be fixed by also allowing for `int8_t` here and in a few
similar places.

Tested on `amd64-pc-solaris2.11`, `sparcv9-sun-solaris2.11`, and
`x86_64-pc-linux-gnu`.

Differential Revision: https://reviews.llvm.org/D85225

3 years ago[Attributes] Merge calls to getFnAttribute/hasFnAttribute using Attribute::isValid...
Craig Topper [Sat, 29 Aug 2020 06:15:34 +0000 (23:15 -0700)]
[Attributes] Merge calls to getFnAttribute/hasFnAttribute using Attribute::isValid. NFC

Rather than calling hasFnAttribute and then calling getFnAttribute
if the attribute exists, its better to just call getFnAttribute and
then check if we got a valid attribute back.

3 years ago[NFC][InstructionSimplify] Add a warning about not simplifying to not def-reachable
Roman Lebedev [Sat, 29 Aug 2020 06:57:04 +0000 (09:57 +0300)]
[NFC][InstructionSimplify] Add a warning about not simplifying to not def-reachable

See
https://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20200824/824235.html
and
https://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20200824/824967.html

InstSimply is not allowed to perform simplifications to instructions
that are not def-reachable from the original instruction.

3 years ago[NFC][STLExtras] Add make_first_range(), similar to existing make_second_range()
Roman Lebedev [Sat, 29 Aug 2020 06:47:09 +0000 (09:47 +0300)]
[NFC][STLExtras] Add make_first_range(), similar to existing make_second_range()

Having just one of the two seens weird.
I wanted to use it a few times, but it wasn't there.

3 years ago[DWARFYAML] Make the debug_abbrev_offset field optional.
Xing GUO [Sat, 29 Aug 2020 06:53:08 +0000 (14:53 +0800)]
[DWARFYAML] Make the debug_abbrev_offset field optional.

This patch helps make the debug_abbrev_offset field optional. We don't
need to calculate the value of this field in the future.

Reviewed By: jhenderson

Differential Revision: https://reviews.llvm.org/D86614

3 years ago[LLD][PowerPC][test] Disable ELF/ppc64-pcrel-long-branch-error.s
Hubert Tong [Sat, 29 Aug 2020 04:08:52 +0000 (00:08 -0400)]
[LLD][PowerPC][test] Disable ELF/ppc64-pcrel-long-branch-error.s

Following 0becc27ebfec, `ppc64-pcrel-long-branch-error.s` fails in some
environments with out-of-memory errors associated with buffering the
output in-memory. Since the alternative of writing to an allocated file
is also known to cause problems, we will disable the test
unconditionally (pending a mechanism to disable the test selectively).

3 years ago[DAGCombiner] Enhance (zext(setcc))
Kai Luo [Sat, 29 Aug 2020 03:24:25 +0000 (03:24 +0000)]
[DAGCombiner] Enhance (zext(setcc))

Current `v:t = zext(setcc x,y,cc)` will be transformed to `select x, y, 1:t, 0:t, cc`. It misses some opportunities if x's type size is less than `t`'s size. This patch enhances the above transformation.

Reviewed By: spatel

Differential Revision: https://reviews.llvm.org/D86687

3 years ago[compiler-rt][tsan] Remove unnecesary typedefs
Marco Vanotti [Sat, 29 Aug 2020 00:11:26 +0000 (17:11 -0700)]
[compiler-rt][tsan] Remove unnecesary typedefs

These typedefs are not used anywhere else in this compilation unit.

Differential Revision: https://reviews.llvm.org/D86826

3 years ago[lldb] Hoist --framework argument out of LLDB_TEST_COMMON_ARGS (NFC)
Jonas Devlieghere [Sat, 29 Aug 2020 01:15:28 +0000 (18:15 -0700)]
[lldb] Hoist --framework argument out of LLDB_TEST_COMMON_ARGS (NFC)

Give the framework argument its own variable (LLDB_FRAMEWORK_DIR) so
that we can configure it in lit.site.cfg.py if we so desire.

3 years ago[lldb] Make the lit configuration values optional for the API tests
Jonas Devlieghere [Sat, 29 Aug 2020 01:05:01 +0000 (18:05 -0700)]
[lldb] Make the lit configuration values optional for the API tests

LIT uses a model where the test suite is configurable trough a
lit.site.cfg file. Most of the time we use the lit.site.cfg with values
that match the current build configuration, generated by CMake.

Nothing prevents you from running the test suite with a different
configuration, either by overriding some of these values from the
command line, or by passing a different lit.site.cfg.

The latter is currently tedious. Many configuration values are optional
but they still need to be set because lit.cfg.py is accessing them
directly. This patch changes the code to use getattr to return the
attribute if it exists. This makes it possible to specify a minimal
lit.site.cfg with only the mandatory/desired configuration values.

Differential revision: https://reviews.llvm.org/D86821

3 years ago[ObjC][ARC] In HandlePotentialAlterRefCount, check whether an
Akira Hatanaka [Sat, 29 Aug 2020 00:44:56 +0000 (17:44 -0700)]
[ObjC][ARC] In HandlePotentialAlterRefCount, check whether an
instruction can decrement the reference count, not whether it can alter
it

This prevents the state transition from S_Use to S_CanRelease when doing
a bottom-up traversal and the transition from S_Retain to S_CanRelease
when doing a top-down traversal when the visited instruction can
increment the ref count but cannot decrement it. This allows the ARC
optimizer to remove retain/release pairs which were previously not
removed.

rdar://problem/21793154

3 years agoUse report_fatal_error instead of llvm::errs() + abort() (NFC)
Mehdi Amini [Sat, 29 Aug 2020 00:34:59 +0000 (00:34 +0000)]
Use report_fatal_error instead of llvm::errs() + abort() (NFC)

This is making the error reporting in line with other fatal errors.

3 years ago[gcov][test] Don't write a.gcno in CWD
Fangrui Song [Sat, 29 Aug 2020 00:34:11 +0000 (17:34 -0700)]
[gcov][test] Don't write a.gcno in CWD

3 years ago[clang] Enable -fsanitize=thread on Fuchsia.
Marco Vanotti [Fri, 28 Aug 2020 22:42:09 +0000 (15:42 -0700)]
[clang] Enable -fsanitize=thread on Fuchsia.

This CL modifies clang enabling using -fsanitize=thread on fuchsia. The
change doesn't build the runtime for fuchsia, it just enables the
instrumentation to be used.

pair-programmed-with: mdempsky@google.com
Change-Id: I816c4d240d1f15e9eae2803fb8ba3a7bf667ed51

Reviewed By: mcgrathr, phosek

Differential Revision: https://reviews.llvm.org/D86822

3 years ago[ARC] Update brcc test.
Pete Couperus [Tue, 25 Aug 2020 18:17:06 +0000 (11:17 -0700)]
[ARC] Update brcc test.

3 years agoRevert "[InstSimplify][EarlyCSE] Try to CSE PHI nodes in the same basic block"
Owen Anderson [Fri, 28 Aug 2020 23:32:25 +0000 (23:32 +0000)]
Revert "[InstSimplify][EarlyCSE] Try to CSE PHI nodes in the same basic block"

This reverts commit 6102310d814ad73eab60a88b21dd70874f7a056f.  It
appears to cause compilation non-determinism and caused stage3
mismatches.

3 years ago[gcov] Increment counters with atomicrmw if -fsanitize=thread
Fangrui Song [Fri, 28 Aug 2020 23:32:35 +0000 (16:32 -0700)]
[gcov] Increment counters with atomicrmw if -fsanitize=thread

Without this patch, `clang --coverage -fsanitize=thread` may fail spuriously
because non-atomic counter increments can be detected as data races.

3 years ago[lldb] Get rid of LLDB_LIB_DIR and LLDB_IMPLIB_DIR in dotest
Jonas Devlieghere [Fri, 28 Aug 2020 22:43:35 +0000 (15:43 -0700)]
[lldb] Get rid of LLDB_LIB_DIR and LLDB_IMPLIB_DIR in dotest

This patch removes the rather confusing LLDB_LIB_DIR and LLDB_IMPLIB_DIR
environment variables. They are confusing because LLDB_LIB_DIR would
point to the bin subdirectory in the build root while LLDB_IMPLIB_DIR
would point to the lib subdirectory. The reason far this was
LLDB.framework, which gets build under bin.

This patch replaces their uses with configuration.lldb_framework_path
and configuration.lldb_libs_dir respectively.

Differential revision: https://reviews.llvm.org/D86817

3 years ago[lldb] Dervice dotest.py path from config.lldb_src_root (NFC)
Jonas Devlieghere [Fri, 28 Aug 2020 22:42:43 +0000 (15:42 -0700)]
[lldb] Dervice dotest.py path from config.lldb_src_root (NFC)

3 years agoAdd Location, Region and Block to MLIR Python bindings.
Stella Laurenzo [Thu, 27 Aug 2020 06:48:42 +0000 (23:48 -0700)]
Add Location, Region and Block to MLIR Python bindings.

* This is just enough to create regions/blocks and iterate over them.
* Does not yet implement the preferred iteration strategy (python pseudo containers).
* Refinements need to come after doing basic mappings of operations and values so that the whole hierarchy can be used.

Differential Revision: https://reviews.llvm.org/D86683

3 years agoGlobalISel: Combine out redundant sext_inreg
Matt Arsenault [Fri, 28 Aug 2020 14:43:17 +0000 (10:43 -0400)]
GlobalISel: Combine out redundant sext_inreg

The scalar tests don't work yet, since computeNumSignBits apparently
doesn't handle sextload yet, and sext folds into the load first.

3 years ago[early-ifcvt] Add OptRemarks
Jon Roelofs [Fri, 28 Aug 2020 21:22:32 +0000 (15:22 -0600)]
[early-ifcvt] Add OptRemarks

3 years agoAMDGPU: Fix incorrectly deleting copies after spilling SGPR tuples
Matt Arsenault [Fri, 28 Aug 2020 18:58:33 +0000 (14:58 -0400)]
AMDGPU: Fix incorrectly deleting copies after spilling SGPR tuples

The implicit def of the super register would appear to kill any live
uses of components before the spill, and would be deleted by
MachineCopyPropagation. We need to add implicit uses of the super
register, similarly to what copyPhysReg does. VGPR tuples appear to be
correctly handled already. I need to double check the SGPR->memory
path.

3 years ago[UpdateTestChecks] include { in function signature check line
sstefan1 [Fri, 28 Aug 2020 08:19:04 +0000 (10:19 +0200)]
[UpdateTestChecks] include { in function signature check line

After D85099, if we have attribute group in the function signature that hasn't
been seen before, and later a callsite with the same attribute group, filecheck will evaluate
the first attribute group to for example '#0 {'. We now include { in the args_and_sig group to avoid this.

Differential Revision: https://reviews.llvm.org/D86769

3 years agoReland "[test] Exit with an error if no tests are run."
Jordan Rupprecht [Fri, 28 Aug 2020 21:27:37 +0000 (14:27 -0700)]
Reland "[test] Exit with an error if no tests are run."

This reverts commit a06c28df3e8c85ceb665d3d9a1ebc2853dfd87a9 (reland adb5c23f8c0d60eeec41dcbe21d1b26184e1c97d).

The issue with PExpect tests on Windows should be fixed with e5e05ecf65aba836802154279efbc8cbce6fe2ea.

3 years ago[test] Pin some RUNs in potential.ll to legacy PM
Arthur Eubanks [Fri, 28 Aug 2020 21:05:10 +0000 (14:05 -0700)]
[test] Pin some RUNs in potential.ll to legacy PM

There are corresponding NPM RUNs.

3 years agoRename AnalysisManager::slice in AnalysisManager::nest (NFC)
Mehdi Amini [Fri, 28 Aug 2020 20:17:38 +0000 (20:17 +0000)]
Rename AnalysisManager::slice in AnalysisManager::nest (NFC)

The naming wasn't reflecting the intent of this API, "nest" is aligning
it with the pass manager API.

3 years agoAdd new warning for compound punctuation tokens that are split across macro expansion...
Richard Smith [Thu, 27 Aug 2020 23:08:11 +0000 (16:08 -0700)]
Add new warning for compound punctuation tokens that are split across macro expansions or split by whitespace.

For example:

    #define FOO(x) (x)
    FOO({});

... forms a statement-expression after macro expansion. This warning
applies to '({' and '})' delimiting statement-expressions, '[[' and ']]'
delimiting attributes, and '::*' introducing a pointer-to-member.

The warning for forming these compound tokens across macro expansions
(or across files!) is enabled by default; the warning for whitespace
within the tokens is not, but is included in -Wall.

Differential Revision: https://reviews.llvm.org/D86751

3 years ago[Attributes] Add a method to check if an Attribute has AttrKind None. Use instead...
Craig Topper [Fri, 28 Aug 2020 20:02:42 +0000 (13:02 -0700)]
[Attributes] Add a method to check if an Attribute has AttrKind None. Use instead of hasAttribute(Attribute::None)

There's a special case in hasAttribute for None when pImpl is null. If pImpl is not null we dispatch to pImpl->hasAttribute which will always return false for Attribute::None.

So if we just want to check for None its sufficient to just check that pImpl is null. Which can even be done inline.

This patch adds a helper for that case which I hope will speed up our getSubtargetImpl implementations.

Differential Revision: https://reviews.llvm.org/D86744

3 years ago[LLD][PowerPC] Remove redundant file write out in the test cases
Victor Huang [Fri, 28 Aug 2020 20:09:17 +0000 (15:09 -0500)]
[LLD][PowerPC] Remove redundant file write out in the test cases

Fix the time out test failure on lld-x86_64-freebsd build bot.

Peer reviewed by: stefanp, nemanjai

3 years ago[ObjCARCOpt] Port objc-arc to NPM
Arthur Eubanks [Wed, 15 Jul 2020 16:33:48 +0000 (09:33 -0700)]
[ObjCARCOpt] Port objc-arc to NPM

Since doInitialization() in the legacy pass modifies the module, the NPM
pass is a Module pass.

Reviewed By: ahatanak, ychen

Differential Revision: https://reviews.llvm.org/D86178

3 years ago[SROA] Improve handleling of assumes bundles by SROA
Tyker [Fri, 28 Aug 2020 19:46:16 +0000 (21:46 +0200)]
[SROA] Improve handleling of assumes bundles by SROA

This patch fixes this crash https://gcc.godbolt.org/z/Ps8d1e
And gives SROA the ability to remove assumes if it allows promoting an alloca to register
Without removing assumes when it can't promote to register.

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D86570

3 years ago[InstCombine] usub.sat(a, b) + b => umax(a, b) (PR42178)
Nikita Popov [Wed, 26 Aug 2020 20:27:52 +0000 (22:27 +0200)]
[InstCombine] usub.sat(a, b) + b => umax(a, b) (PR42178)

Fixes https://bugs.llvm.org/show_bug.cgi?id=42178 by folding
usub.sat(a, b) + b to umax(a, b). The backend will expand umax
back to usubsat if that is profitable.

We may also want to handle uadd.sat(a, b) - b in the future.

Differential Revision: https://reviews.llvm.org/D63060

3 years ago[LIBOMPTARGET]Do not try to optimize bases for the next parameters.
Alexey Bataev [Fri, 28 Aug 2020 12:53:42 +0000 (08:53 -0400)]
[LIBOMPTARGET]Do not try to optimize bases for the next parameters.

PrivateArgumentManager shall immediately allocate firstprivates if they
are bases for the next parameters and the next paramaters rely on the
fact that the base musst be allocated already.

Differential Revision: https://reviews.llvm.org/D86781

3 years agoSkip analysis re-computation when no changes are reported
serge-sans-paille [Fri, 28 Aug 2020 08:24:48 +0000 (10:24 +0200)]
Skip analysis re-computation when no changes are reported

This is a follow-up to https://reviews.llvm.org/D80707, generalized to
CallGraphSCC, Loop and Region

Differential Revision: https://reviews.llvm.org/D86442

3 years ago[ARM] Skip combining base updates for vld1x NEON intrinsics
Sjoerd Meijer [Fri, 28 Aug 2020 13:51:52 +0000 (14:51 +0100)]
[ARM] Skip combining base updates for vld1x NEON intrinsics

Skip this for now, to avoid a backend crash in:

  UNREACHABLE executed at llvm/lib/Target/ARM/ARMISelLowering.cpp:13412

This should fix PR45824.

Differential Revision: https://reviews.llvm.org/D86784

3 years agoStrength-reduce SmallVectors to arrays. NFCI.
Benjamin Kramer [Fri, 28 Aug 2020 19:14:20 +0000 (21:14 +0200)]
Strength-reduce SmallVectors to arrays. NFCI.

3 years ago[CodeGenPrepare] Zap the argument of llvm.assume when deleting it
Benjamin Kramer [Fri, 28 Aug 2020 17:31:36 +0000 (19:31 +0200)]
[CodeGenPrepare] Zap the argument of llvm.assume when deleting it

We know that the argument is mostly likely dead, so we can purge it
early. Otherwise it would make it to codegen, and can block further
optimizations.

3 years ago[lldb/test] Use shorter test case names in TestStandardUnwind
Jordan Rupprecht [Fri, 28 Aug 2020 18:49:50 +0000 (11:49 -0700)]
[lldb/test] Use shorter test case names in TestStandardUnwind

TestStandardUnwind uses the full absolute path to a set of C/C++ files as the test case name, which in turn is used in the name of a log file. When the source file is long, and the directory where log files are stored is also long, this causes an OSError because the log filename is too long.

Reviewed By: JDevlieghere

Differential Revision: https://reviews.llvm.org/D86752