platform/upstream/llvm.git
3 years agolibclc/spirv: Add various functions
Daniel Stone [Thu, 10 Sep 2020 03:15:41 +0000 (23:15 -0400)]
libclc/spirv: Add various functions

Adds fma,fmod,ldexp.

Reviewer: jenatali jvesely

Differential Revision: https://reviews.llvm.org/D85911

3 years ago[mlir][Linalg] Add Utility method to get loop ranges for a LinalgOp.
MaheshRavishankar [Thu, 10 Sep 2020 05:20:12 +0000 (22:20 -0700)]
[mlir][Linalg] Add Utility method to get loop ranges for a LinalgOp.

Also refactor the getViewSizes method to work on LinalgOp instead of
being a templated version. Keeping the templated version for
compatibility.

Differential Revision: https://reviews.llvm.org/D87303

3 years ago[PowerPC] [FPEnv] Disable strict FP mutation by default
Qiu Chaofan [Thu, 10 Sep 2020 05:28:09 +0000 (13:28 +0800)]
[PowerPC] [FPEnv] Disable strict FP mutation by default

22a0edd0 introduced a config IsStrictFPEnabled, which controls the
strict floating point mutation (transforming some strict-fp operations
into non-strict in ISel). This patch disables the mutation by default
since we've finished PowerPC strict-fp enablement in backend.

Reviewed By: uweigand

Differential Revision: https://reviews.llvm.org/D87222

3 years ago[lit] Use correct variable name for libxml2
Petr Hosek [Thu, 10 Sep 2020 05:03:13 +0000 (22:03 -0700)]
[lit] Use correct variable name for libxml2

This addresses an issue introduced in c4d7536136b3.

3 years ago[CMake] Simplify CMake handling for libxml2
Petr Hosek [Fri, 24 Jul 2020 22:47:38 +0000 (15:47 -0700)]
[CMake] Simplify CMake handling for libxml2

This matches the changes made to handling of zlib done in 10b1b4a
where we rely on find_package and the imported target rather than
manually appending the library and include paths. The use of
LLVM_LIBXML2_ENABLED has been replaced by LLVM_ENABLE_LIBXML2
thus reducing the number of variables.

Differential Revision: https://reviews.llvm.org/D84563

3 years ago[NFC] Move definition of variable now only used in debug builds
Jordan Rupprecht [Thu, 10 Sep 2020 03:23:59 +0000 (20:23 -0700)]
[NFC] Move definition of variable now only used in debug builds

3 years agomlir/Transforms/BufferPlacement.h: Add missing override
David Blaikie [Thu, 10 Sep 2020 01:17:44 +0000 (18:17 -0700)]
mlir/Transforms/BufferPlacement.h: Add missing override

3 years ago[gcov] Delete flush_fn_list (unused since D83149)
Fangrui Song [Thu, 10 Sep 2020 00:24:45 +0000 (17:24 -0700)]
[gcov] Delete flush_fn_list (unused since D83149)

3 years ago[llvm-install-name-tool] Add -V flag
Alexander Shaposhnikov [Thu, 10 Sep 2020 00:11:08 +0000 (17:11 -0700)]
[llvm-install-name-tool] Add -V flag

This diff adds -V alias for --version to make llvm-install-name-tool
consistent with other tools (llvm-objcopy, llvm-strip, etc).

Test plan: make check-all

Differential revision: https://reviews.llvm.org/D87264

3 years ago[gcov] Delete unused llvm_gcda_increment_indirect_counter
Fangrui Song [Wed, 9 Sep 2020 23:57:33 +0000 (16:57 -0700)]
[gcov] Delete unused llvm_gcda_increment_indirect_counter

It has been unused since r157564 (2012).

3 years agoAMDGPU: Hoist check for VGPRs
Matt Arsenault [Wed, 9 Sep 2020 20:58:52 +0000 (16:58 -0400)]
AMDGPU: Hoist check for VGPRs

3 years agoAMDGPU: Skip all meta instructions in hazard recognizer
Matt Arsenault [Wed, 9 Sep 2020 22:08:48 +0000 (18:08 -0400)]
AMDGPU: Skip all meta instructions in hazard recognizer

This was not adding a necessary nop due to thinking the kill counted.

3 years agoAMDGPU: Fix inserting waitcnts before kill uses
Matt Arsenault [Wed, 9 Sep 2020 14:24:35 +0000 (10:24 -0400)]
AMDGPU: Fix inserting waitcnts before kill uses

3 years ago[Asan] Return nullptr for invalid chunks
Vitaly Buka [Wed, 9 Sep 2020 23:17:37 +0000 (16:17 -0700)]
[Asan] Return nullptr for invalid chunks

CHUNK_ALLOCATED. CHUNK_QUARANTINE are only states
which make AsanChunk useful for GetAsanChunk callers.
In either case member of AsanChunk are not useful.

Fix few cases which didn't expect nullptr. Most of the callers are already
expects nullptr.

Reviewed By: morehouse

Differential Revision: https://reviews.llvm.org/D87135

3 years ago[ValueTracking] isKnownNonZero, computeKnownBits for freeze
Juneyoung Lee [Wed, 9 Sep 2020 17:55:06 +0000 (02:55 +0900)]
[ValueTracking] isKnownNonZero, computeKnownBits for freeze

This implements support for isKnownNonZero, computeKnownBits when freeze is involved.

```
  br (x != 0), BB1, BB2
BB1:
  y = freeze x
```

In the above program, we can say that y is non-zero. The reason is as follows:

(1) If x was poison, `br (x != 0)` raised UB
(2) If x was fully undef, the branch again raised UB
(3) If x was non-zero partially undef, say `undef | 1`, `freeze x` will return a nondeterministic value which is also non-zero.
(4) If x was just a concrete value, it is trivial

Reviewed By: nikic

Differential Revision: https://reviews.llvm.org/D75808

3 years ago[libunwind] Bare-metal DWARF: set dso_base to 0
Ryan Prichard [Wed, 9 Sep 2020 22:43:35 +0000 (15:43 -0700)]
[libunwind] Bare-metal DWARF: set dso_base to 0

Previously, DwarfFDECache::findFDE used 0 as a special value meaning
"search the entire cache, including dynamically-registered FDEs".
Switch this special value to -1, which doesn't make sense as a DSO
base.

Fixes PR47335.

Reviewed By: compnerd, #libunwind

Differential Revision: https://reviews.llvm.org/D86748

3 years ago[AMDGPU] Fix for folding v2.16 literals.
dfukalov [Fri, 4 Sep 2020 19:44:01 +0000 (22:44 +0300)]
[AMDGPU] Fix for folding v2.16 literals.

It was found some packed immediate operands (e.g. `<half 1.0, half 2.0>`) are
incorrectly processed so one of two packed values were lost.

Introduced new function to check immediate 32-bit operand can be folded.
Converted condition about current op_sel flags value to fall-through.

Fixes: SWDEV-247595

Reviewed By: rampitec

Differential Revision: https://reviews.llvm.org/D87158

3 years agoMark masked.{store,scatter,compressstore} intrinsics as write-only
Krzysztof Parzyszek [Wed, 9 Sep 2020 22:27:04 +0000 (17:27 -0500)]
Mark masked.{store,scatter,compressstore} intrinsics as write-only

3 years ago[AArch64][GlobalISel] Share address mode selection code for memops
Jessica Paquette [Wed, 9 Sep 2020 16:45:54 +0000 (09:45 -0700)]
[AArch64][GlobalISel] Share address mode selection code for memops

We were missing support for the G_ADD_LOW + ADRP folding optimization in the
manual selection code for G_LOAD, G_STORE, and G_ZEXTLOAD.

As a result, we were missing cases like this:

```
@foo = external hidden global i32*
define void @baz(i32* %0) {
store i32* %0, i32** @foo
ret void
}
```

https://godbolt.org/z/16r7ad

This functionality already existed in the addressing mode functions for the
importer. So, this patch makes the manual selection code use
`selectAddrModeIndexed` rather than duplicating work.

This is a 0.2% geomean code size improvement for CTMark at -O3.

There is one code size increase (0.1% on lencod) which is likely because
`selectAddrModeIndexed` doesn't look through constants.

Differential Revision: https://reviews.llvm.org/D87397

3 years ago[DSE,MemorySSA] Handle atomic stores explicitly in isReadClobber.
Florian Hahn [Wed, 9 Sep 2020 18:36:41 +0000 (19:36 +0100)]
[DSE,MemorySSA] Handle atomic stores explicitly in isReadClobber.

Atomic stores are modeled as MemoryDef to model the fact that they may
not be reordered, depending on the ordering constraints.

Atomic stores that are monotonic or weaker do not limit re-ordering, so
we do not have to treat them as potential read clobbers.

Note that llvm/test/Transforms/DeadStoreElimination/MSSA/atomic.ll
already contains a set of negative test cases.

Reviewed By: asbirlea

Differential Revision: https://reviews.llvm.org/D87386

3 years ago[DAGCombiner] Fold fmin/fmax of NaN
Nikita Popov [Wed, 9 Sep 2020 20:35:02 +0000 (22:35 +0200)]
[DAGCombiner] Fold fmin/fmax of NaN

fminnum(X, NaN) is X, fminimum(X, NaN) is NaN. This mirrors the
behavior of existing InstSimplify folds.

This is expected to improve the reduction lowerings in D87391,
which use NaN as a neutral element.

Differential Revision: https://reviews.llvm.org/D87415

3 years ago[ARM] Add additional fmin/fmax with nan tests (NFC)
Nikita Popov [Wed, 9 Sep 2020 21:48:44 +0000 (23:48 +0200)]
[ARM] Add additional fmin/fmax with nan tests (NFC)

Adding these to ARM which has both FMINNUM and FMINIMUM.

3 years ago[lldb] Pass the arch as part of the triple in the ARCH_CFLAGS
Jonas Devlieghere [Wed, 9 Sep 2020 21:41:00 +0000 (14:41 -0700)]
[lldb] Pass the arch as part of the triple in the ARCH_CFLAGS

3 years agoAdd REQUIRES: asserts to a test that uses an asserts only flag.
Amara Emerson [Wed, 9 Sep 2020 21:30:47 +0000 (14:30 -0700)]
Add REQUIRES: asserts to a test that uses an asserts only flag.

3 years ago[GlobalISel] Enable usage of BranchProbabilityInfo in IRTranslator.
Amara Emerson [Fri, 28 Aug 2020 23:21:34 +0000 (16:21 -0700)]
[GlobalISel] Enable usage of BranchProbabilityInfo in IRTranslator.

We weren't using this before, so none of the MachineFunction CFG edges had the
branch probability information added. As a result, block placement later in the
pipeline was flying blind.

This is enabled only with optimizations enabled like SelectionDAG.

Differential Revision: https://reviews.llvm.org/D86824

3 years ago[X86] Add tests for minnum/maxnum with constant NaN (NFC)
Nikita Popov [Wed, 9 Sep 2020 20:35:56 +0000 (22:35 +0200)]
[X86] Add tests for minnum/maxnum with constant NaN (NFC)

3 years ago[GlobalISel][IRTranslator] Generate better conditional branch lowering.
Amara Emerson [Mon, 24 Aug 2020 21:10:38 +0000 (14:10 -0700)]
[GlobalISel][IRTranslator] Generate better conditional branch lowering.

This is a port of the functionality from SelectionDAG, which tries to find
a tree of conditions from compares that are then combined using OR or AND,
before using that result as the input to a branch. Instead of naively
lowering the code as is, this change converts that into a sequence of
conditional branches on the sub-expressions of the tree.

Like SelectionDAG, we re-use the case block codegen functionality from
the switch lowering utils, which causes us to generate some different code.
The result of which I've tried to mitigate in earlier combine patches.

Differential Revision: https://reviews.llvm.org/D86665

3 years ago[GlobalISel] Rewrite the elide-br-by-swapping-icmp-ops combine to do less.
Amara Emerson [Mon, 24 Aug 2020 17:46:50 +0000 (10:46 -0700)]
[GlobalISel] Rewrite the elide-br-by-swapping-icmp-ops combine to do less.

This combine previously tried to take sequences like:
  %cond = G_ICMP pred, a, b
  G_BRCOND %cond, %truebb
  G_BR %falsebb
%truebb:
  ...
%falsebb:
  ...

and by inverting the compare predicate and swapping branch targets, delete the
G_BR and instead have a single conditional branch to the falsebb. Since in an
earlier patch we have a combine to fold not(icmp) into just an inverted icmp,
we don't need this combine to do as much. This patch instead generalizes the
combine by just looking for:
  G_BRCOND %cond, %truebb
  G_BR %falsebb
%truebb:
  ...
%falsebb:
  ...

and then inverting the condition using a not (xor). The xor can be folded away
in a separate combine. This change also lets us avoid some optimization code
in the IRTranslator.

I also think that deleting G_BRs in the combiner is unnecessary. That's
something that targets can decide to do at selection time and could simplify
generic code in future.

Differential Revision: https://reviews.llvm.org/D86664

3 years ago[libc][MPFRWrapper] Provide a way to include MPFR header in downstream repos.
Siva Chandra Reddy [Wed, 9 Sep 2020 18:28:14 +0000 (11:28 -0700)]
[libc][MPFRWrapper] Provide a way to include MPFR header in downstream repos.

Reviewed By: asteinhauser

Differential Revision: https://reviews.llvm.org/D87412

3 years agoWordsmith RegionBranchOpInterface verification errors
Sean Silva [Tue, 8 Sep 2020 22:49:50 +0000 (15:49 -0700)]
Wordsmith RegionBranchOpInterface verification errors

I was having a lot of trouble parsing the messages. In particular, the
messages like:

```
<stdin>:3:8: error: 'scf.if' op  along control flow edge from Region #0 to scf.if source #1 type '!npcomprt.tensor' should match input #1 type 'tensor<?xindex>'
```

In particular, one thing that kept catching me was parsing the "to scf.if
source #1 type" as one thing, but really it is
"to parent results: source type #1".

Differential Revision: https://reviews.llvm.org/D87334

3 years ago[X86] Add support for using fast short rep mov for memcpy lowering.
Hiroshi Yamauchi [Fri, 21 Aug 2020 19:44:36 +0000 (12:44 -0700)]
[X86] Add support for using fast short rep mov for memcpy lowering.

Disabled by default behind an option.

Differential Revision: https://reviews.llvm.org/D86883

3 years ago[AMDGPU] Correct gfx1031 XNACK setting documentation
Tony [Sat, 5 Sep 2020 22:53:47 +0000 (22:53 +0000)]
[AMDGPU] Correct gfx1031 XNACK setting documentation

- gfx1031 does not support XNACK.

Differential Revision: https://reviews.llvm.org/D87198

3 years ago[MC] Resolve the difference of symbols in consecutive MCDataFragements
Jian Cai [Wed, 9 Sep 2020 18:58:22 +0000 (11:58 -0700)]
[MC] Resolve the difference of symbols in consecutive MCDataFragements

Try to resolve the difference of two symbols in consecutive MCDataFragments.
This is important for an idiom like "foo:instr; .if . - foo; instr; .endif"
(https://bugs.llvm.org/show_bug.cgi?id=43795).

Reviewed By: nickdesaulniers

Differential Revision: https://reviews.llvm.org/D69411

3 years ago[gcov] Give the __llvm_gcov_ctr load instruction a name for more readable output
Fangrui Song [Wed, 9 Sep 2020 19:31:25 +0000 (12:31 -0700)]
[gcov] Give the __llvm_gcov_ctr load instruction a name for more readable output

3 years ago[Hexagon] Account for truncating pairs to non-pairs when widening truncates
Krzysztof Parzyszek [Wed, 9 Sep 2020 19:28:00 +0000 (14:28 -0500)]
[Hexagon] Account for truncating pairs to non-pairs when widening truncates

Added missing selection patterns for vpackl.

3 years ago[InstCombine] add tests for add/sub-of-shl; NFC
Sanjay Patel [Tue, 8 Sep 2020 21:00:40 +0000 (17:00 -0400)]
[InstCombine] add tests for add/sub-of-shl; NFC

3 years ago[gcov] Don't split entry block; add a synthetic entry block instead
Fangrui Song [Wed, 9 Sep 2020 19:06:39 +0000 (12:06 -0700)]
[gcov] Don't split entry block; add a synthetic entry block instead

The entry block is split at the first instruction where `shouldKeepInEntry`
returns false. The created basic block has a br jumping to the original entry
block. The new basic block causes the function label line and the other entry
block lines to be covered by different basic blocks, which can affect line
counts with special control flows (fork/exec in the entry block requires
heuristics in llvm-cov gcov to get consistent line counts).

  int main() { // BB0
    return 0;  // BB2 (due to entry block splitting)
  }
  // BB1 is the exit block (since gcov 4.8)

This patch adds a synthetic entry block (like PGOInstrumentation and GCC) and
inserts an edge from the synthetic entry block to the original entry block. We
can thus remove the tricky `shouldKeepInEntry` and entry block splitting. The
number of basic blocks does not change, but the emitted .gcno files will be
smaller because we can save one GCOV_TAG_LINES tag.

  // BB0 is the synthetic entry block with a single edge to BB2
  int main() { // BB2
    return 0;  // BB2
  }
  // BB1 is the exit block (since gcov 4.8)

3 years agoRevert a test using padding bits in atomics
Olivier Giroux [Wed, 9 Sep 2020 19:14:53 +0000 (12:14 -0700)]
Revert a test using padding bits in atomics

3 years ago[NFC] Separate bitcode reading for FUNC_CODE_INST_CMPXCHG(_OLD)
Guillaume Chatelet [Wed, 9 Sep 2020 19:10:30 +0000 (19:10 +0000)]
[NFC] Separate bitcode reading for FUNC_CODE_INST_CMPXCHG(_OLD)

This is preparatory work to unable storing alignment for AtomicCmpXchgInst.
See D83136 for context and bug: https://bugs.llvm.org/show_bug.cgi?id=27168

This is the fixed version of D83375, which was submitted and reverted.

Differential Revision: https://reviews.llvm.org/D87373

3 years agoImplements [[likely]] and [[unlikely]] in IfStmt.
Mark de Wever [Wed, 9 Sep 2020 17:12:32 +0000 (19:12 +0200)]
Implements [[likely]] and [[unlikely]] in IfStmt.

This is the initial part of the implementation of the C++20 likelihood
attributes. It handles the attributes in an if statement.

Differential Revision: https://reviews.llvm.org/D85091

3 years ago[DSE] Explicitly not use MSSA in testcase for now
Krzysztof Parzyszek [Wed, 9 Sep 2020 18:44:29 +0000 (13:44 -0500)]
[DSE] Explicitly not use MSSA in testcase for now

It fails for some reason, but it shouldn't stop switching to MSSA in DSE.

3 years ago[debugserver] Extract function for default launch flavor
Dave Lee [Tue, 8 Sep 2020 20:43:15 +0000 (13:43 -0700)]
[debugserver] Extract function for default launch flavor

Extract a function for turning `eLaunchFlavorDefault` into a concreate `eLaunchFlavor` value.

This new function encapsulates the few compile time variables involved, and also prevents clang unused code diagnostics.

Differential Revision: https://reviews.llvm.org/D87327

3 years ago[DSE] Handle masked stores
Krzysztof Parzyszek [Wed, 2 Sep 2020 19:06:58 +0000 (14:06 -0500)]
[DSE] Handle masked stores

3 years agoRevert "[Attributor] Re-enable a run line in noalias.ll"
Johannes Doerfert [Wed, 9 Sep 2020 15:09:30 +0000 (10:09 -0500)]
Revert "[Attributor] Re-enable a run line in noalias.ll"

The underlying issue is still there, just hides on most systems, even
some Windows builds :(

See:
http://lab.llvm.org:8011/builders/llvm-clang-x86_64-expensive-checks-win/builds/25479/steps/test-check-all/logs/FAIL%3A%20LLVM%3A%3Anoalias.ll

This reverts commit 2600c9e2efce1dc4c64870b00a45ae0082c685fc.

3 years ago[libc++] Fix variant benchmark build for some configurations.
Eric Fiselier [Wed, 9 Sep 2020 18:25:17 +0000 (14:25 -0400)]
[libc++] Fix variant benchmark build for some configurations.

The benchmarks expect to be built in C++17 or newer, but this
isn't always how CMake configures the C++ dialect. Instead
we need to explicitly set the CXX_STANDARD target property.

3 years ago[lldb/Docs] Correct LLDB_ENABLE_TESTS to LLDB_INCLUDE_TESTS
Dave Lee [Wed, 9 Sep 2020 05:53:08 +0000 (22:53 -0700)]
[lldb/Docs] Correct LLDB_ENABLE_TESTS to LLDB_INCLUDE_TESTS

Fix references to LLDB_ENABLE_TESTS.

Differential Revision: https://reviews.llvm.org/D87345

3 years ago[libc][NFC] Add spec files as dependencies of integration test.
Siva Chandra Reddy [Wed, 9 Sep 2020 17:19:37 +0000 (10:19 -0700)]
[libc][NFC] Add spec files as dependencies of integration test.

3 years ago[DAGCombine] Skip re-visiting EntryToken to avoid compile time explosion
Ulrich Weigand [Wed, 9 Sep 2020 17:09:52 +0000 (19:09 +0200)]
[DAGCombine] Skip re-visiting EntryToken to avoid compile time explosion

During the main DAGCombine loop, whenever a node gets replaced, the new
node and all its users are pushed onto the worklist.  Omit this if the
new node is the EntryToken (e.g. if a store managed to get optimized
out), because re-visiting the EntryToken and its users will not uncover
any additional opportunities, but there may be a large number of such
users, potentially causing compile time explosion.

This compile time explosion showed up in particular when building the
SingleSource/UnitTests/matrix-types-spec.cpp test-suite case on any
platform without SIMD vector support.

Reviewed By: arsenm
Differential Revision: https://reviews.llvm.org/D86963

3 years agoUpdate atomic feature macros, synopsis, signatures to match C++20. Improve test cover...
Olivier Giroux [Wed, 9 Sep 2020 17:00:09 +0000 (10:00 -0700)]
Update atomic feature macros, synopsis, signatures to match C++20. Improve test coverage for non-lock-free atomics.

3 years ago[NFC][MLInliner] Don't initialize in an assert.
Mircea Trofin [Wed, 9 Sep 2020 16:55:06 +0000 (09:55 -0700)]
[NFC][MLInliner] Don't initialize in an assert.

Since the build bots have assertions enabled, this flew under the radar.

3 years ago[libc][obvious] Fix strtok_r signature in the spec.
Siva Chandra Reddy [Wed, 9 Sep 2020 15:32:51 +0000 (08:32 -0700)]
[libc][obvious] Fix strtok_r signature in the spec.

3 years ago[OpenMP] Begin Printing Information Dumps In Libomptarget and Plugins
Joseph Huber [Fri, 4 Sep 2020 19:03:49 +0000 (15:03 -0400)]
[OpenMP] Begin Printing Information Dumps In Libomptarget and Plugins

Summary:
This patch starts adding support for adding information dumps to libomptarget
and rtl plugins. The information printing is controlled by the
LIBOMPTARGET_INFO environment variable introduced in D86483. The goal of this
patch is to provide the user with additional information about the device
during kernel execution and providing the user with information dumps in the
case of failure. This patch added the ability to dump the pointer mapping table
as well as printing the number of blocks and threads in the cuda RTL.

Reviewers: jdoerfort gkistanova ye-luo

Subscribers: guansong openmp-commits sstefan1 yaxunl ye-luo

Tags: #OpenMP

Differential Revision: https://reviews.llvm.org/D87165

3 years agoX86CallFrameOptimization.cpp - use const references where possible. NFCI.
Simon Pilgrim [Wed, 9 Sep 2020 15:33:19 +0000 (16:33 +0100)]
X86CallFrameOptimization.cpp - use const references where possible. NFCI.

3 years ago[DSE] Add testcase that uses masked loads and stores
Krzysztof Parzyszek [Wed, 9 Sep 2020 15:23:00 +0000 (10:23 -0500)]
[DSE] Add testcase that uses masked loads and stores

3 years ago[mlir][Linalg] Reduction dimensions specified in TC definition of ConvOps.
Jakub Lichman [Tue, 8 Sep 2020 15:04:35 +0000 (15:04 +0000)]
[mlir][Linalg] Reduction dimensions specified in TC definition of ConvOps.

This commit specifies reduction dimensions for ConvOps. This prevents
running reduction loops in parallel and enables easier detection of kernel dimensions
which we will need later on.

Differential Revision: https://reviews.llvm.org/D87288

3 years agoX86FrameLowering::adjustStackWithPops - cleanup auto usage. NFCI.
Simon Pilgrim [Wed, 9 Sep 2020 15:13:55 +0000 (16:13 +0100)]
X86FrameLowering::adjustStackWithPops - cleanup auto usage. NFCI.

Don't use auto for non-obvious types, and use const references.

3 years ago[PowerPC] Fix STRICT_FRINT/STRICT_FNEARBYINT lowering
Qiu Chaofan [Wed, 9 Sep 2020 14:38:58 +0000 (22:38 +0800)]
[PowerPC] Fix STRICT_FRINT/STRICT_FNEARBYINT lowering

In standard C library, both rint and nearbyint returns rounding result
in current rounding mode. But nearbyint never raises inexact exception.
On PowerPC, x(v|s)r(d|s)pic may modify FPSCR XX, raising inexact
exception. So we can't select constrained fnearbyint into xvrdpic.

One exception here is xsrqpi, which will not raise inexact exception, so
fnearbyint f128 is okay here.

Reviewed By: uweigand

Differential Revision: https://reviews.llvm.org/D87220

3 years ago[AMDGPU] Simplify S_SETREG_B32 case in EmitInstrWithCustomInserter
Jay Foad [Wed, 9 Sep 2020 13:55:48 +0000 (14:55 +0100)]
[AMDGPU] Simplify S_SETREG_B32 case in EmitInstrWithCustomInserter

NFC.

3 years ago[FLANG] Generate error for invalid selector.
compinder [Wed, 9 Sep 2020 13:32:51 +0000 (19:02 +0530)]
[FLANG] Generate error for invalid selector.

Fix of PR47339

Differential Revision: https://reviews.llvm.org/D87073/new/

3 years ago[AMDGPU][MC] Improved diagnostic messages for invalid registers
Dmitry Preobrazhensky [Wed, 9 Sep 2020 12:58:12 +0000 (15:58 +0300)]
[AMDGPU][MC] Improved diagnostic messages for invalid registers

Corrected parser to issue meaningful error messages for invalid and malformed registers.

See bug 41303: https://bugs.llvm.org/show_bug.cgi?id=41303

Reviewers: arsenm, rampitec

Differential Revision: https://reviews.llvm.org/D87234

3 years ago[MachinePipeliner] Fix II_setByPragma initialization
Alon Kom [Wed, 9 Sep 2020 13:17:53 +0000 (13:17 +0000)]
[MachinePipeliner] Fix II_setByPragma initialization

II_setByPragma was not reset between 2 calls of the MachinePipleiner pass

Reviewed By: bcahoon

Differential Revision: https://reviews.llvm.org/D87088

3 years ago[Statepoints] Update DAG root after emitting statepoint.
Denis Antrushin [Mon, 7 Sep 2020 15:04:07 +0000 (22:04 +0700)]
[Statepoints] Update DAG root after emitting statepoint.

Since we always generate CopyToRegs for statepoint results,
we must update DAG root after emitting statepoint, so that
these copies are scheduled before any possible local uses.
Note: getControlRoot() flushes all PendingExports, not only
those we generates for relocates. If that'll become a problem,
we can change it to flushing relocate exports only.

Reviewed By: reames

Differential Revision: https://reviews.llvm.org/D87251

3 years agoCommandLine.h - use auto const reference in ValuesClass::apply for range loop. NFCI.
Simon Pilgrim [Wed, 9 Sep 2020 13:20:41 +0000 (14:20 +0100)]
CommandLine.h - use auto const reference in ValuesClass::apply for range loop. NFCI.

3 years agoRevert "[AMDGPU] Support disassembly for AMDGPU kernel descriptors"
Ronak Chauhan [Wed, 9 Sep 2020 12:31:28 +0000 (18:01 +0530)]
Revert "[AMDGPU] Support disassembly for AMDGPU kernel descriptors"

This reverts commit 487a80531006add8102d50dbcce4b6fd729ab1f6.

Tests fail on big endian machines.

3 years ago[KnownBits] Move SelectionDAG::computeKnownBits ISD::ABS handling to KnownBits::abs
Simon Pilgrim [Wed, 9 Sep 2020 12:22:39 +0000 (13:22 +0100)]
[KnownBits] Move SelectionDAG::computeKnownBits ISD::ABS handling to KnownBits::abs

Move the ISD::ABS handling to a KnownBits::abs handler, to simplify future implementations in ValueTracking/GlobalISel.

3 years agoAPInt.h - return directly from clearUnusedBits in single word cases. NFCI.
Simon Pilgrim [Wed, 9 Sep 2020 11:21:55 +0000 (12:21 +0100)]
APInt.h - return directly from clearUnusedBits in single word cases. NFCI.

Consistently use the same pattern of returning *this from the clearUnusedBits() call to allow us to early out from the isSingleWord() path and avoid an else statement.

3 years ago[elf2yaml] Fix dumping a debug section whose name is not recognized.
Xing GUO [Wed, 9 Sep 2020 12:06:00 +0000 (20:06 +0800)]
[elf2yaml] Fix dumping a debug section whose name is not recognized.

If the debug section's name isn't recognized, it should be
dumped as a raw content section.

Reviewed By: jhenderson, grimar

Differential Revision: https://reviews.llvm.org/D87346

3 years ago[cmake] Use absolute paths for modules search
Diana Picus [Thu, 3 Sep 2020 11:39:29 +0000 (13:39 +0200)]
[cmake] Use absolute paths for modules search

For out of tree builds, the user generally needs to specify LLVM_DIR and
MLIR_DIR on the command line so that the correct LLVM and MLIR
installations are picked up.

If the provided paths are absolute, everything works fine, however for
buildbots it is customary to work with relative paths, and that makes it
difficult for CMake to find the right modules to include.

This patch changes CMakeLists.txt to convert LLVM_DIR and MLIR_DIR to
absolute paths before adding them to CMAKE_MODULE_PATH. The inputs are
assumed to be relative to the source directory (llvm-project/flang).

Differential Revision: https://reviews.llvm.org/D87083

3 years ago[mlir][Linalg] Integration tests for convolutions added.
Jakub Lichman [Tue, 8 Sep 2020 11:26:15 +0000 (11:26 +0000)]
[mlir][Linalg] Integration tests for convolutions added.

This commit introduces end-to-end integration tests for
convolutions that test multiple ways of ConvOps lowering.

Differential Revision: https://reviews.llvm.org/D87277

3 years ago[UnifyFunctionExitNodes] Fix Modified status for unreachable blocks
David Stenberg [Wed, 9 Sep 2020 08:59:41 +0000 (10:59 +0200)]
[UnifyFunctionExitNodes] Fix Modified status for unreachable blocks

If a function had at most one return block, the pass would return false
regardless if an unified unreachable block was created.

This patch fixes that by refactoring runOnFunction into two separate
helper functions for handling the unreachable blocks respectively the
return blocks, as suggested by @bjope in a review comment.

This was caught using the check introduced by D80916.

Reviewed By: serge-sans-paille

Differential Revision: https://reviews.llvm.org/D85818

3 years ago[BuildLibCalls] Add more noundef to library functions
Juneyoung Lee [Wed, 9 Sep 2020 11:31:51 +0000 (20:31 +0900)]
[BuildLibCalls] Add more noundef to library functions

This patch follows D85345 and adds more noundef attributes to return values/arguments of library functions
that are mostly about accessing the file system or processes.

A few functions like `chmod` or `times` use typedef `mode_t` and `clock_t`.
They are neither struct nor union, so they cannot contain undef even if they're lowered to iN in IR. So, it is fine to add noundef to them.

- clock_t's actual type is size_t (C17, 7.27.1.3), so it isn't struct or union.

- For mode_t, either int or long is used in practice because programmers use bit manipulation. So, I think it is okay that it's never aggregate in practice.

After this patch, the remaining library functions are those that eagerly participate in optimizations: they can be removed, reordered, or
introduced by a transformation from primitive IR operations.
For them, a few testings is needed, since it may not be valid to add noundef anymore even if C standard says it's okay.

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D85894

3 years ago[flang]Add Semantic Checks for OpenMP Allocate Clause
Irina Dobrescu [Wed, 9 Sep 2020 10:50:13 +0000 (11:50 +0100)]
[flang]Add Semantic Checks for OpenMP Allocate Clause

Reviewed By: kiranchandramohan, clementval, kiranktp, raghavendhra

Differential Revision: https://reviews.llvm.org/D86051

3 years ago[ValueTracking] Add UndefOrPoison/Poison-only version of relevant functions
Juneyoung Lee [Tue, 8 Sep 2020 02:41:19 +0000 (11:41 +0900)]
[ValueTracking] Add UndefOrPoison/Poison-only version of relevant functions

This patch adds isGuaranteedNotToBePoison and programUndefinedIfUndefOrPoison.

isGuaranteedNotToBePoison will be used at D75808. The latter function is used at isGuaranteedNotToBePoison.

Reviewed By: nikic

Differential Revision: https://reviews.llvm.org/D84242

3 years agoTrigramIndex.cpp - remove unnecessary includes. NFCI.
Simon Pilgrim [Wed, 9 Sep 2020 10:26:21 +0000 (11:26 +0100)]
TrigramIndex.cpp - remove unnecessary includes. NFCI.

TrigramIndex.h already includes most of these.

3 years agoARMTargetParser.cpp - use auto const references in for range loops. NFCI.
Simon Pilgrim [Wed, 9 Sep 2020 10:17:49 +0000 (11:17 +0100)]
ARMTargetParser.cpp - use auto const references in for range loops. NFCI.

Fix static analysis warnings about unnecessary copies.

3 years ago[APFloat] Fix uninitialized variable in IEEEFloat constructors
Simon Pilgrim [Wed, 9 Sep 2020 09:58:59 +0000 (10:58 +0100)]
[APFloat] Fix uninitialized variable in IEEEFloat constructors

Some constructors of IEEEFloat do not initialize member variable exponent.
Fix it by initializing exponent with the following values:

For NaNs, the `exponent` is `maxExponent+1`.
For Infinities, the `exponent` is `maxExponent+1`.
For Zeroes, the `exponent` is `maxExponent-1`.

Patch by: @nullptr.cpp (Yang Fan)

Differential Revision: https://reviews.llvm.org/D86997

3 years ago[NFC][Asan] Fit ChunkHeader into redzone
Vitaly Buka [Wed, 9 Sep 2020 10:06:46 +0000 (03:06 -0700)]
[NFC][Asan] Fit ChunkHeader into redzone

In code as-is min redzone and ChunkHeader are 16 byte.
This patch just makes sure that redzone is calculated correctly if we
extend ChunkHeader.

3 years ago[DomTree] Use SmallVector<DomTreeNodeBase *, 4> instead of std::vector.
Florian Hahn [Wed, 9 Sep 2020 09:24:49 +0000 (10:24 +0100)]
[DomTree] Use SmallVector<DomTreeNodeBase *, 4> instead of std::vector.

Currentl DomTreeNodeBase is using std::vectot to store it's children.
Using SmallVector should be more efficient in terms of compile-time.

A size of 4 seems to be the sweet-spot in terms of compile-time,
according to

http://llvm-compile-time-tracker.com/compare.php?from=9933188c90615c9c264ebb69117f09726e909a25&to=d7a801d027648877b20f0e00e822a7a64c58d976&stat=instructions

This results in the following geomean improvements

```
                       geomean insts     max rss
O3                          -0.31 %       +0.02 %
ReleaseThinLTO              -0.35 %       -0.12 %
ReleaseLTO                  -0.28 %       -0.12 %
O0                          -0.06 %       -0.02 %
NewPM O3                    -0.36 %       +0.05 %
ReleaseThinLTO (link only)  -0.44 %       -0.10 %
ReleaseLTO-g (link only):   -0.32 %       -0.03 %
```

I am not sure if there's any other benefits of using std::vector over
SmallVector.

Reviewed By: kuhar, asbirlea

Differential Revision: https://reviews.llvm.org/D87319

3 years ago[ARM] Fixup of a few test cases. NFC.
Sjoerd Meijer [Wed, 9 Sep 2020 09:40:23 +0000 (10:40 +0100)]
[ARM] Fixup of a few test cases. NFC.

After changing the semantics of get.active.lane.mask, I missed a few tests
that should use now the tripcount instead of the backedge taken count.

3 years ago[AMDGPU] Workaround for LDS Misalignment bug on GFX10
Mirko Brkusanin [Wed, 9 Sep 2020 09:28:36 +0000 (11:28 +0200)]
[AMDGPU] Workaround for LDS Misalignment bug on GFX10

Add subtarget feature check to avoid using ds_read/write_b96/128 with too
low alignment if a bug is present on that specific hardware.
Add this "feature" to GFX 10.1.1 as it is also affected.
Add global-isel test.

3 years agoRename MemRefDescriptor::getElementType() to MemRefDescriptor::getElementPtrType().
Christian Sigg [Wed, 9 Sep 2020 05:41:56 +0000 (07:41 +0200)]
Rename MemRefDescriptor::getElementType() to MemRefDescriptor::getElementPtrType().

Reviewed By: ftynse

Differential Revision: https://reviews.llvm.org/D87284

3 years agoTemporairly revert "Thread safety analysis: Consider global variables in scope" ...
Roman Lebedev [Wed, 9 Sep 2020 09:08:46 +0000 (12:08 +0300)]
Temporairly revert "Thread safety analysis: Consider global variables in scope" & followup

This appears to cause false-positives because it started to warn on local non-global variables.

Repro posted to https://reviews.llvm.org/D84604#2262745

This reverts commit 9dcc82f34ea9b623d82d2577b93aaf67d36dabd2.
This reverts commit b2ce79ef66157dd752e3864ece57915e23a73f5d.

3 years ago[mlir] Added support for loops to BufferPlacement transformation.
Marcel Koester [Fri, 7 Aug 2020 10:22:45 +0000 (12:22 +0200)]
[mlir] Added support for loops to BufferPlacement transformation.

The current BufferPlacement transformation cannot handle loops properly. Buffers
passed via backedges will not be freed automatically introducing memory leaks.
This CL adds support for loops to overcome these limitations.

Differential Revision: https://reviews.llvm.org/D85513

3 years ago[lldb] Enable std::pair in CxxModuleHandler
Raphael Isemann [Wed, 9 Sep 2020 08:35:56 +0000 (10:35 +0200)]
[lldb] Enable std::pair in CxxModuleHandler

This adds support for substituting std::pair instantiations with enabled
import-std-module.

With the fixes in parent revisions we can currently substitute a single pair
(however, a result that returns a second pair currently causes LLDB to crash
while importing the second template instantiation).

Reviewed By: aprantl

Differential Revision: https://reviews.llvm.org/D85141

3 years ago[Test] Add failing test for pr47457
Max Kazantsev [Wed, 9 Sep 2020 08:44:25 +0000 (15:44 +0700)]
[Test] Add failing test for pr47457

3 years ago[lldb] Fix a crash when the ASTImporter is giving us two Imported callbacks for the...
Raphael Isemann [Wed, 9 Sep 2020 08:16:56 +0000 (10:16 +0200)]
[lldb] Fix a crash when the ASTImporter is giving us two Imported callbacks for the same target decl

The ASTImporter has an `Imported(From, To)` callback that notifies subclasses
that a declaration has been imported in some way. LLDB uses this in the
`CompleteTagDeclsScope` to see which records have been imported into the scratch
context. If the record was declared inside the expression, then the
`CompleteTagDeclsScope` will forcibly import the full definition of that record
to the scratch context so that the expression AST can safely be disposed later
(otherwise we might end up going back to the deleted AST to complete the
minimally imported record). The way this is implemented is that there is a list
of decls that need to be imported (`m_decls_to_complete`) and we keep completing
the declarations inside that list until the list is empty. Every `To` Decl we
get via the `Imported` callback will be added to the list of Decls to be
completed.

There are some situations where the ASTImporter will actually give us two
`Imported` calls with the same `To` Decl. One way where this happens is if the
ASTImporter decides to merge an imported definition into an already imported
one. Another way is that the ASTImporter just happens to get two calls to
`ASTImporter::Import` for the same Decl. This for example happens when importing
the DeclContext of a Decl requires importing the Decl itself, such as when
importing a RecordDecl that was declared inside a function.

The bug addressed in this patch is that when we end up getting two `Imported`
calls for the same `To` Decl, then we would crash in the
`CompleteTagDeclsScope`.  That's because the first time we complete the Decl we
remove the Origin tracking information (that maps the Decl back to from where it
came from). The next time we try to complete the same `To` Decl the Origin
tracking information is gone and we hit the `to_context_md->getOrigin(decl).ctx
== m_src_ctx` assert (`getOrigin(decl).ctx` is a nullptr the second time as the
Origin was deleted).

This is actually a regression coming from D72495. Before D72495
`m_decls_to_complete` was actually a set so every declaration in there could
only be queued once to be completed. The set was changed to a vector to make the
iteration over it deterministic, but that also causes that we now potentially
end up trying to complete a Decl twice.

This patch essentially just reverts D72495 and makes the `CompleteTagDeclsScope`
use a SetVector for the list of declarations to be completed. The SetVector
should filter out the duplicates (as the original `set` did) and also ensure that
the completion order is deterministic. I actually couldn't find any way to cause
LLDB to reproduce this bug by merging declarations (this would require that we
for example declare two namespaces in a non-top-level expression which isn't
possible). But the bug reproduces very easily by just declaring a class in an
expression, so that's what the test is doing.

Reviewed By: shafik

Differential Revision: https://reviews.llvm.org/D85648

3 years ago[EarlyCSE] Explicitly require AAResultsWrapperPass.
Florian Hahn [Wed, 9 Sep 2020 08:00:41 +0000 (09:00 +0100)]
[EarlyCSE] Explicitly require AAResultsWrapperPass.

The MemorySSAWrapperPass depends on AAResultsWrapperPass and if
MemorySSA is preserved but AAResultsWrapperPass is not, this could lead
to a crash when updating the last user of the MemorySSAWrapperPass.

Alternatively AAResultsWrapperPass could be marked preserved by GVN, but
I am not sure if that would be safe. I am not sure what is required in
order to preserve AAResultsWrapperPass. At the moment, it seems like a
couple of passes that do similar transforms to GVN are preserving it.

Reviewed By: asbirlea

Differential Revision: https://reviews.llvm.org/D87137

3 years ago[lldb] Don't infinite loop in SemaSourceWithPriorities::CompleteType when trying...
Raphael Isemann [Wed, 9 Sep 2020 07:54:47 +0000 (09:54 +0200)]
[lldb] Don't infinite loop in SemaSourceWithPriorities::CompleteType when trying to complete a forward decl

SemaSourceWithPriorities is a special SemaSource that wraps our normal LLDB
ExternalASTSource and the ASTReader (which is used for the C++ module loading).
It's only active when the `import-std-module` setting is turned on.

The `CompleteType` function there in `SemaSourceWithPriorities` is looping over
all ExternalASTSources and asks each to complete the type. However, that loop is
in another loop that keeps doing that until the type is complete. If that
function is ever called on a type that is a forward decl then that causes LLDB
to go into an infinite loop.

I remember I added that second loop and the comment because I thought I saw a
similar pattern in some other Clang code, but after some grepping I can't find
that code anywhere and it seems the rest of the code base only calls
CompleteType once (It would also be kinda silly to have calling it multiple
times). So it seems that's just a silly mistake.

The is implicitly tested by importing `std::pair`, but I also added a simpler
dedicated test that creates a dummy libc++ module with some forward declarations
and then imports them into the scratch AST context. At some point the
ASTImporter will check if one of the forward decls could be completed by the
ExternalASTSource, which will cause the `SemaSourceWithPriorities` to go into an
infinite loop once it receives the `CompleteType` call.

Reviewed By: shafik

Differential Revision: https://reviews.llvm.org/D87289

3 years ago[MLIR][Shape] Lower `shape_of` to `dynamic_tensor_from_elements`
Frederik Gossen [Wed, 9 Sep 2020 07:53:13 +0000 (07:53 +0000)]
[MLIR][Shape] Lower `shape_of` to `dynamic_tensor_from_elements`

Take advantage of the new `dynamic_tensor_from_elements` operation in `std`.
Instead of stack-allocated memory, we can now lower directly to a single `std`
operation.

Differential Revision: https://reviews.llvm.org/D86935

3 years ago[MLIR][Standard] Update `tensor_from_elements` assembly format
Frederik Gossen [Wed, 9 Sep 2020 07:44:38 +0000 (07:44 +0000)]
[MLIR][Standard] Update `tensor_from_elements` assembly format

Remove the redundant parenthesis that are used for none of the other operation
formats.

Differential Revision: https://reviews.llvm.org/D86287

3 years ago[gn build] Port c0e5e3fbfa5
LLVM GN Syncbot [Wed, 9 Sep 2020 07:32:57 +0000 (07:32 +0000)]
[gn build] Port c0e5e3fbfa5

3 years ago[Ignore Expressions] Fix performance regression by inlining `Ignore*SingleStep`
Eduardo Caldas [Tue, 8 Sep 2020 11:32:02 +0000 (11:32 +0000)]
[Ignore Expressions] Fix performance regression by inlining `Ignore*SingleStep`

We also add a `const` versions of `IgnoreExprNodes`

Differential Revision: https://reviews.llvm.org/D87278

3 years ago[lldb][doc] Mention python3-dev instead of python2.7-dev in build docs
Raphael Isemann [Wed, 9 Sep 2020 07:29:51 +0000 (09:29 +0200)]
[lldb][doc] Mention python3-dev instead of python2.7-dev in build docs

3 years ago[MLIR] Add debug support for ignored patterns
Frederik Gossen [Wed, 9 Sep 2020 07:16:45 +0000 (07:16 +0000)]
[MLIR] Add debug support for ignored patterns

The rewrite engine's cost model may determine some patterns to be irrelevant
ahead of their application. These patterns were silently ignored previously and
now cause a message in `--debug` mode.

Differential Revision: https://reviews.llvm.org/D87290

3 years ago[Statepoints] Properly handle const base pointer.
Denis Antrushin [Mon, 7 Sep 2020 15:04:07 +0000 (22:04 +0700)]
[Statepoints] Properly handle const base pointer.

Current code in InstEmitter assumes all GC pointers are either
VRegs or stack slots - hence, taking only one operand.
But it is possible to have constant base, in which case it
occupies two machine operands.

Add a convinience function to StackMaps to get index of next
meta argument and use it in InsrEmitter to properly advance to
the next statepoint meta operand.

Reviewed By: reames

Differential Revision: https://reviews.llvm.org/D87252

3 years ago[ARM] Try to rematerialize VCTP instructions
Sam Parker [Wed, 9 Sep 2020 06:32:30 +0000 (07:32 +0100)]
[ARM] Try to rematerialize VCTP instructions

We really want to try and avoid spilling P0, which can be difficult
since there's only one register, so try to rematerialize any VCTP
instructions.

Differential Revision: https://reviews.llvm.org/D87280

3 years ago[Attributor][NFC] Improve check lines in depgraph.ll
Johannes Doerfert [Tue, 8 Sep 2020 20:58:58 +0000 (15:58 -0500)]
[Attributor][NFC] Improve check lines in depgraph.ll

This adds the check lines with -NEXT so we see any change in the future.

3 years ago[Attributor] Cleanup `::initialize` of various AAs
Johannes Doerfert [Sun, 30 Aug 2020 19:14:33 +0000 (14:14 -0500)]
[Attributor] Cleanup `::initialize` of various AAs

This commit cleans up the ::initialize method of various AAs in the
following ways:
  - If an associated function is required, give up on declarations.
    This was discovered as a real problem when lots of llvm.dbg.XXX
    call sites were assumed `noreturn` until proven otherwise. That
    does not make any sense and caused huge regressions and missed
    deductions.
  - Require more associated declarations for function interface AAs.
  - Use the IRAttribute::initialize to determine if function interface
    AAs can be used in IPO, don't replicate the checks (especially
    isFunctionIPOAmendable) all over the place. Arguably the function
    declaration check should be moved to some central place to.

3 years ago[llvm-cov gcov] Simply computation of line counts and exit block counter
Fangrui Song [Wed, 9 Sep 2020 06:15:37 +0000 (23:15 -0700)]
[llvm-cov gcov] Simply computation of line counts and exit block counter

3 years ago[Attributor] Associate the callback callee with a call site argument (if any)
Johannes Doerfert [Fri, 4 Sep 2020 16:20:28 +0000 (11:20 -0500)]
[Attributor] Associate the callback callee with a call site argument (if any)

If we have a callback, call site arguments were already associated with
the callback callee. Now we also associate the function with the
callback callee, thus we know ensure that the following holds true (if
all return nonnull):
   `getAssociatedArgument()->getParent() == getAssociatedFunction()`

To test this an early exit from
  `AAMemoryBehaviorCallSiteArgument::initialize``
is included as well. Without the change to getAssociatedFunction() this
kind of early exit for declarations would cause callback call site
arguments to miss out.