platform/upstream/llvm.git
4 years agoAMDGPU/GlobalISel: Don't hardcode maximum register size
Matt Arsenault [Sat, 13 Jun 2020 22:32:30 +0000 (18:32 -0400)]
AMDGPU/GlobalISel: Don't hardcode maximum register size

This is a somewhat artifical limit, so avoid repeating it many places
in case it changes.

4 years ago[libc++] Add the ability to run arbitrary programs using the DSL
Louis Dionne [Mon, 15 Jun 2020 18:49:21 +0000 (14:49 -0400)]
[libc++] Add the ability to run arbitrary programs using the DSL

This is useful for checking runtime properties of the target system.
This is a partial re-application of 3ea9450bda90. This part was tested
to work on a Windows host with a SSH executor.

4 years agoAMDGPU/GlobalISel: Extend load/store workaround to i128 vectors
Matt Arsenault [Mon, 15 Jun 2020 17:42:47 +0000 (13:42 -0400)]
AMDGPU/GlobalISel: Extend load/store workaround to i128 vectors

4 years agoRevert "[mlir] Add support for lowering tanh to LLVMIR."
Mehdi Amini [Mon, 15 Jun 2020 18:33:34 +0000 (18:33 +0000)]
Revert "[mlir] Add support for lowering tanh to LLVMIR."

This reverts commit 32c757e4f808c68a7e34eb712fead0a49cdf814a.

Broke the build bot:

******************** TEST 'MLIR :: Examples/standalone/test.toy' FAILED ********************
[...]
/tmp/ci-KIMiRFcVZt/lib/libMLIRLinalgToLLVM.a(LinalgToLLVM.cpp.o): In function `(anonymous namespace)::ConvertLinalgToLLVMPass::runOnOperation()':
LinalgToLLVM.cpp:(.text._ZN12_GLOBAL__N_123ConvertLinalgToLLVMPass14runOnOperationEv+0x100): undefined reference to `mlir::populateExpandTanhPattern(mlir::OwningRewritePatternList&, mlir::MLIRContext*)'

4 years ago[LLVM] Change isa<> to a variadic function template
Rahul Joshi [Mon, 15 Jun 2020 18:25:14 +0000 (18:25 +0000)]
[LLVM] Change isa<> to a variadic function template

Change isa<> to a variadic function template, so that it can be used to test against one of multiple types as follows:
   isa<Type0, Type1, Type2>(Val)

Differential Revision: https://reviews.llvm.org/D81045

4 years ago[lit] Include unexecuted tests in xUnit report
Julian Lettner [Fri, 5 Jun 2020 22:45:43 +0000 (15:45 -0700)]
[lit] Include unexecuted tests in xUnit report

Pass in all discovered tests to report generators.

The XunitReport generator now creates testcase items for unexecuted
tests and documents why they have been skipped.  This makes it easier
to compare test runs with different filters or configurations, or across
platforms.

I don't know who is using the JsonReport generator and what the
expectations there are (it doesn't have tests), so decided to preserve
the old behavior by filtering out the unexecuted tests.

Reviewed By: jdenny

Differential Revision: https://reviews.llvm.org/D81316

4 years ago[RuntimeDyld] Add dependence on Core.
Lang Hames [Mon, 15 Jun 2020 18:07:16 +0000 (11:07 -0700)]
[RuntimeDyld] Add dependence on Core.

Commit 498dd745f55 introduced a dependence on Core. This patch updates
LLVMbuild.txt to reflect this.

4 years agoAMDGPU/GlobalISel: Correct memory size in test
Matt Arsenault [Mon, 15 Jun 2020 18:12:02 +0000 (14:12 -0400)]
AMDGPU/GlobalISel: Correct memory size in test

4 years ago[mlir] [integration-test] [VectorOps] Start an integration test directory for MLIR
aartbik [Mon, 15 Jun 2020 17:39:10 +0000 (10:39 -0700)]
[mlir] [integration-test] [VectorOps] Start an integration test directory for MLIR

Summary:
This CL introduces an integration test directory for MLIR in general, with
vector dialect integration tests in particular as a first working suite. To
run all the integration tests (and currently just the vector suite):

$ cmake --build . --target check-mlir-integration
[0/1] Running the MLIR integration tests
Testing Time: 0.24s
Passed: 22

The general call is to contribute to this integration test directory with more
tests and other suites, running end-to-end examples that may be too heavy for
the regular test directory, but should be tested occasionally to verify the
health of MLIR.

Background discussion at:
https://llvm.discourse.group/t/vectorops-rfc-add-suite-of-integration-tests-for-vector-dialect-operations/1213/

Reviewers: nicolasvasilache, reidtatge, andydavis1, rriddle, ftynse, mehdi_amini, jpienaar, stephenneuendorffer

Reviewed By: nicolasvasilache, stephenneuendorffer

Subscribers: mgorny, mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, msifontes

Tags: #mlir

Differential Revision: https://reviews.llvm.org/D81626

4 years ago[Target/PPC] Silence an unused variable warning. NFC.
Davide Italiano [Mon, 15 Jun 2020 18:04:30 +0000 (11:04 -0700)]
[Target/PPC] Silence an unused variable warning. NFC.

4 years agoRecommit "[X86] Separate imm from relocImm handling."
Craig Topper [Mon, 15 Jun 2020 17:59:43 +0000 (10:59 -0700)]
Recommit "[X86] Separate imm from relocImm handling."

Fix the copy/paste mistake that caused it to fail previously

4 years ago[DSE,MSSA] Port partial store merging.
Florian Hahn [Mon, 15 Jun 2020 14:40:07 +0000 (15:40 +0100)]
[DSE,MSSA] Port partial store merging.

Port partial constant store merging logic to MemorySSA backed DSE. The
heavy lifting is done by the existing helper function. It is used in
context where we already ensured that the later instruction can
eliminate the earlier one, if it is a complete overwrite.

4 years ago[mlir] Add support for lowering tanh to LLVMIR.
Hanhan Wang [Mon, 15 Jun 2020 17:26:32 +0000 (10:26 -0700)]
[mlir] Add support for lowering tanh to LLVMIR.

Summary:
Add a pattern for expanding tanh op into exp form.
A `tanh` is expanded into:
   1) 1-exp^{-2x} / 1+exp^{-2x}, if x => 0
   2) exp^{2x}-1 / exp^{2x}+1  , if x < 0.

Differential Revision: https://reviews.llvm.org/D81618

4 years ago[ORC] Honor linker private global prefix on symbol names.
Lang Hames [Sun, 14 Jun 2020 02:03:21 +0000 (19:03 -0700)]
[ORC] Honor linker private global prefix on symbol names.

If a symbol name begins with the linker private global prefix (as
described by the DataLayout) then it should be treated as non-exported,
regardless of its LLVM IR visibility value.

4 years ago[libc++abi] Ensure custom libc++ header paths are honoured during libc++abi build
Louis Dionne [Mon, 15 Jun 2020 17:20:04 +0000 (13:20 -0400)]
[libc++abi] Ensure custom libc++ header paths are honoured during libc++abi build

This is necessary for standalone builds where the libc++ in use has a
custom configuration set up inside __config_site -- one needs to build
libc++abi against the installed headers of libc++ (which are properly
configured) instead of the ones inside libcxx/include.

See https://reviews.llvm.org/rGe619e9d#927848 for details.

4 years ago[gn build] (manually) merge 3b29376e3fca43
Nico Weber [Mon, 15 Jun 2020 17:21:34 +0000 (13:21 -0400)]
[gn build] (manually) merge 3b29376e3fca43

4 years ago[WebAssembly] Adding 64-bit version of R_WASM_MEMORY_ADDR_* relocs
Wouter van Oortmerssen [Fri, 5 Jun 2020 16:03:12 +0000 (09:03 -0700)]
[WebAssembly] Adding 64-bit version of R_WASM_MEMORY_ADDR_* relocs

This adds 4 new reloc types.

A lot of code that previously assumed any memory or offset values could be contained in a uint32_t (and often truncated results from functions returning 64-bit values) have been upgraded to uint64_t. This is not comprehensive: it is only the values that come in contact with the new relocation values and their dependents.

A new tablegen mapping was added to automatically upgrade loads/stores in the assembler, which otherwise has no way to select for these instructions (since they are indentical other than for the offset immediate). It follows a similar technique to https://reviews.llvm.org/D53307

Differential Revision: https://reviews.llvm.org/D81704

4 years ago[llvm][SveEmitter] SVE ACLE for quadword permute intrinsics.
Francesco Petrogalli [Mon, 15 Jun 2020 16:23:41 +0000 (16:23 +0000)]
[llvm][SveEmitter] SVE ACLE for quadword permute intrinsics.

Summary:
The following intrinsics have been added, guarded by the macro
`__ARM_FEATURE_SVE_MATMUL_FP64`:

* svtrn1q[_*]
* svtrn2q[_*]
* svuzp1q[_*]
* svuzp2q[_*]
* svzip1q[_*]
* svzip2q[_*]

Supported types:

* svint[8|16|32|64]_t
* svuint[8|16|32|64]_t
* svfloat[16|32|64]_t

TODO: add support for svbfloat16_t

Reviewers: efriedma, sdesmalen, kmclaughlin, rengolin

Reviewed By: sdesmalen

Subscribers: tschuett, kristof.beyls, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D80851

4 years ago[OpenMP][Tests] fix data race in an OpenMP runtime test
Joachim Protze [Sun, 14 Jun 2020 14:10:10 +0000 (16:10 +0200)]
[OpenMP][Tests] fix data race in an OpenMP runtime test

Reviewed by: AndreyChurbanov

Differential Revision: https://reviews.llvm.org/D81804

4 years ago[X86] Remove printanymem/printopaquemem from the InstPrinters. Just tell tablegen...
Craig Topper [Mon, 15 Jun 2020 16:45:02 +0000 (09:45 -0700)]
[X86] Remove printanymem/printopaquemem from the InstPrinters. Just tell tablegen to printMemReference directly. NFC

Most of the wrappers exist to print the memory size in Intel syntax
and then call the printMemReference. But printanymem/printopaquemem
don't print anything extra in Intel syntax so just drop them.

4 years ago[DSE] Hoist partial store merging code into function (NFC).
Florian Hahn [Mon, 15 Jun 2020 14:37:33 +0000 (15:37 +0100)]
[DSE] Hoist partial store merging code into function (NFC).

Hoist the general logic into a new function, because it can be re-used
by the MemorySSA backed DSE as well.

4 years ago[GlobalISel] Simplify G_ADD when it has (0-X) on the LHS or RHS
Jessica Paquette [Sat, 4 Apr 2020 00:11:25 +0000 (17:11 -0700)]
[GlobalISel] Simplify G_ADD when it has (0-X) on the LHS or RHS

This implements the following combines:

((0-A) + B) -> B-A
(A + (0-B)) -> A-B

Porting over the basic algebraic combines from the DAGCombiner. There are
several combines which fold adds away into subtracts. This is just the simplest
one.

I noticed that add combines are some of the most commonly hit across CTMark,
(via print statements when they fire), so I'm porting over some of the obvious
ones.

This gives some minor code size improvements on CTMark at -O3 on AArch64.

Differential Revision: https://reviews.llvm.org/D77453

4 years ago[flang] unit test support for out-of-tree and in-tree using google tests framework
sameeran joshi [Tue, 2 Jun 2020 17:15:44 +0000 (22:45 +0530)]
[flang] unit test support for out-of-tree and in-tree using google tests framework

Review comments addressed.

4 years ago[libc] Enable string functions for aarch64.
Siva Chandra [Mon, 15 Jun 2020 16:36:10 +0000 (09:36 -0700)]
[libc] Enable string functions for aarch64.

4 years ago[lldb] Remove indentation before help output.
Jonas Devlieghere [Mon, 15 Jun 2020 16:25:33 +0000 (09:25 -0700)]
[lldb] Remove indentation before help output.

This patch remove the indentation before the command help output.
Supposedly it was meant to be aligned with the different subcommands.

Differential revision: https://reviews.llvm.org/D81783

4 years ago[libc] Extract an architecture independent copy of memcpy implementation.
Siva Chandra Reddy [Wed, 10 Jun 2020 21:41:06 +0000 (14:41 -0700)]
[libc] Extract an architecture independent copy of memcpy implementation.

Along that way, platform specific options to memcpy, memset and bzero
builds have been enclosed in conditionals. Also, the optimization level
has been set to -O2 for the memory function builds to actually see the
static functions inlined.

Reviewers: gchatelet

Differential Revision: https://reviews.llvm.org/D81621

4 years ago[llvm][SVE] IR intrinsics for quadword permutation instructions.
Francesco Petrogalli [Mon, 15 Jun 2020 16:10:37 +0000 (16:10 +0000)]
[llvm][SVE] IR intrinsics for quadword permutation instructions.

Summary:
Adding intrinsics and codegen patterns for:

* trn1 <Zd>.q, <Zm>.q, <Zn>.q
* trn2 <Zd>.q, <Zm>.q, <Zn>.q
* zip1 <Zd>.q, <Zm>.q, <Zn>.q
* zip2 <Zd>.q, <Zm>.q, <Zn>.q
* uzp1 <Zd>.q, <Zm>.q, <Zn>.q
* uzp2 <Zd>.q, <Zm>.q, <Zn>.q

These instructions are defined in Armv8.6-A.

Reviewers: sdesmalen, efriedma, kmclaughlin

Reviewed By: sdesmalen

Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D80850

4 years ago[flang] Change DIE("unreachable") cases to use llvm_unreachable
David Truby [Wed, 6 May 2020 17:38:28 +0000 (18:38 +0100)]
[flang] Change DIE("unreachable") cases to use llvm_unreachable

Reviewers: sscalpone

Subscribers: llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D79507

4 years ago[libc++] Raise an exception if a Lit feature resolves to an invalid name
Louis Dionne [Mon, 15 Jun 2020 16:19:32 +0000 (12:19 -0400)]
[libc++] Raise an exception if a Lit feature resolves to an invalid name

This allows reporting issues early when creating feature names.

4 years ago[clangd] Follow-up to fix lit-test bustage in type-hierarchy.test
Nathan Ridge [Mon, 15 Jun 2020 16:17:14 +0000 (12:17 -0400)]
[clangd] Follow-up to fix lit-test bustage in type-hierarchy.test

This is a follow-up to D81845.

4 years ago[mlir][shape] Update test case to new op asm format
Jacques Pienaar [Mon, 15 Jun 2020 16:04:54 +0000 (09:04 -0700)]
[mlir][shape] Update test case to new op asm format

4 years ago[clangd] Support typeHierarchy/resolve for children of parents as well
Nathan Ridge [Mon, 15 Jun 2020 00:17:15 +0000 (20:17 -0400)]
[clangd] Support typeHierarchy/resolve for children of parents as well

Summary:
The initial implementation of typeHierarchy/resolve only supported
cases where an initial request was made for children, and then
typeHierarchy/resolve was used to get additional levels of children.

However, a client may also want to make an initial request for
parents, and then show other children of those parents, so support
typeHierarchy/resolve for items returned in response to a request
for parents as well.

Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D81845

4 years ago[mlir] Introduce callback-based builders for AffineForOp
Alex Zinenko [Mon, 15 Jun 2020 09:41:06 +0000 (11:41 +0200)]
[mlir] Introduce callback-based builders for AffineForOp

Similarly to `scf::ForOp`, introduce additional `function_ref` arguments to
`AffineForOp::build` that can be used to populate the body of the loop during
its construction. Provide compatibility functions for constructing affine loop
nests using `edsc::ScopedContext`.

`edsc::AffineLoopNestBuilder` and reletad functionality is now deprecated and
will be removed soon, users are expected to switch to `affineLoopNestBuilder`
that provides similar functionality with a simpler OpBuilder-based
implementation.

Differential Revision: https://reviews.llvm.org/D81754

4 years ago[mlir] Affine symbols: do not expect AffineScope to always exist
Alex Zinenko [Mon, 15 Jun 2020 09:40:56 +0000 (11:40 +0200)]
[mlir] Affine symbols: do not expect AffineScope to always exist

In the affine symbol and dimension check, the code currently assumes
`getAffineScope` and its users `isValidDim` and `isValidSymbol` are only called
on values defined in regions that have a parent Op with `AffineScope` trait.
This is not necessarily the case, and these functions may be called on valid IR
that does not satisfy this assumption. Return `nullptr` from `getAffineScope`
if there is no parent op with `AffineScope` trait. Treat this case
conservatively in `isValidSymbol` by only accepting as symbols the values that
are guaranteed to be symbols (constants, and certain operations). No
modifications are necessary to `isValidDim` that delegates most of the work to
`isValidDim`.

Differential Revision: https://reviews.llvm.org/D81753

4 years ago[mlir][shape] Add a pattern to rewrite `shape.reduce` as `scf.for`.
Alexander Belyaev [Mon, 15 Jun 2020 15:14:21 +0000 (17:14 +0200)]
[mlir][shape] Add a pattern to rewrite `shape.reduce` as `scf.for`.

Differential Revision: https://reviews.llvm.org/D81694

4 years ago[analyzer] Reasoning about comparison expressions in RangeConstraintManager
Denys Petrov [Mon, 15 Jun 2020 12:31:10 +0000 (15:31 +0300)]
[analyzer] Reasoning about comparison expressions in RangeConstraintManager

Summary:

Implemented RangeConstraintManager::getRangeForComparisonSymbol which handles comparison operators.
RangeConstraintManager::getRangeForComparisonSymbol cares about the sanity of comparison expressions sequences helps reasonably to branch an exploded graph.
It can significantly reduce the graph and speed up the analysis. For more details, please, see the differential revision.

This fixes https://bugs.llvm.org/show_bug.cgi?id=13426

Differential Revision: https://reviews.llvm.org/D78933

4 years agoAMDGPU/GlobalISel: Fix 8-byte aligned, 96-bit scalar loads
Matt Arsenault [Sat, 30 May 2020 23:50:55 +0000 (19:50 -0400)]
AMDGPU/GlobalISel: Fix 8-byte aligned, 96-bit scalar loads

These are legal since we can do a 96-bit load on some subtargets, but
this is only for vector loads. If we can't widen the load, it needs to
be broken down once known scalar. For 16-byte alignment, widen to a
128-bit load.

4 years ago[WebAssembly] Adding 64-bit versions of all load & store ops.
Wouter van Oortmerssen [Tue, 19 May 2020 00:45:31 +0000 (17:45 -0700)]
[WebAssembly] Adding 64-bit versions of all load & store ops.

Context: https://github.com/WebAssembly/memory64/blob/master/proposals/memory64/Overview.md
This is just a first step, adding the new instruction variants while keeping the existing 32-bit functionality working.
Some of the basic load/store tests have new wasm64 versions that show that the basics of the target are working.
Further features need implementation, but these will be added in followups to keep things reviewable.

Differential Revision: https://reviews.llvm.org/D80769

4 years ago[NFCI] Change the data structure of MaybeODRUseExprSet
Erich Keane [Mon, 15 Jun 2020 15:10:41 +0000 (08:10 -0700)]
[NFCI] Change the data structure of MaybeODRUseExprSet

In 1eddce41, I fixed a non-deterministic result problem by switching a
SmallPtrSet to a SmallSetVector to ensure we iterated it
deterministically.  Unfortunately, this seems to show a surprisingly
significant compiletime impact.

This patch does 2 things in an attempt to fix this:

First, it makes the 'small size' optimization 4 instead of 2.  As these
are pointers, this only increases the size of Sema by 4
sizeof(pointer)s (2 for the set, 2 for the vector).

Second, instead of using SmallSetVector, which is a SmallVector +
SmallDenseSet, it uses a SetVector of SmallVector + SmallPtrSet.  The
hope is that the pointer-specific optimizations of the SmallPtrSet will
minimize the impact on compile-time.

4 years ago[DSE,MSSA] Delete instructions after printing it.
Florian Hahn [Mon, 15 Jun 2020 14:59:09 +0000 (15:59 +0100)]
[DSE,MSSA] Delete instructions after printing it.

Also enables a now-passing test case, that exposed a crash caused by the
wrong order.

4 years ago[X86][SSE] Add LowerVectorAllZero helper for checking if all bits of a vector are...
Simon Pilgrim [Mon, 15 Jun 2020 14:54:12 +0000 (15:54 +0100)]
[X86][SSE] Add LowerVectorAllZero helper for checking if all bits of a vector are zero.

Pull the lowering code out of LowerVectorAllZeroTest (and rename it MatchVectorAllZeroTest).

We should be able to reuse this in combineVectorSizedSetCCEquality as well.

Another cleanup to simplify D81547.

4 years ago[PowerPC] Do not add the relocation addend to the instruction encoding
Stefan Pintilie [Mon, 15 Jun 2020 14:40:02 +0000 (09:40 -0500)]
[PowerPC] Do not add the relocation addend to the instruction encoding

We should not be adding the relocation addend to the instruction encoding.
This patch removes that and sets those bits to zero.

Differential Revision: https://reviews.llvm.org/D81082

4 years ago[DSE,MSSA] Add additional merging test cases (NFC).
Florian Hahn [Mon, 15 Jun 2020 11:34:12 +0000 (12:34 +0100)]
[DSE,MSSA] Add additional merging test cases (NFC).

Additional tests added ahead of partial overlapping store merging.

4 years ago[clang-tidy] Add diagnostics level to YAML output
Dmitry Polukhin [Sat, 2 May 2020 09:35:41 +0000 (02:35 -0700)]
[clang-tidy] Add diagnostics level to YAML output

Summary:
Also added BuildDirectory for completness and removed unused `Fix`.

Test Plan: check-all

Reviewers: alexfh, gribozavr2

Subscribers: xazax.hun, cfe-commits

Tags: #clang-tools-extra, #clang

Differential Revision: https://reviews.llvm.org/D79285

4 years ago[NFC] Remove unnecessary require global-isel from tests
Dominik Montada [Mon, 15 Jun 2020 14:35:18 +0000 (16:35 +0200)]
[NFC] Remove unnecessary require global-isel from tests

4 years ago[NFC] Add braces to if-statement in MachineVerifier
Dominik Montada [Mon, 15 Jun 2020 14:33:56 +0000 (16:33 +0200)]
[NFC] Add braces to if-statement in MachineVerifier

4 years ago[X86][SSE] LowerVectorAllZeroTest - add support for >256-bit vectors
Simon Pilgrim [Mon, 15 Jun 2020 14:30:06 +0000 (15:30 +0100)]
[X86][SSE] LowerVectorAllZeroTest - add support for >256-bit vectors

Reduce by splitting the vector until we reach the target size for PTEST/MOVMSK_PCMPEQ. There might be some cases where AVX512 can perform this with 512-bit vectors but so far I haven't encountered any such pattern that reaches LowerVectorAllZeroTest.

Prep work for D81547

4 years agoRevert "[X86] Separate imm from relocImm handling."
Hans Wennborg [Mon, 15 Jun 2020 14:08:25 +0000 (16:08 +0200)]
Revert "[X86] Separate imm from relocImm handling."

> relocImm was a complexPattern that handled both ConstantSDNode
> and X86Wrapper. But it was only applied selectively because using
> it would cause patterns to be not importable into FastISel or
> GlobalISel. So it only got applied to flag setting instructions,
> stores, RMW arithmetic instructions, and rotates.
>
> Most of the test changes are a result of making patterns available
> to GlobalISel or FastISel. The absolute-cmp.ll change is due to
> this fixing a pattern ordering issue to make an absolute symbol
> match to an 8-bit immediate before trying a 32-bit immediate.
>
> I tried to use PatFrags to reduce the repetition, but I was getting
> errors from TableGen.

This caused "Invalid EmitNode" assertions, see the llvm-commits thread for
discussion.

4 years ago[X86][SSE] LowerVectorAllZeroTest - remove unnecessary bitcasts
Simon Pilgrim [Mon, 15 Jun 2020 14:12:32 +0000 (15:12 +0100)]
[X86][SSE] LowerVectorAllZeroTest - remove unnecessary bitcasts

matchScalarReduction should return all its source vectors with the same type, so we can safely perform the OR reduction with the original type.

So we just need to bitcast for PTEST/PCMPEQB with the final reduced vector.

4 years ago[ARM][MachineOutliner] Fix no-lr-save testcase.
Yvan Roux [Mon, 15 Jun 2020 14:03:56 +0000 (16:03 +0200)]
[ARM][MachineOutliner] Fix no-lr-save testcase.

Now that saving LR into a register is handled, some register constraints
are needed to keep machine-outliner-no-lr-save.mir meaningful.

4 years ago[strictfp] Replace dangling strictfp attrs with nobuiltin
Kevin P. Neal [Thu, 14 May 2020 18:57:19 +0000 (11:57 -0700)]
[strictfp] Replace dangling strictfp attrs with nobuiltin

In preparation for a patch that will enforce new rules for the usage of
the strictfp attribute, this patch introduces auto-upgrade behavior that
will replace the strictfp attribute on callsites with nobuiltin if the
enclosing function declaration doesn't also have the strictfp attribute.

This auto-upgrade isn't being performed on .ll files because that would
prevent us from writing a test for the forthcoming verifier behavior.

Differential Revision: https://reviews.llvm.org/D70096

4 years ago[clang] Fix or emit diagnostic for checked arithmetic builtins with
Jeff Mott [Mon, 15 Jun 2020 13:08:58 +0000 (06:08 -0700)]
[clang] Fix or emit diagnostic for checked arithmetic builtins with
_ExtInt types

- Fix computed size for _ExtInt types passed to checked arithmetic
  builtins.
- Emit diagnostic when signed _ExtInt larger than 128-bits is passed
    to __builtin_mul_overflow.
- Change Sema checks for builtins to accept placeholder types.

Differential Revision: https://reviews.llvm.org/D81420

4 years ago[mlir] Fixed GCC compile issues and linking problems using SHARED_LIBS.
Marcel Koester [Mon, 15 Jun 2020 11:51:53 +0000 (13:51 +0200)]
[mlir] Fixed GCC compile issues and linking problems using SHARED_LIBS.

Differential Revision: https://reviews.llvm.org/D81839

4 years ago[Analyzer][StreamChecker] Add check for pointer escape.
Balázs Kéri [Mon, 15 Jun 2020 12:59:54 +0000 (14:59 +0200)]
[Analyzer][StreamChecker] Add check for pointer escape.

Summary:
After an escaped FILE* stream handle it is not possible to make
reliable checks on it because any function call can have effect
on it.

Reviewers: Szelethus, baloghadamsoftware, martong, NoQ

Reviewed By: NoQ

Subscribers: NoQ, rnkovacs, xazax.hun, baloghadamsoftware, szepet, a.sidorin, mikhail.ramalho, Szelethus, donat.nagy, dkrupp, gamesh411, Charusso, martong, ASDenysPetrov, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D80699

4 years ago[ARM][MachineOutliner] Add LR RegSave mode.
Yvan Roux [Mon, 15 Jun 2020 13:22:08 +0000 (15:22 +0200)]
[ARM][MachineOutliner] Add LR RegSave mode.

Outline chunks of code which need to save and restore the link register
when a spare register can be used to it.

Differential Revision: https://reviews.llvm.org/D80127

4 years ago[AArch64] Fix BTI instruction emission.
Daniel Kiss [Mon, 15 Jun 2020 13:02:38 +0000 (15:02 +0200)]
[AArch64] Fix BTI instruction emission.

Summary:
SCTLR_EL1.BT[01] controls the PACI[AB]SP compatibility with PBYTE 11
(see [1])
This bit will be set to zero so PACI[AB]SP are equal to BTI C
instruction only.

[1] https://developer.arm.com/docs/ddi0595/b/aarch64-system-registers/sctlr_el1

Reviewers: chill, tamas.petz, pbarrio, ostannard

Reviewed By: tamas.petz, ostannard

Subscribers: kristof.beyls, hiraditya, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D81746

4 years ago[mlir] Remove OperandAdaptor
Jacques Pienaar [Mon, 15 Jun 2020 13:01:31 +0000 (06:01 -0700)]
[mlir] Remove OperandAdaptor

Use ::Adaptor alias instead uniformly. Makes the naming more consistent as
adaptor can refer to attributes now too.

Differential Revision: https://reviews.llvm.org/D81789

4 years agoAMDGPU/GlobalISel: Workaround some load/store type selection patterns
Matt Arsenault [Wed, 10 Jun 2020 00:05:05 +0000 (20:05 -0400)]
AMDGPU/GlobalISel: Workaround some load/store type selection patterns

The logic is written for what loads/stores should be selectable. There
are a set of cases that should be selectable, but due to missing MVTs
and/or selection patterns, will fail to select. I think eventually
load/store select patterns should ignore the type and only look at the
value size, but until that happens, bitcast these to equivalent i32
vectors.

4 years agoAMDGPU/GlobalISel: Use less artifical example to avoid abort=0
Matt Arsenault [Tue, 9 Jun 2020 20:21:07 +0000 (16:21 -0400)]
AMDGPU/GlobalISel: Use less artifical example to avoid abort=0

These were failing due to an unlegalizable G_CONCAT_VECTORS due to
registers with types that are naturally illegal.

4 years agoGlobalISel: Support lowering vector->vector G_BITCAST
Matt Arsenault [Tue, 9 Jun 2020 15:19:12 +0000 (11:19 -0400)]
GlobalISel: Support lowering vector->vector G_BITCAST

Extract subvectors and cast to the result element type before
remerging.

4 years agoFix clang/test/Modules/empty.modulemap by increasing limit.
Sander de Smalen [Mon, 15 Jun 2020 08:47:49 +0000 (09:47 +0100)]
Fix clang/test/Modules/empty.modulemap by increasing limit.

This fixes a buildbot failure on Builder llvm-clang-win-x-aarch64.

The size of the module increased to just over 30kb due to new
Decl TypeDefs being added to the module after D81459 that
adds new ACLE types.

4 years agoattempt to fix failing buildbots after 3bab88b7baa20b276faaee0aa7ca87f636c91877
Tyker [Sun, 14 Jun 2020 12:39:14 +0000 (14:39 +0200)]
attempt to fix failing buildbots after 3bab88b7baa20b276faaee0aa7ca87f636c91877

Prevent IR-gen from emitting consteval declarations

Summary: with this patch instead of emitting calls to consteval function. the IR-gen will emit a store of the already computed result.

4 years ago[DebugInfo] Report errors for truncated debug line standard opcode
James Henderson [Tue, 9 Jun 2020 14:37:55 +0000 (15:37 +0100)]
[DebugInfo] Report errors for truncated debug line standard opcode

Standard opcodes usually have ULEB128 arguments, so it is generally not
possible to recover from such errors. This patch causes the parser to
stop parsing the table in such situations.

Also don't emit the operands or add data to the table if there is an
error reading these opcodes.

Reviewed by: JDevlieghere

Differential Revision: https://reviews.llvm.org/D81470

4 years ago[MLIR][Standard] Simplify complex operand unpacking
Frederik Gossen [Mon, 15 Jun 2020 10:45:03 +0000 (10:45 +0000)]
[MLIR][Standard] Simplify complex operand unpacking

Apply post-commit suggestions (see https://reviews.llvm.org/D79479).

Differential Revision: https://reviews.llvm.org/D81731

4 years ago[MLIR][Standard] Add documentation for `std.dim` and fix test cases
Frederik Gossen [Mon, 15 Jun 2020 10:39:05 +0000 (10:39 +0000)]
[MLIR][Standard] Add documentation for `std.dim` and fix test cases

Apply post-commit suggestions (see https://reviews.llvm.org/D81551).
Add documentation, simplify, and fix test cases.

Differential Revision: https://reviews.llvm.org/D81722

4 years agoRevert "[lld-macho] No need to explicitly specify -arch in tests"
Kirill Bobyrev [Mon, 15 Jun 2020 10:27:30 +0000 (12:27 +0200)]
Revert "[lld-macho] No need to explicitly specify -arch in tests"

This reverts commit 51c5baacf36f5d5d0dbed00590a9e56a5f4a0d73 and also
337fb8c767ea93058517cb9b0be5daf21b448b71 - "[lld-macho] Set REQUIRES:
x86 on more tests".

These patches cause test crashes:

http://lab.llvm.org:8011/builders/ppc64le-lld-multistage-test/builds/10054

4 years ago[mlir] Extended BufferPlacement to support more sophisticated scenarios in which
Marcel Koester [Wed, 6 May 2020 13:06:19 +0000 (15:06 +0200)]
[mlir] Extended BufferPlacement to support more sophisticated scenarios in which
allocations cannot be moved freely and can remain in divergent control flow.

The current BufferPlacement pass does not support allocation nodes that carry
additional dependencies (like in the case of dynamic shaped types). These
allocations can often not be moved freely and in turn might remain in divergent
control-flow branches. This requires a different strategy with respect to block
arguments and aliases. This CL adds additinal functionality to support
allocation nodes in divergent control flow while avoiding memory leaks.

Differential Revision: https://reviews.llvm.org/D79850

4 years agoRevert "Prevent IR-gen from emitting consteval declarations"
Kirill Bobyrev [Mon, 15 Jun 2020 10:14:15 +0000 (12:14 +0200)]
Revert "Prevent IR-gen from emitting consteval declarations"

This reverts commit 3bab88b7baa20b276faaee0aa7ca87f636c91877.

This patch causes test failures:
http://lab.llvm.org:8011/builders/clang-cmake-armv7-quick/builds/17260

4 years ago[yaml2obj] - Introduce the "NoHeaders" key for "SectionHeaderTable"
Georgii Rymar [Thu, 11 Jun 2020 13:46:56 +0000 (16:46 +0300)]
[yaml2obj] - Introduce the "NoHeaders" key for "SectionHeaderTable"

We have an issue currently. The following YAML piece just ignores the `Excluded` key.

```
SectionHeaderTable:
  Sections: []
  Excluded:
    - Name: .foo
```

Currently the meaning is: exclude the whole table.

The code checks that the `Sections` key is empty and doesn't catch/check
invalid/duplicated/missed `Excluded` entries.

Also there is no way to exclude all sections except the first null section,
because `Sections: []` currently just excludes the whole the sections header table.

To fix it, I suggest a change of the behavior.

1) A new `NoHeaders` key is added. It provides an explicit syntax to drop the whole table.
2) The meaning of the following is changed:

```
SectionHeaderTable:
  Sections: []
  Excluded:
    - Name: .foo

```
Assuming there are 2 sections in the object (a null section and `.foo`), with this patch it
means: exclude the `.foo` section, keep the null section. The null section is an implicit
section and I think it is reasonable to make "Sections: []" to mean it is implicitly added.
It will be consistent with the global "Sections" tag that is used to describe sections.

3) `SectionHeaderTable->Sections` is now optional. No `Sections` is the same as
   `Sections: []` (I think it avoids a confusion).
4) Using of `NoHeaders` together with `Sections`/`Excluded` is not allowed.
5) It is possible to use the `Excluded` key without the `Sections` key now (in this case
   `Excluded` must contain all sections).
6) `SectionHeaderTable:` or `SectionHeaderTable: []` is not allowed.
7) When the `SectionHeaderTable` key is present, we still require all sections to be
   present in `Sections` and `Excluded` lists. No changes here, we are still strict.

Differential revision: https://reviews.llvm.org/D81655

4 years ago[X86][SSE] Add tests for and/or reduction results compared to zero
Simon Pilgrim [Mon, 15 Jun 2020 09:40:27 +0000 (10:40 +0100)]
[X86][SSE] Add tests for and/or reduction results compared to zero

These should fold to memcmp/ptest/movmsk+cmpeq patterns

4 years ago[clangd] Turn on RecoveryAST for clangd by default.
Haojian Wu [Fri, 12 Jun 2020 14:05:38 +0000 (16:05 +0200)]
[clangd] Turn on RecoveryAST for clangd by default.

Reviewers: sammccall

Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D81739

4 years ago[VE] Support relocation information in MC layer
Kazushi (Jam) Marukawa [Mon, 15 Jun 2020 09:21:54 +0000 (11:21 +0200)]
[VE] Support relocation information in MC layer

Summary:
Change VEAsmParser to support identification with relocation information
in assmebler.  Change VEAsmBackend to support relocation information in
MC layer.  Change VEDisassembler and VEMCCodeEmitter to support binary
generation of branch target operands.  Add REFLONG fixup and variant kind
to support new R_VE_REFLONG ELF symbol.  And, add regression test in both
MC and CodeGen to check binary genaration with relocation information.

Differential Revision: https://reviews.llvm.org/D81553

4 years ago[MachineVerifier][GlobalISel] Check that branches have a MBB operand or are declared...
Dominik Montada [Wed, 10 Jun 2020 15:15:09 +0000 (17:15 +0200)]
[MachineVerifier][GlobalISel] Check that branches have a MBB operand or are declared indirect. Add missing properties to G_BRJT, G_BRINDIRECT

Summary:
Teach MachineVerifier to check branches for MBB operands if they are not declared indirect.

Add `isBarrier`, `isIndirectBranch` to `G_BRINDIRECT` and `G_BRJT`.
Without these, `MachineInstr.isConditionalBranch()` was giving a
false-positive for those instructions.

Reviewers: aemerson, qcolombet, dsanders, arsenm

Reviewed By: dsanders

Subscribers: hiraditya, wdng, simoncook, s.egerton, arsenm, rovka, llvm-commits

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D81587

4 years ago[Test] Add an example of unprofitable PR Phi insertion
Max Kazantsev [Mon, 15 Jun 2020 08:56:06 +0000 (15:56 +0700)]
[Test] Add an example of unprofitable PR Phi insertion

This test demonstrates weird behavior of SimplifyCFG: seems that bigger
size of block leads to worse optimization choice.

4 years agoPrevent IR-gen from emitting consteval declarations
Tyker [Sun, 14 Jun 2020 12:39:14 +0000 (14:39 +0200)]
Prevent IR-gen from emitting consteval declarations

Summary: with this patch instead of emitting calls to consteval function. the IR-gen will emit a store of the already computed result.

Reviewers: rsmith

Reviewed By: rsmith

Subscribers: cfe-commits

Tags: #clang

Differential Revision: https://reviews.llvm.org/D76420

4 years agoNFC: Make sure function arguments have the same name in declaration and definition
Kirill Bobyrev [Mon, 15 Jun 2020 08:45:08 +0000 (10:45 +0200)]
NFC: Make sure function arguments have the same name in declaration and definition

This code generates Clang-Tidy warnings otherwise.

4 years ago[mlir][shape] Lower Shape `ConstSizeOp` to Standard `ConstantOp`.
Alexander Belyaev [Fri, 12 Jun 2020 13:50:03 +0000 (15:50 +0200)]
[mlir][shape] Lower Shape `ConstSizeOp` to Standard `ConstantOp`.

Differential Revision: https://reviews.llvm.org/D81735

4 years ago[CostModel] getCFInstrCost in getUserCost.
Sam Parker [Mon, 15 Jun 2020 07:58:25 +0000 (08:58 +0100)]
[CostModel] getCFInstrCost in getUserCost.

Have BasicTTI call the base implementation so that both agree on the
default behaviour, which the default being a cost of '1'. This has
required an X86 specific implementation as it seems to be very
reliant on those instructions being free. Changes are also made to
AMDGPU so that their implementations distinguish between cost kinds,
so that the unrolling isn't affected. PowerPC also has its own
implementation to prevent changes to the reg-usage vectorizer test.

The cost model test changes now reflect that ret instructions are not
generally free.

Differential Revision: https://reviews.llvm.org/D79164

4 years ago[lldb] Handle all Clang::Type::Builtin enums
Kirill Bobyrev [Mon, 15 Jun 2020 08:18:59 +0000 (10:18 +0200)]
[lldb] Handle all Clang::Type::Builtin enums

Cleanup after https://reviews.llvm.org/D81459

4 years ago[CMake][runtimes] Skip adding 2nd set of the same variables for a generic target
Kristina Bessonova [Wed, 10 Jun 2020 14:18:30 +0000 (16:18 +0200)]
[CMake][runtimes] Skip adding 2nd set of the same variables for a generic target

No need to parse and add the same variables twice if runtimes is being
built for a generic target (i.e. w/o multilib).

Reviewed By: phosek

Differential Revision: https://reviews.llvm.org/D81574

4 years ago[NFCI][CostModel] Unify FNeg cost
Sam Parker [Thu, 11 Jun 2020 08:59:12 +0000 (09:59 +0100)]
[NFCI][CostModel] Unify FNeg cost

Enable TTIImpl::getUserCost to handle FNeg so that
getInstructionThroughput can call that instead. This means we can
remove the code in the AMDGPU backend too.

Differential Revision: https://reviews.llvm.org/D81635

4 years ago[IR] Prefer hasFnAttribute() where possible (NFC)
Nikita Popov [Sun, 14 Jun 2020 21:52:39 +0000 (23:52 +0200)]
[IR] Prefer hasFnAttribute() where possible (NFC)

When checking for an enum function attribute, use hasFnAttribute()
rather than hasAttribute() at FunctionIndex, because it is
significantly faster (and more concise to boot).

4 years ago[CostModel] Unify ExtractElement cost.
Sam Parker [Mon, 15 Jun 2020 07:27:14 +0000 (08:27 +0100)]
[CostModel] Unify ExtractElement cost.

Move the cost modelling, with the reduction pattern matching, from
getInstructionThroughput into generic TTIImpl::getUserCost. The
modelling in the AMDGPU backend can now be removed.

Differential Revision: https://reviews.llvm.org/D81643

4 years ago[NFC] Bail early simplifying unconditional branches
Max Kazantsev [Mon, 15 Jun 2020 06:58:54 +0000 (13:58 +0700)]
[NFC] Bail early simplifying unconditional branches

4 years ago[msan] Fix comment of __msan::Origin::isHeapOrigin
Fangrui Song [Mon, 15 Jun 2020 06:00:04 +0000 (23:00 -0700)]
[msan] Fix comment of __msan::Origin::isHeapOrigin

4 years agoRevert "Return "[InstCombine] Simplify compare of Phi with constant inputs against...
Sam Parker [Mon, 15 Jun 2020 06:46:28 +0000 (07:46 +0100)]
Revert "Return "[InstCombine] Simplify compare of Phi with constant inputs against a constant""

This reverts commit 23291b9863c8af7ad348c4a7d85d8d784df88eb1.

This caused performance regressions.

4 years ago[SVE] Ensure proper mangling of ACLE tuple types
Sander de Smalen [Mon, 15 Jun 2020 06:20:55 +0000 (07:20 +0100)]
[SVE] Ensure proper mangling of ACLE tuple types

The AAPCS specifies that the tuple types such as `svint32x2_t`
should use their `arm_sve.h` names when mangled instead of their
builtin names.

This patch also renames the internal types for the tuples to
be prefixed with `__clang_`, so they are not misinterpreted as
specified internal types like the non-tuple types which *are* defined
in the AAPCS. Using a builtin type for the tuples is a purely
a choice of the Clang implementation.

Reviewers: rsandifo-arm, c-rhodes, efriedma, rengolin

Reviewed By: efriedma

Tags: #clang

Differential Revision: https://reviews.llvm.org/D81721

4 years ago[SveEmitter] Add SVE tuple types and builtins for svundef.
Sander de Smalen [Mon, 15 Jun 2020 06:20:43 +0000 (07:20 +0100)]
[SveEmitter] Add SVE tuple types and builtins for svundef.

This patch adds new SVE types to Clang that describe tuples of SVE
vectors. For example `svint32x2_t` which maps to the twice-as-wide
vector `<vscale x 8 x i32>`. Similarly, `svint32x3_t` will map to
`<vscale x 12 x i32>`.

It also adds builtins to return an `undef` vector for a given
SVE type.

Reviewers: c-rhodes, david-arm, ctetreau, efriedma, rengolin

Reviewed By: c-rhodes

Tags: #clang

Differential Revision: https://reviews.llvm.org/D81459

4 years ago[SafeStack,NFC] Make StackColoring read-only
Vitaly Buka [Mon, 15 Jun 2020 06:03:13 +0000 (23:03 -0700)]
[SafeStack,NFC] Make StackColoring read-only

Move core which removes markers out of StackColoring.

4 years ago[SafeStack,NFC] Remove unneded branch
Vitaly Buka [Mon, 15 Jun 2020 04:28:34 +0000 (21:28 -0700)]
[SafeStack,NFC] Remove unneded branch

4 years ago[SafeStack,NFC] Fix naming style
Vitaly Buka [Mon, 15 Jun 2020 02:04:47 +0000 (19:04 -0700)]
[SafeStack,NFC] Fix naming style

4 years ago[SafeStack,NFC] Cleanup LiveRange interface
Vitaly Buka [Mon, 15 Jun 2020 01:59:23 +0000 (18:59 -0700)]
[SafeStack,NFC] Cleanup LiveRange interface

4 years ago[SafeStack,NFC] "const" cleanup
Vitaly Buka [Sun, 14 Jun 2020 23:31:09 +0000 (16:31 -0700)]
[SafeStack,NFC] "const" cleanup

4 years ago[SafeStack,NFC] Add BlockLifetimeInfo constructor
Vitaly Buka [Sun, 14 Jun 2020 23:06:07 +0000 (16:06 -0700)]
[SafeStack,NFC] Add BlockLifetimeInfo constructor

4 years ago[SafeStack,NFC] Use IntrinsicInst instead of Instruction
Vitaly Buka [Sun, 14 Jun 2020 22:58:10 +0000 (15:58 -0700)]
[SafeStack,NFC] Use IntrinsicInst instead of Instruction

4 years ago[SafeStack,NFC] Move ClColoring into SafeStack.cpp
Vitaly Buka [Sun, 14 Jun 2020 22:32:07 +0000 (15:32 -0700)]
[SafeStack,NFC] Move ClColoring into SafeStack.cpp

This allows to reuse the code in other components.

4 years ago[SafeStack,NFC] Move unconditional code into constructor
Vitaly Buka [Sun, 14 Jun 2020 21:35:34 +0000 (14:35 -0700)]
[SafeStack,NFC] Move unconditional code into constructor

Prepare to move ClColoring from SafeStackCode to SafeStackLayout.
This will allow to reuse the code in other components.

4 years ago[Test] Update test with check script, add two more motivating cases
Max Kazantsev [Mon, 15 Jun 2020 05:36:33 +0000 (12:36 +0700)]
[Test] Update test with check script, add two more motivating cases

4 years ago[PowerPC] fma chain break to expose more ILP
Chen Zheng [Mon, 15 Jun 2020 01:34:52 +0000 (21:34 -0400)]
[PowerPC] fma chain break to expose more ILP

This patch tries to reassociate two patterns related to FMA to expose
more ILP on PowerPC.

// Pattern 1:
//   A =  FADD X,  Y          (Leaf)
//   B =  FMA  A,  M21,  M22  (Prev)
//   C =  FMA  B,  M31,  M32  (Root)
// -->
//   A =  FMA  X,  M21,  M22
//   B =  FMA  Y,  M31,  M32
//   C =  FADD A,  B

// Pattern 2:
//   A =  FMA  X,  M11,  M12  (Leaf)
//   B =  FMA  A,  M21,  M22  (Prev)
//   C =  FMA  B,  M31,  M32  (Root)
// -->
//   A =  FMUL M11,  M12
//   B =  FMA  X,  M21,  M22
//   D =  FMA  A,  M31,  M32
//   C =  FADD B,  D

Reviewed By: jsji

Differential Revision: https://reviews.llvm.org/D80175

4 years ago[NewPM] Avoid redundant CGSCC run for updated SCC
Wenlei He [Tue, 26 May 2020 20:53:35 +0000 (13:53 -0700)]
[NewPM] Avoid redundant CGSCC run for updated SCC

Summary:
When an SCC got split due to inlining, we have two mechanisms for reprocessing the updated SCC, first is UR.UpdatedC
that repeatedly rerun the new, current SCC; second is a worklist for all newly split SCCs. We can avoid rerun of
the same SCC when the SCC is set to be processed by both mechanisms *back to back*. In pathological cases, such redundant
rerun could cause exponential size growth due to inlining along cycles, even when there's no SCC mutation and hence
convergence is not a problem.

Note that it's ok to have SCC updated and rerun immediately, and also in the work list if we have actually moved an SCC
to be topologically "below" the current one due to merging. In that case, we will need to revisit the current SCC after
those moved SCCs. For that reason, the redundant avoidance here only targets back to back rerun of the same SCC - the
case described by the now removed FIXME comment.

Reviewers: chandlerc, wmi

Subscribers: llvm-commits, hoy

Tags: #llvm

Differential Revision: https://reviews.llvm.org/D80589