review.tizen.org Git - platform/upstream/llvm.git/log

[DAGCombiner][X86] Fold a CONCAT_VECTORS of SHUFFLE_VECTOR and it's operand into wider SHUFFLE_VECTOR

This was showing as a source of *many* regressions
with more aggressive ZERO_EXTEND_VECTOR_INREG recognition.

commit | commitdiff | tree

Roman Lebedev [Sun, 1 Jan 2023 18:34:02 +0000 (21:34 +0300)]

[NFC][X86] Add tests for concatenation of shuffle's operand to the shuffle

commit | commitdiff | tree

Florian Hahn [Sun, 1 Jan 2023 19:48:48 +0000 (19:48 +0000)]

[VPlan] Adjust mergeReplicateRegions to be in line with mergeBlock (NFC)

Adjust mergeReplicateRegions to be in line with
mergeBlocksIntoPredecessors added in 36d70a6aea6b by collecting only the
valid candidates first.

Also rename to mergeReplicateRegionsIntoSuccessors and add missing
doc-comment.

This addresses post-commit suggestions by @Ayal.

commit | commitdiff | tree

Tom Praschan [Sat, 31 Dec 2022 15:24:55 +0000 (16:24 +0100)]

[clangd] Add extension for adding context (enclosing function or class) in references results

Relevant issue: https://github.com/clangd/clangd/issues/177

Reviewed By: nridge

Differential Revision: https://reviews.llvm.org/D137894

commit | commitdiff | tree

Kevin Sala [Sat, 31 Dec 2022 20:07:56 +0000 (21:07 +0100)]

[OpenMP][libomptarget] Add TargetParser as dependency in NextGen's JIT

This patch fixes an undefined reference to llvm::Triple::Triple(llvm::Twine const&).

Differential Revision: https://reviews.llvm.org/D140810

commit | commitdiff | tree

Ben Shi [Mon, 12 Dec 2022 03:40:57 +0000 (11:40 +0800)]

[lld][ELF] Support relocation R_AVR_LDS_STS_16 on AVRTiny devices

The relocation 'R_AVR_LDS_STS_16' is introduced for the compact
16-bit LDS/STS instructions on AVRTiny devices.

Reviewed By: MaskRay, aykevl

Differential Revision: https://reviews.llvm.org/D139807

commit | commitdiff | tree

Fangrui Song [Sun, 1 Jan 2023 06:39:20 +0000 (22:39 -0800)]

[X86] Revert -fno-plt __tls_get_addr workaround for old GNU ld

ENABLE_X86_RELAX_RELOCATIONS has defaulted to on in 2020.
This workaround is not exercised for a long time.

commit | commitdiff | tree

Fangrui Song [Sun, 1 Jan 2023 06:24:37 +0000 (22:24 -0800)]

[clang] Change CodeGenOptions::RelaxELFRelocations/assembler defaults to match MC default

MC default was flipped in 2016.
CMake ENABLE_X86_RELAX_RELOCATIONS defaults to on in 2020 (c41a18cf61790fc898dcda1055c3efbf442c14c0).
It makes sense for the CodeGenOptions::RelaxELFRelocations to match, so
that most -cc1/-cc1as command lines won't have this option.

This also fixes a minor issue: -fno-plt -S will now use GOT for
__tls_get_addr calls, matching -fno-plt -c.

commit | commitdiff | tree

Fangrui Song [Sun, 1 Jan 2023 04:50:54 +0000 (20:50 -0800)]

[ELF] Support TLS GD/LD relaxations for x86-32 -fno-plt

For x86-32, {clang,gcc} -fno-plt uses `call *___tls_get_addr@GOT(%reg)` instead
of `call ___tls_get_addr@PLT`. GD to IE/LE relaxations need to shift the offset
by one while LD to LE relaxation needs to use a different code sequence.

While here, fix some comments.

Fix https://github.com/llvm/llvm-project/issues/59769

Differential Revision: https://reviews.llvm.org/D140813

commit | commitdiff | tree

LLVM GN Syncbot [Sun, 1 Jan 2023 01:57:45 +0000 (01:57 +0000)]

[gn build] Port 89aad1e6a397

commit | commitdiff | tree

Owen Pan [Thu, 22 Dec 2022 09:21:17 +0000 (01:21 -0800)]

Reland [clang-format] Add an option to format integer literal separators

Previously committed in 46c94e5067b5 which was reverted in f0756e086010
due to a memory bug.

Closes #58949.

Differential Revision: https://reviews.llvm.org/D140543

commit | commitdiff | tree

Markus Böck [Sat, 31 Dec 2022 14:04:31 +0000 (15:04 +0100)]

[clang][NFC] Fix warning due to initializing a pointer with `false`

commit | commitdiff | tree

Florian Hahn [Sat, 31 Dec 2022 15:14:57 +0000 (15:14 +0000)]

[VPlan] Move GraphTraits definitions to separate header (NFC).

This reduces the size of VPlan.h and avoids future growth of the file
when the graph traits are extended in future patches.

Reviewed By: Ayal

Differential Revision: https://reviews.llvm.org/D140500

commit | commitdiff | tree

Uday Bondhugula [Sat, 31 Dec 2022 14:56:40 +0000 (20:26 +0530)]

[MLIR] Fix affine LICM pass for unknown region holding ops

Fix affine LICM pass for unknown region-holding ops. The logic was
completely ignoring regions of unknown ops leading to generation of
invalid IR on hoisting. Handle affine.parallel op among those with
regions that are supported.

Reviewed By: ftynse

Differential Revision: https://reviews.llvm.org/D140738

commit | commitdiff | tree

Uday Bondhugula [Sat, 24 Dec 2022 10:23:36 +0000 (15:53 +0530)]

NFC. Fix/improve style in affine-licm pass

The code here appears to be out of line with proper style and
guidelines. Fix this.

commit | commitdiff | tree

Mark de Wever [Fri, 23 Dec 2022 17:10:00 +0000 (18:10 +0100)]

[NFC][libc++] Removes concepts tests.

Concepts should be supported by all supported compilers. This is
probably a left over from an older version of the expected patch.

Reviewed By: #libc, philnik

Differential Revision: https://reviews.llvm.org/D140632

commit | commitdiff | tree

Jie Fu [Sat, 31 Dec 2022 07:31:33 +0000 (23:31 -0800)]

[docs] TestingGuide.rst: Fix incorrect description

This patch fixes two incorrect descriptions in TestingGuide.rst.

1. test/lit.site.cfg --> test/lit.site.cfg.py
   After https://reviews.llvm.org/D37838 , the `test/lit.site.cfg` had been added a .py extension.
   So it should be `test/lit.site.cfg.py`.

2. $(LLVM_OBJ_ROOT)/$(BuildMode)/bin --> $(LLVM_OBJ_ROOT)/bin
   The current build system doesn't create a $(BuildMode) directory any more.
   So it should be removed.

Reviewed By: mehdi_amini, MaskRay

Differential Revision: https://reviews.llvm.org/D140780

commit | commitdiff | tree

Fangrui Song [Sat, 31 Dec 2022 04:33:27 +0000 (20:33 -0800)]

[test] Opaquify cgprofile.ll

commit | commitdiff | tree

Qfrost [Sat, 31 Dec 2022 03:48:58 +0000 (11:48 +0800)]

llvm-readobj COFFDumper print PEHeader CheckSum
Differential Revision:https://reviews.llvm.org/D140555

commit | commitdiff | tree

Fangrui Song [Sat, 31 Dec 2022 03:43:19 +0000 (19:43 -0800)]

[msan] Remove non-working MSAN_LINUX_X86_64_OLD_MAPPING

r249754 (2015) added this macro (not set anywhere) for an escape hatch.
This old layout has been non-working for many years (Linux
arch/x86/include/asm/elf.h ELF_ET_DYN_BASE is outside the APP range):

    FATAL: Code 0x558547327980 is out of application range. Non-PIE build?
    FATAL: MemorySanitizer can not mmap the shadow memory.
    FATAL: Make sure to compile with -fPIE and to link with -pie.
    FATAL: Disabling ASLR is known to cause this error.
    FATAL: If running under GDB, try 'set disable-randomization off'.

Non-pie doesn't work either.

commit | commitdiff | tree

Yeting Kuo [Mon, 26 Dec 2022 09:52:56 +0000 (17:52 +0800)]

[RISCV] Use tail agnostic if inserting subvector/element at the end of a vector.

The patch tries to make more vslidup nodes use tail agnostic. The idea comes
from D125546 authored by Zack Chen.

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D140669

commit | commitdiff | tree

Fangrui Song [Sat, 31 Dec 2022 02:23:54 +0000 (18:23 -0800)]

[CMake] Remove unused ALL_SCUDO_SUPPORTED_ARCH after D138157

commit | commitdiff | tree

Roman Lebedev [Sat, 31 Dec 2022 01:58:22 +0000 (04:58 +0300)]

[CVP] When expanding `urem`, always freeze the nominator

As per the post-commit feedback - that was not the correct precondition
to avoid it here. I think we should generally start changing mentality
about `freeze`, the fact that we have been conditioned to be afraid of it
(or of anything in LLVM in general) is the key problem here.

commit | commitdiff | tree

Roman Lebedev [Sat, 31 Dec 2022 01:43:29 +0000 (04:43 +0300)]

[NFC][CVP] Add more tests for urem expansion

As mentioned in post-commit feedback.

commit | commitdiff | tree

Roman Lebedev [Sat, 31 Dec 2022 00:23:54 +0000 (03:23 +0300)]

[DAGCombiner][TLI] Do not fuse bitcast to <1 x ?> into a load/store of a vector

Single-element vectors are legalized by splitting,
so the the memory operations would also get scalarized.
While we do have some support to reconstruct scalarized loads,
we clearly don't catch everything.

The comment for the affected AArch64 store suggests that
having two stores was the desired outcome in the first place.

This was showing as a source of *many* regressions
with more aggressive ZERO_EXTEND_VECTOR_INREG recognition.

commit | commitdiff | tree

Roman Lebedev [Sat, 31 Dec 2022 00:10:43 +0000 (03:10 +0300)]

[NFC][Codegen][x86] Add tests for load/store of a single-element vectors

commit | commitdiff | tree

Roman Lebedev [Fri, 30 Dec 2022 23:07:50 +0000 (02:07 +0300)]

[NFC][TLI] Move `isLoadBitCastBeneficial()` implementation into source file

... so any change to it does not cause 700 source files to be recompiled.

commit | commitdiff | tree

Roman Lebedev [Fri, 30 Dec 2022 00:49:48 +0000 (03:49 +0300)]

[DAG] BUILD_VECTOR: absorb ZERO_EXTEND of a single first operand if all other ops are zeros

This kind of pattern seems to come up as regressions
with better ZERO_EXTEND_VECTOR_INREG recognition.

For initial implementation, this is quite restricted
to the minimal viable transform, otherwise there are
too many regressions to be dealt with.

commit | commitdiff | tree

Valery N Dmitriev [Fri, 30 Dec 2022 19:07:04 +0000 (11:07 -0800)]

[NFC] Test case intended to cover SLP cost for chain with masked gather loads.

SLP produces two gather loads (one feeds another).
For the first set of scalar loads GEP indices are all constant.
The result of the second load is then fed into reduction (as a seed).

Differential Revision: https://reviews.llvm.org/D140785

commit | commitdiff | tree

Eugene Zhulenev [Thu, 29 Dec 2022 21:24:45 +0000 (13:24 -0800)]

[mlir] Add a test for default valued dictionary attributes

+ Replace special placeholders ($_builder, etc...) in default attribute string

Reviewed By: ftynse

Differential Revision: https://reviews.llvm.org/D140765

commit | commitdiff | tree

Roman Lebedev [Fri, 30 Dec 2022 18:25:14 +0000 (21:25 +0300)]

[NFC][X86] Add exhaustive-ish coverage for broadcast of implicitly aext/zext element

Some of these even crash instruction selection for AVX512.
This is one of the patterns that comes up as regressions
with more aggressive ZERO_EXTEND_VECTOR_INREG recognition.

https://godbolt.org/z/x88aqfrT5

commit | commitdiff | tree

Vitaly Buka [Fri, 30 Dec 2022 18:57:12 +0000 (10:57 -0800)]

Revert "[CodeGen] Temporarily disable-lsr in HWASAN build"

We can do the same with cmake on the bot.

This reverts commit 8f70b848d339cabfaa8f1379d41dae11b9b75014.

commit | commitdiff | tree

Benjamin Kramer [Fri, 30 Dec 2022 18:07:19 +0000 (19:07 +0100)]

[Sema] Avoid double hash lookup. NFCI

commit | commitdiff | tree

Florian Hahn [Fri, 30 Dec 2022 18:24:34 +0000 (18:24 +0000)]

[VPlan] Also consider operands of sink candidates in same block.

Even if the the sink candidate is already in the target block, its
operands can be candidates for sinking. Queue them up as well. Also
moves the queuing logic to a helper.

commit | commitdiff | tree

Thomas Symalla [Fri, 30 Dec 2022 13:25:51 +0000 (14:25 +0100)]

[NFC][AMDGPU] Make method declarations in SIInstrInfo equivalent to their definitions.

Some functions from SIInstrInfo have their operands named different in
their declarations vs. their defs. This was caught by cppcheck.

Reviewed By: arsenm

Differential Revision: https://reviews.llvm.org/D140778

commit | commitdiff | tree

Benjamin Kramer [Fri, 30 Dec 2022 17:02:26 +0000 (18:02 +0100)]

[AArch64] Stringref'ize AArch64Subtarget constructor. NFCI

commit | commitdiff | tree

Ivan Butygin [Thu, 22 Dec 2022 19:50:52 +0000 (20:50 +0100)]

[mlir][gpu][spirv] Lower gpu reduction ops to spirv

Supports only "add" and "mul" ops for now. More ops will be added later.

Differential Revision: https://reviews.llvm.org/D140576

commit | commitdiff | tree

Roman Lebedev [Fri, 30 Dec 2022 16:06:59 +0000 (19:06 +0300)]

[CVP] Expand bound `urem`s

This kind of thing happens really frequently in LLVM's very own
shuffle combining methods, and it is even considered bad practice
to use `%` there, instead of using this expansion directly.
Though, many of the cases there have variable divisors,
so this won't help everything.

Simple case: https://alive2.llvm.org/ce/z/PjvYf-
There's alternative expansion via `umin`:
https://alive2.llvm.org/ce/z/hWCVPb

BUT while we can transform the first expansion
into the `umin` one (e.g. for SCEV):
https://alive2.llvm.org/ce/z/iNxKmJ
... we can't go in the opposite direction.

Also, the non-`umin` expansion seems somewhat more codegen-friendly:
https://godbolt.org/z/qzjx5bqWK
https://godbolt.org/z/a7bj1axbx

There's second variant of precondition:
https://alive2.llvm.org/ce/z/zE6cbM
but there the numerator must be non-undef / must be frozen.

commit | commitdiff | tree

Roman Lebedev [Fri, 30 Dec 2022 16:15:43 +0000 (19:15 +0300)]

[NFC][CVP] `processURem()`: add statistic and increase readability

commit | commitdiff | tree

Roman Lebedev [Fri, 30 Dec 2022 16:03:11 +0000 (19:03 +0300)]

[NFC][CVP] Add tests for urem expansion

commit | commitdiff | tree

Roman Lebedev [Fri, 30 Dec 2022 14:12:21 +0000 (17:12 +0300)]

[NFC][PhaseOrdering] Re-autogenerate check lines in one test

commit | commitdiff | tree

Matt Arsenault [Fri, 30 Dec 2022 16:08:33 +0000 (11:08 -0500)]

ValueTracking: Fix canCreateUndefOrPoison for saturating shifts

These need to consider the shift amount.

commit | commitdiff | tree

Ivan Kosarev [Fri, 30 Dec 2022 15:05:21 +0000 (15:05 +0000)]

[AMDGPU][AsmParser] Refine parsing cache policy modifiers.

Reviewed By: dp, arsenm

Differential Revision: https://reviews.llvm.org/D140108

commit | commitdiff | tree

Teresa Johnson [Thu, 29 Dec 2022 20:11:38 +0000 (12:11 -0800)]

[MemProf] Fix inline propagation of memprof metadata

It isn't correct to always remove memprof metadata MIBs from the
original allocation call after inlining.

Let's say we have the following partial call graph:

C     D
\   /
  v v
   B   E
   |  /
   v v
    A

where A contains an allocation call. If both contexts including B have
the same allocation behavior, the context in the memprof metadata on the
allocation will be pruned, and we will have 2 MIBs with contexts:
A,B and A,E.

Previously, if we inlined A into B we propagate the matching MIBs onto
the inlined allocation call in B' (A,B in this case), and remove it from
the original out of line allocation in A. This is correct if we have a
single round of bottom up inlining.

However, in the compiler we can have multiple invocations of the inliner
pass (e.g. LTO). We may also inline non-bottom up with an alternative
inliner such as the ModuleInliner. In that case, we could end up first
inlining B into C, without having inlined A into B. The call graph then
looks like:

    D
    |
    v
C'  B   E
\  |  /
  v v v
    A

If we subsequently (perhaps on a later invocation of bottom up inlining)
inline A into B, the previous handling would propagate the memprof MIB
context A,B up into the inlined allocation in B', and remove it from the
original allocation in A. The propagation into B' is fine, however, by
removing it from A's allocation, we no longer reflect the context coming
from C'.

To fix this, simply prevent the removal of MIB from the original
allocation callsites.

Note that the memprof_inline.ll test has some changes to existing
checking to replace "noncold" with "notcold" in the metadata. The
corresponding CHECK was accidentally commented out in the old version
and thus this mistake was not previously detected.

Differential Revision: https://reviews.llvm.org/D140764

commit | commitdiff | tree

Alexey Bataev [Thu, 22 Dec 2022 20:07:44 +0000 (12:07 -0800)]

[SLP]Do not emit many extractelements, reuse the single one emitted.

We do not need to emit many extractelements for each particular use, we
can reuse the only one, just need to adjust it to make it dominate on
all uses.

Differential Revision: https://reviews.llvm.org/D140580

commit | commitdiff | tree

Sanjay Patel [Fri, 30 Dec 2022 13:47:51 +0000 (08:47 -0500)]

[InstSimplify] fold selects where true/false arm is the same as condition

We managed to fold related patterns in issue #59704,
but we were missing these more basic folds:
https://alive2.llvm.org/ce/z/y6d7SN

commit | commitdiff | tree

Sanjay Patel [Thu, 29 Dec 2022 19:51:30 +0000 (14:51 -0500)]

[InstSimplify] add tests for select-of-bool; NFC

commit | commitdiff | tree

Matt Arsenault [Fri, 30 Dec 2022 13:40:31 +0000 (08:40 -0500)]

IROutliner: Convert tests to opaque pointers

Some of these show improvements. outlining-bitcasts.ll might not be
relevant anymore (or should be rewritten to test some other type of
non-pointer bitcast).

commit | commitdiff | tree

Dmitry Preobrazhensky [Fri, 30 Dec 2022 13:29:47 +0000 (16:29 +0300)]

[AMDGPU][GFX11] Correct tied src2 of v_fmac_f16_e64

src2 was incorrectly defined as VSrc_f16 but it is tied to dst which is VGPR_32. As a result, disassembler failed to decode src2.

Differential Revision: https://reviews.llvm.org/D140299

commit | commitdiff | tree

Matt Arsenault [Fri, 30 Dec 2022 13:07:33 +0000 (08:07 -0500)]

AMDGPU: Use default attributes on image dim intrinsics

These were missing nocallback and willreturn

commit | commitdiff | tree

Dmitry Preobrazhensky [Fri, 30 Dec 2022 13:20:38 +0000 (16:20 +0300)]

[AMDGPU][MC][GFX11] Correct encoding of neg modifier for v_dot2_f32_bf16

Fix a bug with neg_lo:[0,1,0] and neg_hi:[0,1,0] modifiers - they are accepted but not encoded.

Differential Revision: https://reviews.llvm.org/D140470

commit | commitdiff | tree

Alfonso Gregory [Tue, 21 Sep 2021 15:15:00 +0000 (11:15 -0400)]

[NFC][IR] Remove unused assignment to Offset

This value is overwritten anyway, so let's remove it

commit | commitdiff | tree

Filipp Zhinkin [Tue, 27 Dec 2022 15:08:54 +0000 (19:08 +0400)]

[ScheduleDAG] Support REQ_SEQUENCE unscheduling

REG_SEQUENCE node requires special treatment during the
unscheduling because the node is untyped and neither its
class, nor cost could be retrieved the same way as for
typed nodes.

Related issue: https://github.com/llvm/llvm-project/issues/58911

Reviewed By: efriedma

Differential Revision: https://reviews.llvm.org/D138837

commit | commitdiff | tree

jacquesguan [Thu, 29 Dec 2022 02:51:05 +0000 (10:51 +0800)]

[mlir][Arith] Fold integer shift op with zero.

This revision folds arith.shrui, arith.shrsi and arith.shli with zero
rhs to lhs.

Reviewed By: ftynse

Differential Revision: https://reviews.llvm.org/D140749

commit | commitdiff | tree

jacquesguan [Fri, 30 Dec 2022 08:38:52 +0000 (16:38 +0800)]

[mlir][Arith] Remove redundant defination, NFC.

Reviewed By: ftynse

Differential Revision: https://reviews.llvm.org/D140774

commit | commitdiff | tree

Mark Danial [Fri, 30 Dec 2022 07:50:46 +0000 (09:50 +0200)]

[Flang] Add ppc64 support to Optimizer/CodeGen/Target.cpp for AIX 64 bit

Adding support for ppc64 (big endian) in order to support flang on 64 bit AIX

Reviewed By: clementval, kiranchandramohan

Differential Revision: https://reviews.llvm.org/D138390

commit | commitdiff | tree

Andrzej Warzynski [Thu, 29 Dec 2022 10:28:47 +0000 (10:28 +0000)]

[mlir] Simplify a test for vectorizing tensor.extract

Remove unused arguments and the corresponding logic (e.g. affine maps).

Differential Revision: https://reviews.llvm.org/D140755

commit | commitdiff | tree

Vitaly Buka [Fri, 30 Dec 2022 07:25:22 +0000 (23:25 -0800)]

[CodeGen] Temporarily disable-lsr in HWASAN build

HWASAN exposes some non-determinism in the pass and triggers:
ScalarEvolution.cpp:11540: bool llvm::ScalarEvolution::isLoopEntryGuardedByCond(const Loop *, ICmpInst::Predicate, const SCEV *, const SCEV *): Assertion `isAvailableAtLoopEntry(LHS, L) && "LHS is not available at Loop Entry"' failed.

E.g.
https://lab.llvm.org/buildbot/#/builders/236/builds/1629/steps/16/logs/stdio
is broken after D137838. I tried to split D137838 into smaller patches
and the one which reproduced was just a move of cpp from one dir to another.

Maybe it has something do to with comparison of tagged pointeres and
PtrSets used in pass.

Issues is hard to reproduce, even slight changes in path, or preprocessing
cpp file hide it.

commit | commitdiff | tree

Jun Zhang [Thu, 29 Dec 2022 06:40:40 +0000 (14:40 +0800)]

[clang][dataflow] Fix crash when having boolean-to-integral casts.

Since now we just ignore all (implicit) integral casts, treating the
resulting value as the same as the underlying value, it could cause
inconsistency between values after `Join` if in some paths the type
doesn't strictly match. This could cause intermittent crashes.

std::optional<bool> o;
int x;
if (o.has_value()) {
x = o.value();
}

Fixes: https://github.com/llvm/llvm-project/issues/59728

Signed-off-by: Jun Zhang <jun@junz.org>
Differential Revision: https://reviews.llvm.org/D140753

commit | commitdiff | tree

NAKAMURA Takumi [Fri, 30 Dec 2022 04:21:10 +0000 (13:21 +0900)]

[Bazel] Exclude lib/Headers/openmp_wrappers/stdlib.h out of builtin_headers

It has been there since llvmorg-16-init-14999-g07ff3c5ccce6

commit | commitdiff | tree

liqinweng [Fri, 30 Dec 2022 03:54:06 +0000 (11:54 +0800)]

[MLIR][Arith][NFC] Use the interface of 'getElementTypeOrSelf' to get the resType

Reviewed By: ftynse

Differential Revision: https://reviews.llvm.org/D140608

commit | commitdiff | tree

Craig Topper [Fri, 30 Dec 2022 00:56:43 +0000 (16:56 -0800)]

[RISCV] Use SUB instead of XOR in lowerShiftLeftParts/lowerShiftRightParts./

isel is now capable of turning the SUB into XOR for shift amounts.
Though it uses NOT instead of XOR with ShiftSize-1.

By using SUB during lowering we enable more DAG combines with
other arithmetic on the shift amount.

commit | commitdiff | tree

Craig Topper [Fri, 30 Dec 2022 00:33:18 +0000 (16:33 -0800)]

[RISCV] RISCVDAGToDAGISel::selectShiftMask to shift by (sub size-1, X).

If the shift amount is (sub C, X) where C is -1 modulo the size of
the shift, we can replace the sub with a NOT.

We could also use XORI X, size-1, but NOT would work better with
c.not from the future Zce extension.

commit | commitdiff | tree

Matt Arsenault [Mon, 28 Nov 2022 22:26:14 +0000 (17:26 -0500)]

AMDGPU/clang: Remove target features from address space test builtins

It turns out we can codegen these on targets without flat addressing,
although the runtime probably didn't put anything useful there. The
proper diagnostic would be to disallow flat pointer uses or languages
with them, not this one edge case. Allows removing one of the special
cases requiring subtarget support in the device libraries.

commit | commitdiff | tree

Lei Zhang [Thu, 29 Dec 2022 23:27:29 +0000 (15:27 -0800)]

[mlir][spirv] Fail vector.bitcast conversion with different bitwidth

Depending on the target environment, we may need to emulate certain
types, which can cause issue with bitcast.

Reviewed By: ThomasRaoux

Differential Revision: https://reviews.llvm.org/D140437

commit | commitdiff | tree

Matt Arsenault [Thu, 29 Dec 2022 23:11:14 +0000 (18:11 -0500)]

libclc: Add parentheses to silence warning

Fixes #59209

commit | commitdiff | tree

Dmitry Borisenkov [Thu, 29 Dec 2022 23:04:55 +0000 (18:04 -0500)]

DAG: Prevent store value forwarding to distinct addrspace load

DAGCombiner replaces (load const_addr1) directly chained with (store
(val, const_addr2)) with val if address space stripped const_addr1 ==
const_addr2. The patch fixes the issue by checking address spaces as
well. However, it might makes sense to not to chain together side
effects that belong to different address spaces in the first place and
make SelectionDAG::root address space aware.

commit | commitdiff | tree

Craig Topper [Thu, 29 Dec 2022 22:45:04 +0000 (14:45 -0800)]

[RISCV] Teach RISCVDAGToDAGISel::selectShiftMask to bypass adds with constant.

If the shift amount is (add X, C) where C is 0 modulo the size of
the shift, we can bypass the add.

Similar to other targets like AArch64 and X86.

commit | commitdiff | tree

Matt Arsenault [Thu, 18 Jun 2020 01:46:28 +0000 (21:46 -0400)]

AMDGPU/clang: Add builtins for llvm.amdgcn.ballot

Use explicit _w32/_w64 suffixes for the wave size to be consistent
with the existing other wave dependent intrinsics. Also start
diagnosing trying to use both wave32 and wave64.

I would have preferred to avoid the +wavefrontsize64 spam on targets
where that's the only option, but avoiding this seems to be more work
than I expected.

commit | commitdiff | tree

Roman Lebedev [Thu, 29 Dec 2022 22:43:20 +0000 (01:43 +0300)]

[NFC][Codegen][X86] zero_extend_vector_inreg.ll: add SSE4.2 runline

commit | commitdiff | tree

Roman Lebedev [Thu, 29 Dec 2022 15:02:02 +0000 (18:02 +0300)]

[DAGCombiner] Try to partition ISD::EXTRACT_VECTOR_ELT to accomodate it's ISD::BUILD_VECTOR users

This mainly cleans up a few patterns that are legalized by scalarization
from a wide-element vector, but then are further split apart to build
a more narrow-sized-element vector. In particular this happens in some
cases for illegal ISD::ZERO_EXTEND_VECTOR_INREG.

Given a ISD::EXTRACT_VECTOR_ELT, which is a glorified bit sequence extract,
recursively analyse all of it's users. and try to model themselves as
bit sequence extractions. If all of them agree on the new, narrower element
type, and all of them can be modelled as ISD::EXTRACT_VECTOR_ELT's of that
new element type, do that, but only if unmodelled users are ISD::BUILD_VECTOR.

commit | commitdiff | tree

Craig Topper [Thu, 29 Dec 2022 21:58:46 +0000 (13:58 -0800)]

[TargetLowering] Teach BuildUDIV to take advantage of leading zeros in the dividend.

If the dividend has leading zeros, we can use them to reduce the
size of the multiplier and avoid the fixup cases.

This patch is for scalars only, but we might be able to do this
for vectors in a follow up.

Differential Revision: https://reviews.llvm.org/D140750

commit | commitdiff | tree

Gulfem Savrun Yeniceri [Thu, 29 Dec 2022 20:37:57 +0000 (20:37 +0000)]

[instrprof] Fix issue in binary-ids-padding.test

https://reviews.llvm.org/D135929 caused a failure in
binary-ids-padding.test in big endian configurations:
https://lab.llvm.org/buildbot/#/builders/231/builds/6709

binary-ids-padding.test writes the profile in little-endian format.
This patch changes the raw profile reader to use getDataEndianness()
instead of llvm::support::endian::system_endianness() to fix the issue.

commit | commitdiff | tree

Mehdi Amini [Thu, 22 Dec 2022 19:53:45 +0000 (19:53 +0000)]

Apply clang-tidy fixes for performance-for-range-copy in Vectorization.cpp (NFC)

commit | commitdiff | tree

Mehdi Amini [Sat, 10 Dec 2022 14:16:17 +0000 (14:16 +0000)]

Apply clang-tidy fixes for readability-identifier-naming in TestDialect.cpp (NFC)

commit | commitdiff | tree

serge-sans-paille [Thu, 29 Dec 2022 15:24:58 +0000 (16:24 +0100)]

[clang] Use try_emplace instead of insert when getting new identifier

This is both less verbose and slightly faster, according to:

https://llvm-compile-time-tracker.com/compare.php?from=d9ab3e82f30d646deff054230b0c742704a1cf26&to=73405077ad913f634797ffc7a7bbb110ac9cae99&stat=instructions:u

No functional change intended :-)

commit | commitdiff | tree

Eugene Zhulenev [Wed, 28 Dec 2022 20:45:17 +0000 (12:45 -0800)]

[mlir] Add constBuilderCall to DictionaryAttr

Reviewed By: ftynse

Differential Revision: https://reviews.llvm.org/D140740

commit | commitdiff | tree

Aart Bik [Thu, 29 Dec 2022 18:55:01 +0000 (10:55 -0800)]

[mlir][sparse] layout fixes (NFC)

Reviewed By: bixia

Differential Revision: https://reviews.llvm.org/D140761

commit | commitdiff | tree

Fangrui Song [Thu, 29 Dec 2022 20:16:20 +0000 (12:16 -0800)]

[ProfileData] Fix msan -fsanitize-memory-param-retval after D135929

test/tools/llvm-cov/load-multiple-objects.test calls
IndexedInstrProfReader::readBinaryIds with uninitialized BinaryIdsStart.

commit | commitdiff | tree

Markus Böck [Thu, 29 Dec 2022 19:47:29 +0000 (20:47 +0100)]

[llvm][AsmPrinter][NFC] Cleanup `GCMetadataPrinters` field

The field is currently `void*`, which was originlly chosen in 2010 to not need to include `DenseMap`. Since then, `DenseMap` has been included in the header file anyways, so there is no more need to for the indirection via `void*` and the cruft around it can be removed.

Differential Revision: https://reviews.llvm.org/D140758

commit | commitdiff | tree

Sanjay Patel [Thu, 29 Dec 2022 19:23:25 +0000 (14:23 -0500)]

[InstCombine] avoid miscompile in sinkNotIntoLogicalOp()

Fixes #59704

commit | commitdiff | tree

Sanjay Patel [Thu, 29 Dec 2022 19:19:15 +0000 (14:19 -0500)]

[InstCombine] add test for miscompile from sinkNotIntoLogicalOp(); NFC

issue #59704

commit | commitdiff | tree

Valery N Dmitriev [Thu, 29 Dec 2022 19:30:10 +0000 (11:30 -0800)]

[SLP] Fix debug print for cost in tryToVectorizeList - NFC.

Actual VF was confused with local variable named "VF".

commit | commitdiff | tree

Eduard Zingerman [Thu, 29 Dec 2022 18:29:44 +0000 (10:29 -0800)]

[BPF] Use SectionForGlobal() for section names computation in BTF

Use function TargetLoweringObjectFile::SectionForGlobal() to compute
section names for globals described in BTF_KIND_DATASEC records.

This fixes a discrepancy in section name computation between
BTFDebug::processGlobals and the rest of the LLVM pipeline.

Specifically, the following example illustrates the discrepancy
before this commit:

  struct Foo {
    int i;
  } __attribute__((aligned(16)));
  struct Foo foo = { 0 };

The initializer for 'foo' looks as follows:

  %struct.Foo { i32 0, [12 x i8] undef }

TargetLoweringObjectFile::SectionForGlobal() classifies 'foo' as
a part of '.bss' section, while BTFDebug::processGlobals
classified it as a part of '.data' section because of the
following expression:

  SecName = Global.getInitializer()->isZeroValue() ? ".bss" : ".data"

The isZeroValue() returns false because of the undef tail of the
initializer, while SectionForGlobal() allows such patterns in '.bss'.

Differential Revision: https://reviews.llvm.org/D140505

commit | commitdiff | tree

Valery N Dmitriev [Thu, 29 Dec 2022 19:02:36 +0000 (11:02 -0800)]

[SLP] A couple of minor improvements for slp graph view - NFC.

Show ScatterVectorize nodes in frames of blue color
and print vectorize tree indices.

commit | commitdiff | tree

Gulfem Savrun Yeniceri [Thu, 13 Oct 2022 00:50:10 +0000 (00:50 +0000)]

[profile] Add binary ids into indexed profiles

This patch adds support for including binary ids in an indexed profile.
It adds a new field into the header that points to the offset of the
binary id section. The binary id section consists of a size of the
section, and a list of binary ids (if they are present) that consist
of two parts: length and data.

This patch guarantees that indexed profile is backwards compatible
after adding binary ids.

Differential Revision: https://reviews.llvm.org/D135929

commit | commitdiff | tree

Fangrui Song [Thu, 29 Dec 2022 18:30:41 +0000 (10:30 -0800)]

[test] Fix dfsan/stack_trace.c

commit | commitdiff | tree

Craig Topper [Thu, 29 Dec 2022 17:35:34 +0000 (09:35 -0800)]

[Support] Fix what I think is an off by 1 bug in UnsignedDivisionByConstantInfo.

The code in Hacker's Delight says
`nc = -1 - (-d)%d;`

But we have
`NC = AllOnes - (AllOnes-D)%D`

The Hacker's Delight code is written for the LeadingZeros==0 case.
`AllOnes - D` is not the same as `-d` from Hacker's Delight.

This patch changes the code to
`NC = AllOnes - (AllOnes+1-D)%D`

This will increment AllOnes to 0 in the LeadingZeros==0 case. This
will make it equivalent to -D. I believe this is also correct for
LeadingZeros>0.

At least for i8, i16, and i32 the only divisor that changes is
((1 << (BitWidth-1)) | 1). Or 127 for i8, 32769 for i16, and 2147483649
for i32. These are all large enough that the quotient is 0 or 1 so
InstCombine replaces them with an icmp and zext before SelectionDAG.

Reviewed By: lebedev.ri

Differential Revision: https://reviews.llvm.org/D140636

commit | commitdiff | tree

Peter Klausler [Mon, 19 Dec 2022 20:41:25 +0000 (12:41 -0800)]

[flang] Restore checking for some optional values before use

Recent commits (2098ad7f00324ee0f2a6538f418a6f81dfdd2edb and
15a9a72ee68166c0cff3f036cacd3c82be66c729) replaced usage of "o.value()"
on optionals with "*o". Those optional values are expected to be
present -- but now, if it ever turns out that they're not,
compilation will proceed with garbage data rather than crashing
immediately (and more debuggably) with an uncaught exception.

Add asserts for presence to restore the previous level of safety.
(I could have revert these patches so as to resume used of .value()
but I didn't want to just have them get broken again.)

Differential Revision: https://reviews.llvm.org/D140340

commit | commitdiff | tree

Sanjay Patel [Thu, 29 Dec 2022 14:55:59 +0000 (09:55 -0500)]

[InstSimplify] fold exact divide to poison if it is known to not divide evenly

This is related to the discussion in D140665. I was looking over the demanded
bits implementation in IR and noticed that we just bail out of a potential
fold if a udiv is exact:
https://github.com/llvm/llvm-project/blob/82be8a1d2b00f6e89096b86f670a8be894c7b9e6/llvm/lib/Transforms/InstCombine/InstCombineSimplifyDemanded.cpp#L799

Also, see tests added with 7f0c11509e8f.

Then, I saw that we could lose a fold to poison if we zap the exact with that
transform, so this patch tries to catch that as a preliminary step.

Alive2 proofs:
https://alive2.llvm.org/ce/z/zCjKM7
https://alive2.llvm.org/ce/z/-tz_RK (trailing zeros must be "less-than")
https://alive2.llvm.org/ce/z/c9CMsJ (general proof and specific example)

Differential Revision: https://reviews.llvm.org/D140733

commit | commitdiff | tree

Benjamin Kramer [Thu, 29 Dec 2022 15:05:33 +0000 (16:05 +0100)]

Detemplate llvm::EmitGEPOffset and move it into a cpp file. NFC.

commit | commitdiff | tree

Mike Hommey [Thu, 29 Dec 2022 14:47:13 +0000 (14:47 +0000)]

[MC] [llvm-ml] Add support for the extrn keyword

It is the same as the already supported `extern` keyword.
https://learn.microsoft.com/en-us/cpp/assembler/masm/extrn?view=msvc-170

Fixes: https://github.com/llvm/llvm-project/issues/59712

Reviewed By: epastor

Differential Revision: https://reviews.llvm.org/D140679

commit | commitdiff | tree

Sanjay Patel [Thu, 29 Dec 2022 14:19:53 +0000 (09:19 -0500)]

[InstSimplify] fix formatting and add bool function argument comments; NFC

Make existing code conform with proposed additions in D140733.

commit | commitdiff | tree

Hsiangkai Wang [Fri, 23 Dec 2022 03:05:10 +0000 (03:05 +0000)]

[RISCV] Add fmin/fmax scalar instructions to isAssociativeAndCommutative

Follow-up patch of D140530.

We can add FMIN, FMAX to isAssociativeAndCommutative to
increase instruction-level parallelism by the existing MachineCombiner
pass.

Differential Revision: https://reviews.llvm.org/D140602

commit | commitdiff | tree

Hsiangkai Wang [Thu, 22 Dec 2022 07:07:37 +0000 (07:07 +0000)]

[RISCV] Add integer scalar instructions to isAssociativeAndCommutative

Inspired by D138107.

We can add ADD, AND, OR, XOR, MUL, MIN[U]/MAX[U] to isAssociativeAndCommutative
to increase instruction-level parallelism by the existing MachineCombiner pass.

Differential Revision: https://reviews.llvm.org/D140530

Domain: System / Toolchain;

RSS Atom