platform/upstream/llvm.git
20 months ago[mlir][tensor] Fix transpose permutation in tensor.pack generalization pattern
Quinn Dawkins [Mon, 20 Feb 2023 19:49:38 +0000 (14:49 -0500)]
[mlir][tensor] Fix transpose permutation in tensor.pack generalization pattern

The generalization pattern for tensor.pack was inverting the
innerDimsPos permutation when normalizing. Thus, the transpose op
produced by the generalization would be incorrect.

Differential Revision: https://reviews.llvm.org/D144425

20 months ago[LSAN] Make sure HWASAN tests are linked with lld
Kirill Stoimenov [Fri, 17 Feb 2023 19:33:20 +0000 (19:33 +0000)]
[LSAN] Make sure HWASAN tests are linked with lld

Reviewed By: vitalybuka

Differential Revision: https://reviews.llvm.org/D144289

20 months agoRevert "[TextAPI] Implement TBDv5 Writer"
Cyndy Ishida [Wed, 22 Feb 2023 19:39:23 +0000 (11:39 -0800)]
Revert "[TextAPI] Implement TBDv5 Writer"

This reverts commit 8217932aabcb271df7eb30e069fdace904299cba.

Breaks buildbots.

20 months ago[TextAPI] Implement TBDv5 Writer
Cyndy Ishida [Wed, 22 Feb 2023 18:00:07 +0000 (10:00 -0800)]
[TextAPI] Implement TBDv5 Writer

Create writer for new JSON format.
The new JSON format allows practically all attributes to be defined per
target in a universal library however the internal representation only
allows one for the time being. For now the write will always write those
attributes as default available for all targets (install name,
compatability & current version, swift abi, flags e.g. flatnamepace &
app exenstion safety)

rdar://102076911

Reviewed By: ributzka

Differential Revision: https://reviews.llvm.org/D144339

20 months ago[LLDB] Enable 64 bit debug/type offset
Alexander Yermolovich [Wed, 22 Feb 2023 19:30:52 +0000 (11:30 -0800)]
[LLDB] Enable 64 bit debug/type offset

This came out of from https://discourse.llvm.org/t/dwarf-dwp-4gb-limit/63902
With big binaries we can have .dwp files where .debug_info.dwo section can grow
beyond 4GB. We would like to support this in LLVM and in LLDB.

The plan is to enable manual parsing of cu/tu index in DWARF library
(https://reviews.llvm.org/D137882), and then
switch internal index data structure to 64 bit.
For the second part is to enable 64bit offset support in LLDB with
this patch.

Reviewed By: labath

Differential Revision: https://reviews.llvm.org/D138618

20 months ago[mlir][linalg] Vectorize tensor.extract using contiguous loads
Andrzej Warzynski [Thu, 29 Dec 2022 10:22:28 +0000 (10:22 +0000)]
[mlir][linalg] Vectorize tensor.extract using contiguous loads

This patch implements vectorization of tensor.extract for n-D tensor (n
>= 2) using contiguous load operations, i.e. `vector.transfer_read`. This
is a follow-up of https://reviews.llvm.org/D137660 in which gather loads
were used, i.e. `vector.gather`.

It is always safe to use gather load operations when the underlying
memory pattern is contiguous, but not vice-verse. At the moment, the
following conditions have to be met for contiguous loads to be
generated:
  1. The _output tensor_ must be a 1-D vector with the trailing dim > 1,
     e.g. `tensor<1x1x4xi32`,
  2. The trailing dim in the _input tensor_ must be > 1, e.g.
     `tensor<1x1x4i32>` would be fine, but not `tensor<1x4x1xi32>`.
If these conditions are not satisfied, gather loads are generated
instead.

Condition 1 guarantees that the iteration space of the corresponding
`linalg.generic` Op is relatively simple. That makes analysing the
indices for `tensor.extract` rather straightforward.

Condition 2 is mostly there to avoid weird vectorisation patterns
resulting in vectors like: `vector<1x1x1xi32>`. In practice, tensors
like `tensor<1x4x1xi32>` should be collapsed to `tensor<1x4xi32>` before
vectorisation, but that's beyond the scope of this patch.

If needed, both conditions can be relaxed. I've not been able to find a
good motivating example for these, hence skipping. For reference,
`tosa.resize` (lowered to Linalg) was the driving example used here.

As a bonus, the test from "vectorization-unsupported.mlir" is moved to
"vectorization.mlir" with proper CHECK lines added.

Differential Revision: https://reviews.llvm.org/D141998

Co-authored-by: Diego Caballero <diegocaballero@google.com>
20 months ago[libc] Partially implement 'quick_exit' for the GPU
Joseph Huber [Mon, 20 Feb 2023 19:42:08 +0000 (13:42 -0600)]
[libc] Partially implement 'quick_exit' for the GPU

This patch adds a partial implementation of `quick_exit` for the GPU
target. This is mainly done to test object libraries for the GPU and
will be expanded later. This will simply cause the threads to terminate
on the GPU without returning an error code. This functionality will be
added later to facilitate unit tests.

Reviewed By: sivachandra

Differential Revision: https://reviews.llvm.org/D144421

20 months ago[Serialization] Don't warn when a deserialized category is equivalent to an existing...
Volodymyr Sapsai [Thu, 16 Feb 2023 01:02:28 +0000 (17:02 -0800)]
[Serialization] Don't warn when a deserialized category is equivalent to an existing one.

A single class allows multiple categories to be defined for it. But if
two of such categories have the same name, we emit a warning. It's not a
hard error but a good indication of a potential mistake.

With modules, we can end up with the same category in different modules.
Diagnosing such a situation has little value as the categories in
different modules are equivalent and don't reflect the usage of the same
name for different purposes. When we deserialize a duplicate category,
compare it to an existing one and warn only when the new one is
different.

rdar://104582081

Differential Revision: https://reviews.llvm.org/D144149

20 months ago[DAGCombine] Fix an ICE in combineMinNumMaxNum(...)
Cameron McInally [Wed, 22 Feb 2023 18:36:11 +0000 (10:36 -0800)]
[DAGCombine] Fix an ICE in combineMinNumMaxNum(...)

65420c8041f4 introduced an ICE in combineMinNumMaxNum(...) when
combineMinNumMaxNumImpl(...) returns an SDValue(). Make sure to check that a
value is returned before trying to perform an FNEG on it.

GitHub Issue: #60924

Reviewed By: arsenm

Differential Revision: https://reviews.llvm.org/D144571

20 months ago[RISCV] XTHeadMemPair: Fix invalid mempair combine for types other than i32/i64
Manolis Tsamis [Wed, 22 Feb 2023 18:57:01 +0000 (19:57 +0100)]
[RISCV] XTHeadMemPair: Fix invalid mempair combine for types other than i32/i64

A mistake in the control flow of performMemPairCombine resulted in paired
loads/stores for types that were not supported by the instructions (i8/i16).
These loads/stores could not match the constraints of the patterns defined
in the THead td file and the compiler would throw a 'Cannot select' error.

This is now fixed and two new test functions have been added in xtheadmempair.ll
which would previously crash the compiler. The compiler was additionally tested
with a wide range of benchmarks and no issues were observed.

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D144559

20 months ago[libc] Implement ntohl and ntohs
Raman Tenneti [Wed, 22 Feb 2023 18:32:09 +0000 (10:32 -0800)]
[libc] Implement ntohl and ntohs

Per spec:

https://pubs.opengroup.org/onlinepubs/9699919799/functions/ntohl.html
https://pubs.opengroup.org/onlinepubs/9699919799/functions/ntohs.html

Co-authored-by: Jeff Bailey <jbailey@google.com>
Reviewed By: sivachandra

Differential Revision: https://reviews.llvm.org/D144506

20 months ago[clang-format] Fix format of my last patch
Vitaly Buka [Wed, 22 Feb 2023 18:11:41 +0000 (10:11 -0800)]
[clang-format] Fix format of my last patch

20 months ago[OpenMP] Target memory allocator fallback to default when no device available
Nawrin Sultana [Tue, 21 Feb 2023 23:17:57 +0000 (17:17 -0600)]
[OpenMP] Target memory allocator fallback to default when no device available

Differential Revision: https://reviews.llvm.org/D144525

20 months ago[RISCV] Reorganize deinterleave lowering for reuse [nfc]
Philip Reames [Wed, 22 Feb 2023 17:45:57 +0000 (09:45 -0800)]
[RISCV] Reorganize deinterleave lowering for reuse [nfc]

Not entirely sure we'll end up reusing the body of the transform, but personally I find this structure easier to follow anyways.

Differential Revision: https://reviews.llvm.org/D144532

20 months agoRevert "[IRMover] Remove UB implying parameter attributes when necessary"
Arthur Eubanks [Wed, 22 Feb 2023 17:40:22 +0000 (09:40 -0800)]
Revert "[IRMover] Remove UB implying parameter attributes when necessary"

This reverts commit 451799bb8261bde52bbfef226d019caf1d82aa42.

Causes crashes, see D139209

20 months ago[libc] Fix amdgpu startup code flags
Joseph Huber [Wed, 22 Feb 2023 17:36:12 +0000 (11:36 -0600)]
[libc] Fix amdgpu startup code flags

Summary:
Currently AMDGPU only barely supports cross-TU ELF linking. Full linking
is usually done via LTO. This requires passing the architecture to the
link job. This is done automatically via `-flto` since D144505. Add this
to the link options.

20 months agoRevert "[CMake] Unify llvm_check_linker_flag and llvm_check_compiler_linker_flag"
Petr Hosek [Wed, 22 Feb 2023 17:32:07 +0000 (17:32 +0000)]
Revert "[CMake] Unify llvm_check_linker_flag and llvm_check_compiler_linker_flag"

This reverts commit efae3174f09560353fb0f3d528bcbffe060d5438 since
it broke the standalone Flang build.

20 months ago[AMDGPU] Improve the lowering of raw_buffer_load_{i8,i16} and struct_buffer_load_...
Konstantina Mitropoulou [Fri, 17 Feb 2023 22:16:39 +0000 (14:16 -0800)]
[AMDGPU] Improve the lowering of raw_buffer_load_{i8,i16} and struct_buffer_load_{i8,i16} intrinsics

Currently, raw_buffer_load_{i8,i16} and struct_buffer_load_{i8,i16}
intrinsics are lowered as buffer_load_{u8,u16}. This patch combines
buffer_load_{u8,u16} and sign extension instructions in order to
generate buffer_load_{i8,i16} instructions.

Reviewed By: foad

Differential Revision: https://reviews.llvm.org/D144313

20 months ago[Assignment Tracking] Only set module flag if pass modifies a function
OCHyams [Wed, 22 Feb 2023 16:29:44 +0000 (16:29 +0000)]
[Assignment Tracking] Only set module flag if pass modifies a function

The module flag "debug-info-assignment-tracking" is set by the
declare-to-assign pass to indicate that assignment tracking is enabled. This
patch changes declare-to-assign to only set the flag if it makes a
modification. This lets the compiler avoid doing extra work for no benefit,
such as is currently the case if assignment tracking is requested for a build
with line tables only (-gmlt) or a build without debug info.

Reviewed By: scott.linder

Differential Revision: https://reviews.llvm.org/D144481

20 months ago[AMDGPU] Don't set src mods on permlane16
Joe Nash [Tue, 21 Feb 2023 21:14:27 +0000 (16:14 -0500)]
[AMDGPU] Don't set src mods on permlane16

v_permlane16_b32 and v_permlanex16_b32 should not set abs and neg src
modifiers on any input, but they can set op_sel on src0 or src1 to
represent fi or bc when desired. The ISel patterns were setting
the src_modifier bits to -1, effectively setting abs and neg as well,
whenever it was intended to set op_sel, due to an error in ISel. ISel
should now correctly only set the op_sel bits.

Reviewed By: arsenm

Differential Revision: https://reviews.llvm.org/D144519

20 months ago[Assignment Tracking] Choose better passes for RemoveRedundantDbgInstrs call
OCHyams [Wed, 22 Feb 2023 14:23:08 +0000 (14:23 +0000)]
[Assignment Tracking] Choose better passes for RemoveRedundantDbgInstrs call

Enabling assignment tracking without this patch, a significant amount of
additional compiler run time comes from the RemoveRedundantDbgInstrs call in
InstCombine. This patch reduces compiler run time by choosing better places to
call RemoveRedundantDbgInstrs.

In non-assignment-tracking builds, RemoveRedundantDbgInstrs is called by
InstCombine if LowerDbgDeclare makes a change (i.e. it is _sometimes_
called). In assignment tracking builds LowerDbgDeclare doesn't do anything. We
still need to clean up redundant intrinsics to avoid a large performance hit
due to the number of instructions, so the current approach is to have
InstCombine _always_ call RemoveRedundantDbgInstrs.

Instrumenting the compiler to run RemoveRedundantDbgInstrs after every pass and
dump the numbers and building CTMark/tramp3d-v4 indicates that SROA and
LoopVectorize give us a bigger bang (number removed) for buck (times pass is
run).

The compile time tracker reports that this patch reduces the number of
instructions retired building CTMark projects by an average of 1.1%.

Reviewed By: scott.linder

Differential Revision: https://reviews.llvm.org/D144483

20 months ago[AMDGPU] Move splitMUBUFOffset from AMDGPUBaseInfo to SIInstrInfo
Jay Foad [Wed, 22 Feb 2023 15:51:29 +0000 (15:51 +0000)]
[AMDGPU] Move splitMUBUFOffset from AMDGPUBaseInfo to SIInstrInfo

Moving this out of AMDGPUBaseInfo enforces that AMDGPUBaseInfo should
not be calling into GCNSubtarget.

Differential Revision: https://reviews.llvm.org/D144564

20 months ago[libc++][format] Removes transitive type_traits.
Mark de Wever [Tue, 21 Feb 2023 16:31:44 +0000 (17:31 +0100)]
[libc++][format] Removes transitive type_traits.

Reviewed By: #libc, philnik

Differential Revision: https://reviews.llvm.org/D144498

20 months ago[clang][doc] Removes obsolete comment.
Mark de Wever [Tue, 21 Feb 2023 16:26:52 +0000 (17:26 +0100)]
[clang][doc] Removes obsolete comment.

After reading about the documentation improvements on LLVM weekly this
part seems obsolete.

Reviewed By: ChuanqiXu

Differential Revision: https://reviews.llvm.org/D144497

20 months ago[Clang] Add options in LTO mode when cross compiling for AMDGPU
Joseph Huber [Tue, 21 Feb 2023 17:42:22 +0000 (11:42 -0600)]
[Clang] Add options in LTO mode when cross compiling for AMDGPU

The AMDGPU toolchain support directly compiling GPU images using
cross-compilation such as `clang --target=amdgcn-amd-amdhsa foo.c`.
However, when attempting to link bitcode this does not work because the
`-mcpu` options are not forwarded to the linker among others. This patch
simply adds them so that `clang --target=amdgcn-amd-amdhsa foo.c -flto`
works correctly.

Reviewed By: JonChesterfield

Differential Revision: https://reviews.llvm.org/D144505

20 months ago[SCEV] Remove unused alignof/offsetof print special cases (NFC)
Nikita Popov [Wed, 22 Feb 2023 15:58:29 +0000 (16:58 +0100)]
[SCEV] Remove unused alignof/offsetof print special cases (NFC)

These shouldn't really reach SCEV without being folded away first,
and we don't have any tests that hit these cases.

The sizeof case does occur with scalable types.

20 months ago[CUDA][SPIRV] Match builtin types and __GCC_ATOMIC_XXX_LOCK_FREE macros on host/device
ShangwuYao [Tue, 14 Feb 2023 22:05:41 +0000 (22:05 +0000)]
[CUDA][SPIRV] Match builtin types and __GCC_ATOMIC_XXX_LOCK_FREE macros on host/device

This change matches the CUDA/SPIRV behavior with CUDA/NVPTX, and makes some builtin types
and __GCC_ATOMIC_XXX_LOCK_FREE macros the same between the host and device. This is only
done when host triple is provided and known, otherwise the behavior is unchanged.

Reviewed By: yaxunl

Differential Revision: https://reviews.llvm.org/D144047

20 months ago[flang] Fix build after D143339
Matthias Springer [Wed, 22 Feb 2023 14:25:57 +0000 (15:25 +0100)]
[flang] Fix build after D143339

This change silences a compiler warning: `base class should be explictily initialized in the copy constructor`.

Differential Revision: https://reviews.llvm.org/D144556

20 months ago[AMDGPU] Improved wide multiplies
Jessica Del [Tue, 21 Feb 2023 08:40:07 +0000 (09:40 +0100)]
[AMDGPU] Improved wide multiplies

These checks show optimized instructions if an operand is known to be
(partially) zero.

Change-Id: Ie2f6d0d3ee9d5b279d1f4c1dd0787492e39cc77a

Differential Revision: https://reviews.llvm.org/D140208

20 months ago[GlobalOpt] Add struct for parts during GlobalSRA (NFC)
Nikita Popov [Fri, 15 Jul 2022 14:36:46 +0000 (16:36 +0200)]
[GlobalOpt] Add struct for parts during GlobalSRA (NFC)

This is in preparation for https://reviews.llvm.org/D129857.

20 months ago[mlir][IR][NFC] Rename RewriterBase::replaceUseIf to replaceUsesWithIf
Matthias Springer [Wed, 22 Feb 2023 15:12:27 +0000 (16:12 +0100)]
[mlir][IR][NFC] Rename RewriterBase::replaceUseIf to replaceUsesWithIf

This is for consistency with `Value::replaceUsesWithIf`.

Differential Revision: https://reviews.llvm.org/D144547

20 months ago[clangd] Publish diagnostics with stale preambles
Kadir Cetinkaya [Tue, 21 Feb 2023 08:44:25 +0000 (09:44 +0100)]
[clangd] Publish diagnostics with stale preambles

This patch achieves this by building an AST and invoking main file
callbacks on each update, in addition to preamble updates.

It means we might have some extra AST builds now (e.g. if an update was
with a stale preamble and there were no reads on it, we would only build
an AST once we had the fresh preamble. Now we'll build 2, once with the
stale preamble and another with the fresh one, but we'll have one more
diagnostics cycle in between.).

This patch preserves forward progress of diagnostics by always using the
latest main file contents when emitting diagnostics after preamble
builds. It also guarantees eventual consistency:
- if an update doesn't invalidate preamble, we'll emit diagnostics with
  fresh preamble already.
- if an update invalidates preamble, we'll first emit diagnostics with
  stale contents, and then once the preamble build finishes it'll emit
  diagnostics (as preamble has changed) with newest version.

This has implications on parsing callbacks, as previously onMainAST
callback was called at most once, now it can be called up to 2 times.
All of the existing clients can already deal with callback firing
multiple times.

Differential Revision: https://reviews.llvm.org/D144456

20 months ago[clangd] Provide patched diagnostics with preamble patch
Kadir Cetinkaya [Wed, 1 Feb 2023 18:26:10 +0000 (19:26 +0100)]
[clangd] Provide patched diagnostics with preamble patch

Translates diagnostics from baseline preamble to relevant modified
contents.

Translation is done by looking for a set of lines that have the same
contents in diagnostic/note/fix ranges inside baseline and modified
contents.

A diagnostic is preserved if its main range is outside of main file or
there's a translation from baseline to modified contents. Later on fixes
and notes attached to that diagnostic with relevant ranges are also
translated and preserved.

Depends on D143095

Differential Revision: https://reviews.llvm.org/D143096

20 months ago[clangd] Respect preamble-patch when handling diags
Kadir Cetinkaya [Wed, 1 Feb 2023 18:10:03 +0000 (19:10 +0100)]
[clangd] Respect preamble-patch when handling diags

Depends on D143093

Differential Revision: https://reviews.llvm.org/D143095

20 months ago[clangd] #undef macros inside preamble patch
Kadir Cetinkaya [Wed, 1 Feb 2023 17:37:46 +0000 (18:37 +0100)]
[clangd] #undef macros inside preamble patch

That way we can stop generating false macro redefinition diagnostics.

Depends on D142890

Differential Revision: https://reviews.llvm.org/D143093

20 months ago[clangd] Add config option for fast diagnostics mode
Kadir Cetinkaya [Mon, 30 Jan 2023 07:06:41 +0000 (08:06 +0100)]
[clangd] Add config option for fast diagnostics mode

Also wire it up for use with patched preambles and introduce test cases
for behaviour we'd like to improve.

Differential Revision: https://reviews.llvm.org/D142890

20 months ago[mlir] Add missing patterns for lowering to Libm.
Alexander Belyaev [Wed, 22 Feb 2023 14:37:28 +0000 (15:37 +0100)]
[mlir] Add missing patterns for lowering to Libm.

Differential Revision: https://reviews.llvm.org/D144561

20 months ago[mlir][bufferization] Consider defaultMemorySpace when bufferizing FuncOp
Maya Amrami [Wed, 22 Feb 2023 07:48:32 +0000 (09:48 +0200)]
[mlir][bufferization] Consider defaultMemorySpace when bufferizing FuncOp

The function arguments and results type will have the default memory space.

Reviewed By: springerm

Differential Revision: https://reviews.llvm.org/D144539

20 months ago[gn build] Port 200034978b95
LLVM GN Syncbot [Wed, 22 Feb 2023 14:32:18 +0000 (14:32 +0000)]
[gn build] Port 200034978b95

20 months ago[flang][hlfir] Insert casts in hlfir.associate rewrites
Jean Perier [Wed, 22 Feb 2023 14:25:43 +0000 (15:25 +0100)]
[flang][hlfir] Insert casts in hlfir.associate rewrites

When the associated expression came from a moved variable, the
type of the moved variable may not exactly match the hlfir.associate
result and cannot be re-used directly. Insert fir.convert/fir.box_addr
as needed.

Reviewed By: clementval

Differential Revision: https://reviews.llvm.org/D144557

20 months ago[AMDGPU] Fix an assertion failure when folding into src2 of V_FMAC_F16
Jay Foad [Wed, 22 Feb 2023 13:39:58 +0000 (13:39 +0000)]
[AMDGPU] Fix an assertion failure when folding into src2 of V_FMAC_F16

D139469 "[AMDGPU] Enable OMod on more VOP3 instructions" caused an
assertion failure when trying to fold into src2 of V_FMAC_F16. It would
temporarily convert the instruction to V_FMA_F16_gfx9 and add an opsel
operand, but if the fold still failed then it would forget to remove the
opsel operand.

Differential Revision: https://reviews.llvm.org/D144558

20 months ago[MemProf] Add printing utilities for MemProf summary structures
Teresa Johnson [Sat, 18 Feb 2023 02:10:02 +0000 (18:10 -0800)]
[MemProf] Add printing utilities for MemProf summary structures

Split out of D140908 as suggested, and added unit testing.

Differential Revision: https://reviews.llvm.org/D144314

20 months ago[libc++][NFC] Replace typedef with using declarations in <__atomic/*>
Nikolas Klauser [Fri, 17 Feb 2023 12:07:52 +0000 (13:07 +0100)]
[libc++][NFC] Replace typedef with using declarations in <__atomic/*>

Reviewed By: Mordante, #libc

Spies: arichardson, libcxx-commits

Differential Revision: https://reviews.llvm.org/D144259

20 months ago[Local][SimplifyCFG][GVN] Handle !nontemporal in combineMetadata
Daniel Woodworth [Wed, 22 Feb 2023 12:38:53 +0000 (13:38 +0100)]
[Local][SimplifyCFG][GVN] Handle !nontemporal in combineMetadata

SimplifyCFG currently drops !nontemporal metadata when sinking
common instructions. With this change, SimplifyCFG and similar
transforms will preserve !nontemporal metadata as long as it is
set on both original instructions.

Differential Revision: https://reviews.llvm.org/D144298

20 months ago[BOLT] Fix data reoder for aarch64
Vladislav Khmelevsky [Wed, 15 Feb 2023 13:18:37 +0000 (17:18 +0400)]
[BOLT] Fix data reoder for aarch64

Use proper relocation for aarch64

Differential Revision: https://reviews.llvm.org/D144095

20 months ago[gn build] Port 3b8c0b342e16
LLVM GN Syncbot [Wed, 22 Feb 2023 12:33:42 +0000 (12:33 +0000)]
[gn build] Port 3b8c0b342e16

20 months ago[RISCV] Add new pass to transform undef to pseudo for vector values.
Piyou Chen [Wed, 22 Feb 2023 12:02:06 +0000 (04:02 -0800)]
[RISCV] Add new pass to transform undef to pseudo for vector values.

RISC-V vector instruction has register overlapping constraint for certain
instructions, and will cause illegal instruction trap if violated, we use
early clobber to model this constraint, but it can't prevent register allocator
allocated same or overlapped if the input register is undef value, so convert
IMPLICIT_DEF to temporary pseudo could prevent that happen, it's not best way
to resolve this. Ideally we should model the constraint right, but before we
model the constraint right, it's the approach to prevent that happen.

See also: https://github.com/llvm/llvm-project/issues/50157

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D129735

20 months ago[AArch64] Lower all fp zero buildvectors through BUILD_VECTOR.
David Green [Wed, 22 Feb 2023 11:26:41 +0000 (11:26 +0000)]
[AArch64] Lower all fp zero buildvectors through BUILD_VECTOR.

Just like with integers, we can treat zero fp buildvector as legal so that they
can be recognized in tablegen patterns using immAllZerosV.

20 months ago[AArch64] Fix N2 SchedModel element-to-element INS latencies
Sjoerd Meijer [Tue, 21 Feb 2023 16:00:52 +0000 (16:00 +0000)]
[AArch64] Fix N2 SchedModel element-to-element INS latencies

The instruction regexp "^INSv" for the insert gen-reg-to-element was also
matching the element-to-element instruction, which only has a latency of 2 and
not 5, so we were getting that wrong.

Differential Revision: https://reviews.llvm.org/D144508

20 months ago[gn build] Port 850dab0f2537
LLVM GN Syncbot [Wed, 22 Feb 2023 10:24:28 +0000 (10:24 +0000)]
[gn build] Port 850dab0f2537

20 months ago[RISCV] Add vendor-defined XTheadSync (Multi-core synchronization) extension
Manolis Tsamis [Wed, 22 Feb 2023 09:58:50 +0000 (10:58 +0100)]
[RISCV] Add vendor-defined XTheadSync (Multi-core synchronization) extension

The vendor-defined XTheadSync (no comparable standard extension exists
at the time of writing) extension adds multi-core synchronization
instructions.

It is supported by the C9xx cores (e.g., found in the wild in the
Allwinner D1) by Alibaba T-Head.

The current (as of this commit) public documentation for this
extension is available at:
  https://github.com/T-head-Semi/thead-extension-spec/releases/download/2.2.2/xthead-2023-01-30-2.2.2.pdf

Support for these instructions has already landed in GNU Binutils:
  https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=547c18d9bb95571261dbd17f4767194037eb82bd

Depends on D144496

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D144501

20 months ago[LV][RISCV] Don't interleave scalable vector loops
Luke Lau [Tue, 21 Feb 2023 14:56:22 +0000 (14:56 +0000)]
[LV][RISCV] Don't interleave scalable vector loops

It's less clear with scalable vectors than fixed length vectors that
interleaving exposes more ILP, as scalable vectors can be thought of a
sort of hardware form of interleaving, especially with larger LMULs.
This also addresses the unexpected additional unrolling that occurs when
using larger LMULs in the loop vectorizer.

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D144485

20 months ago[LV][NFC] Use ElementCount for getMaxInterleaveFactor
Luke Lau [Tue, 21 Feb 2023 13:00:18 +0000 (13:00 +0000)]
[LV][NFC] Use ElementCount for getMaxInterleaveFactor

In order to allow targets to disable interleaving for scalable vectors, pass the entire VF's ElementCount to getMaxInterleaveFactor.
This is based off of the approach used here: https://repo.hca.bsc.es/gitlab/rferrer/llvm-epi/-/commit/8d36708507b3c378078b9fe364bc548354aaec86

The plan would then be to disable interleaving on scalable VFs on RISC-V in a follow up patch.
See https://reviews.llvm.org/D143723#4132349

Reviewed By: reames

Differential Revision: https://reviews.llvm.org/D144474

20 months ago[WebAssembly] pmin/pmax fixes
Samuel Parker [Tue, 21 Feb 2023 10:52:27 +0000 (10:52 +0000)]
[WebAssembly] pmin/pmax fixes

Reverse the operand ordering to ? rhs : lhs.

Differential Revision: https://reviews.llvm.org/D144466

20 months ago[RISCV] Add vendor-defined XTheadCmo (Cache Management Operations) extension
Manolis Tsamis [Wed, 22 Feb 2023 09:36:42 +0000 (10:36 +0100)]
[RISCV] Add vendor-defined XTheadCmo (Cache Management Operations) extension

The vendor-defined XTHeadCmo (there are some similarities with the
Zicbom standard extension) extension adds cache management instructions.

It is supported by the C9xx cores (e.g., found in the wild in the
Allwinner D1) by Alibaba T-Head.

The current (as of this commit) public documentation for this
extension is available at:
  https://github.com/T-head-Semi/thead-extension-spec/releases/download/2.2.2/xthead-2023-01-30-2.2.2.pdf

Support for these instructions has already landed in GNU Binutils:
  https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=a9ba8bc2d396fb8ae2b892f3bc6be8cdfe4b555c

Reviewed By: craig.topper

Differential Revision: https://reviews.llvm.org/D144496

20 months ago[BOLT][AArch64] Replace NOP with adrp in AdrRelaxationPass to preserve relative offsets.
Denis Revunov [Wed, 18 Jan 2023 14:15:25 +0000 (09:15 -0500)]
[BOLT][AArch64] Replace NOP with adrp in AdrRelaxationPass to preserve relative offsets.

Avoid replacing one adr instruction with two adrp+add by utilizing linker-provided nops
when they are present. By doing so we preserve relative offsets of next instructions
in a function which reduces chances to break undetected jump tables. This commit makes
release-mode lld-linked clang, lld and etc work after BOLT.

Reviewed By: rafauler, yota9

Differential Revision: https://reviews.llvm.org/D143887

20 months ago[FunctionAttrs] Move nosync inference into inferAttrsFromFunctionBodies() (NFC)
Nikita Popov [Wed, 22 Feb 2023 09:23:11 +0000 (10:23 +0100)]
[FunctionAttrs] Move nosync inference into inferAttrsFromFunctionBodies() (NFC)

There doesn't appear to be any reason why this attribute is
inferred separately from other ones that use AttributeInferer.

20 months ago[mlir] RewriterBase::Listener: Add notifyOperationModified callback
Matthias Springer [Wed, 22 Feb 2023 09:41:22 +0000 (10:41 +0100)]
[mlir] RewriterBase::Listener: Add notifyOperationModified callback

This callback is triggered by `finalizeRootUpdate`. This allows listeners to listen for in-place op modifications without creating a new RewriterBase subclass.

Differential Revision: https://reviews.llvm.org/D143380

20 months ago[mlir][SCF][Utils][NFC] Make some utils public for better reuse
Matthias Springer [Wed, 22 Feb 2023 09:33:09 +0000 (10:33 +0100)]
[mlir][SCF][Utils][NFC] Make some utils public for better reuse

These functions will be used in a subsequent change. Also some minor refactoring.

Differential Revision: https://reviews.llvm.org/D143909

20 months ago[mlir] GreedyPatternRewriteDriver: Support optional Listener
Matthias Springer [Wed, 22 Feb 2023 09:25:37 +0000 (10:25 +0100)]
[mlir] GreedyPatternRewriteDriver: Support optional Listener

Allow an optional `RewriterBase::Listener` to be attached to greedy pattern rewrites, so that clients can listen for IR modifications.

Differential Revision: https://reviews.llvm.org/D143340

20 months ago[NFC] Class for building MultilibSet
Michael Platings [Tue, 7 Feb 2023 11:23:46 +0000 (11:23 +0000)]
[NFC] Class for building MultilibSet

The functionality in MultilibSet for creating it is tied to its current
implementation. Putting that code in a separate class is an enabler for
changing the MultilibSet implementation.

Differential Revision: https://reviews.llvm.org/D142893

20 months ago[Clang] Improve hermeticity of clang_rt tests
Michael Platings [Wed, 22 Feb 2023 08:43:35 +0000 (08:43 +0000)]
[Clang] Improve hermeticity of clang_rt tests

Specifying --sysroot prevents libclang_rt from being located in standard
library directories.

Differential Revision: https://reviews.llvm.org/D144542

20 months ago[AArch64] Fix abs(sub nsw) -> absd
Ricardo Jesus [Mon, 20 Feb 2023 16:44:40 +0000 (16:44 +0000)]
[AArch64] Fix abs(sub nsw) -> absd

This partially reverts a regression introduced in 8f25e382c5b1 for
AArch64 targets. In particular, we restore the logic of `(abs (sub nsw
x, y)) -> abds(x, y)` for all targets except X86, which keeps the logic
introduced in 8f25e382c5b1. See also https://reviews.llvm.org/D142288.

Differential Revision: https://reviews.llvm.org/D144379

20 months ago[Lex] Fix a crash in updateConsecutiveMacroArgTokens.
Haojian Wu [Tue, 14 Feb 2023 23:39:10 +0000 (00:39 +0100)]
[Lex] Fix a crash in updateConsecutiveMacroArgTokens.

Fixes https://github.com/llvm/llvm-project/issues/60722.

Differential Revision: https://reviews.llvm.org/D144054

20 months ago[TableGen] Emit static const globals in getOperandType
Michael Liao [Wed, 22 Feb 2023 08:36:36 +0000 (03:36 -0500)]
[TableGen] Emit static const globals in getOperandType

- That saves the overhead of operand type querying.

20 months ago[clang] Add the check of membership for the issue #58674 and improve the lookup process
Liming Liu [Sun, 12 Feb 2023 01:08:13 +0000 (09:08 +0800)]
[clang] Add the check of membership for the issue #58674 and improve the lookup process

This patch includes the commit 01adf96ebc86 and a fix of unhandled declaration
references.

When looking up base classes, Clang first checks whether a base class is a
template and takes the specialized template based on it. However, the base class
might be instantiated, and the above behavior can lose information.

This patch fixes the problem by first checking whether a base class is a record
declaration, so the instantiated one will be taken.

Differential Revision: https://reviews.llvm.org/D143840

20 months ago[FunctionAttrs] Slightly clean up returned attr inference (NFC)
Nikita Popov [Wed, 22 Feb 2023 08:31:12 +0000 (09:31 +0100)]
[FunctionAttrs] Slightly clean up returned attr inference (NFC)

Use hasAttrSomewhere() and directly return Argument from the
helper.

20 months agoThis change makes `RewriterBase` symmetric to `OpBuilder`.
Matthias Springer [Wed, 22 Feb 2023 08:12:24 +0000 (09:12 +0100)]
This change makes `RewriterBase` symmetric to `OpBuilder`.

```
  OpBuilder           OpBuilder::Listener
      ^                        ^
      |                        |
RewriterBase        RewriterBase::Listener
```

* Clients can listen to IR modifications with `RewriterBase::Listener`.
* `RewriterBase` no longer inherits from `OpBuilder::Listener`.
* Only a single listener can be registered at the moment (same as `OpBuilder`).

RFC: https://discourse.llvm.org/t/rfc-listeners-for-rewriterbase/68198

Differential Revision: https://reviews.llvm.org/D143339

20 months ago[flang][runtime] Add API to help with the difficult array constructor cases
Jean Perier [Wed, 22 Feb 2023 08:16:01 +0000 (09:16 +0100)]
[flang][runtime] Add API to help with the difficult array constructor cases

This runtime API can be used to lower any flavor of array constructors,
but is mainly intended to be used with:

- array constructors for which the extent or length parameters cannot
 be computed without lowering some ac-value or ac-implied-do-control
 that cannot be pre-evaluated.

- array constructors of a derived type with allocatable component where
 copy is not trivial or PDTS.

Example of use cases:
 - `[((i+j,i=1, ifoo()), j=1,n)]` where ifoo() is not pure.
 - `[return_allocatable_array(), return_allocatable_array()]`

Differential Revision: https://reviews.llvm.org/D144411

20 months ago[X86][MC] Correct the description & assert for mmm field of EVEX, NFCI
Shengchen Kan [Wed, 22 Feb 2023 07:56:08 +0000 (15:56 +0800)]
[X86][MC] Correct the description & assert for mmm field of EVEX, NFCI

20 months agoRevert "[NFC] Make FPClassTest a bitmask enumeration"
Nikita Popov [Wed, 22 Feb 2023 07:56:19 +0000 (08:56 +0100)]
Revert "[NFC] Make FPClassTest a bitmask enumeration"

This reverts commit 2e416cdd52c1079b8c7cb1f7d7e557c889a4fb56.

Breaks the GCC build:

In file included from /home/npopov/repos/llvm-project/llvm/include/llvm/ADT/FloatingPointMode.h:18,
                 from /home/npopov/repos/llvm-project/llvm/include/llvm/ADT/APFloat.h:20,
                 from /home/npopov/repos/llvm-project/llvm/lib/Support/APFloat.cpp:14:
/home/npopov/repos/llvm-project/llvm/include/llvm/ADT/BitmaskEnum.h:66:22: error: extra qualification not allowed [-fpermissive]
   66 |   template <> struct llvm::is_bitmask_enum<Enum> : std::true_type {};          \
      |                      ^~~~
/home/npopov/repos/llvm-project/llvm/include/llvm/ADT/FloatingPointMode.h:223:1: note: in expansion of macro ‘LLVM_DECLARE_ENUM_AS_BITMASK’
  223 | LLVM_DECLARE_ENUM_AS_BITMASK(FPClassTest, /* LargestValue */ fcPosInf);
      | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/npopov/repos/llvm-project/llvm/include/llvm/ADT/BitmaskEnum.h:67:22: error: extra qualification not allowed [-fpermissive]
   67 |   template <> struct llvm::largest_bitmask_enum_bit<Enum> {                    \
      |                      ^~~~
/home/npopov/repos/llvm-project/llvm/include/llvm/ADT/FloatingPointMode.h:223:1: note: in expansion of macro ‘LLVM_DECLARE_ENUM_AS_BITMASK’
  223 | LLVM_DECLARE_ENUM_AS_BITMASK(FPClassTest, /* LargestValue */ fcPosInf);
      | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~
[43/4396] Building CXX object lib/Supp...iles/LLVMSupport.dir/CommandLine.cpp.o

20 months ago[Clang] Fix clang_rt tests when LLVM_ENABLE_PER_TARGET_RUNTIME_DIR is ON
Michael Platings [Thu, 9 Feb 2023 18:07:52 +0000 (18:07 +0000)]
[Clang] Fix clang_rt tests when LLVM_ENABLE_PER_TARGET_RUNTIME_DIR is ON

If clang is part of a toolchain that can locate clang_rt libraries
outside its resource directory and these are built with
LLVM_ENABLE_PER_TARGET_RUNTIME_DIR=ON then the tests would fail because
the library names don't have the arch suffix. This change makes the arch
suffix optional.

Differential Revision: https://reviews.llvm.org/D143666

20 months ago[NFC] Make FPClassTest a bitmask enumeration
Serge Pavlov [Wed, 22 Feb 2023 07:20:04 +0000 (14:20 +0700)]
[NFC] Make FPClassTest a bitmask enumeration

With this change bitwise operations are allowed for FPClassTest
enumeration, it must simplify using this type. Also some functions
changed to get argument of type FPClassTest instead of unsigned.

Differential Revision: https://reviews.llvm.org/D144241

20 months ago[mlir][Vector] Add support for high-order masked contractions
Diego Caballero [Wed, 22 Feb 2023 06:53:37 +0000 (06:53 +0000)]
[mlir][Vector] Add support for high-order masked contractions

This patch adds support for masked vector.contract ops that needs to be
decomposed using the ContractionOpLowering pattern. It just slices the
mask according to the rest of the lowering.

Reviewed By: ThomasRaoux

Differential Revision: https://reviews.llvm.org/D144427

20 months ago[mlir][Vector] Add folding for masked reductions and vector.mask
Diego Caballero [Wed, 22 Feb 2023 06:37:38 +0000 (06:37 +0000)]
[mlir][Vector] Add folding for masked reductions and vector.mask

This patch adds support for folding trivial masked reductions and
multi-reductions (e.g., multi-reductions with only parallel dims,
reductions of a single element, etc.). To support those foldings in
a composable way we also add support for folding different flavors of
empty vector.mask opertions.

Reviewed By: ThomasRaoux

Differential Revision: https://reviews.llvm.org/D144414

20 months ago[mlir][linalg] Allow constant exprs in pack/unpack propagation through elementwise
Quinn Dawkins [Tue, 21 Feb 2023 01:52:49 +0000 (20:52 -0500)]
[mlir][linalg] Allow constant exprs in pack/unpack propagation through elementwise

The pack/unpack propagation patterns currently assume all map results
for non-scalar arguments are AffineDimExprs, leading to crashes when the
input operand being packed has constant expressions.

Differential Revision: https://reviews.llvm.org/D144443

20 months ago[mlir][Analysis] Guard data flow analysis from no block function
Kai Sasaki [Wed, 22 Feb 2023 06:09:09 +0000 (15:09 +0900)]
[mlir][Analysis] Guard data flow analysis from no block function

Foo analysis for testing the data flow analysis does not support the region without any block. Although that analysis is assumed to be used for testing purpose, it is generally better to be explicit about the scope the framework supports.

The original issue was reported here.
https://github.com/llvm/llvm-project/issues/60580

Reviewed By: springerm

Differential Revision: https://reviews.llvm.org/D144359

20 months ago[X86][NFC] Remove useless code in X86MCCodeEmitter.cpp
Shengchen Kan [Wed, 22 Feb 2023 05:59:28 +0000 (13:59 +0800)]
[X86][NFC] Remove useless code in X86MCCodeEmitter.cpp

Neither the switch in A nor the switch in B is fully covered, so we
don't need write an impossible format there.

20 months ago[CMake] Unify llvm_check_linker_flag and llvm_check_compiler_linker_flag
Petr Hosek [Wed, 1 Feb 2023 08:25:35 +0000 (08:25 +0000)]
[CMake] Unify llvm_check_linker_flag and llvm_check_compiler_linker_flag

These have the same purposes but two different implementations.
llvm_check_compiler_linker_flag uses CMAKE_REQUIRED_FLAGS which affects
flags used both for compilation and linking which is problematic because
some flags may be link-only and trigger unused argument warning when set
during compilation. llvm_check_linker_flag does not have this issue so
we chose it as the prevailaing implementation.

Differential Revision: https://reviews.llvm.org/D143052

20 months ago[ORC] Add an ExecutionSession::getTargetTriple convenience function.
Lang Hames [Wed, 22 Feb 2023 03:09:34 +0000 (19:09 -0800)]
[ORC] Add an ExecutionSession::getTargetTriple convenience function.

Forwards to ExecutorProcessControl::getTargetTriple, and saves clients the
trouble of spelling 'getExecutorProcessControl()' everywhere.

20 months ago[flang] Run MathToLLVM conversion as part of math conversion PM.
Slava Zakharin [Tue, 21 Feb 2023 22:55:00 +0000 (14:55 -0800)]
[flang] Run MathToLLVM conversion as part of math conversion PM.

We used to specify benefit=0 for MathToLibm patterns so that
MathToLLVM patterns are more preferable. D144450 removed
the benefit parameter, so we have to run MathToLLVM separately now.

This resolves Intrinsics/math-codegen failure caused by D144450.

Differential Revision: https://reviews.llvm.org/D144523

20 months ago[Support] Fix data race with “safe libc++”
Vitaly Buka [Wed, 22 Feb 2023 02:28:06 +0000 (18:28 -0800)]
[Support] Fix data race with “safe libc++”

Otherwise NFC.

20 months ago[ORC-RT] Remove the '__' prefix from the ORC runtime's public API.
Lang Hames [Wed, 22 Feb 2023 02:18:58 +0000 (18:18 -0800)]
[ORC-RT] Remove the '__' prefix from the ORC runtime's public API.

The '__' prefix should only be used for the parts of the ORC runtime that
implement compiler / loader runtime details (e.g. ORC-RT's __tlv_get_addr
implementations).

This patch only fixes the public API. Future changes will fix internal names.

20 months ago[NFC] Use single quotes for single char output during `printPipline`
Liren Peng [Wed, 22 Feb 2023 02:30:13 +0000 (02:30 +0000)]
[NFC] Use single quotes for single char output during `printPipline`

Reviewed By: arsenm

Differential Revision: https://reviews.llvm.org/D144365

20 months ago[Clang][OpenMP] Update tests using update_cc_test_checks.py
Shilei Tian [Wed, 22 Feb 2023 02:09:20 +0000 (21:09 -0500)]
[Clang][OpenMP] Update tests using update_cc_test_checks.py

Make preparation for other patches

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D144320

20 months ago[WebAssembly] Add more combine pattern for vector shift
Jun Ma [Mon, 20 Feb 2023 05:54:20 +0000 (13:54 +0800)]
[WebAssembly] Add more combine pattern for vector shift

After change with D144169, the codegen generates redundant instructions
like and and wrap. This fixes it.

Differential Revision: https://reviews.llvm.org/D144360

20 months ago[mlir][Vector] Support masking for more contraction flavors
Diego Caballero [Wed, 22 Feb 2023 01:20:10 +0000 (01:20 +0000)]
[mlir][Vector] Support masking for more contraction flavors

This patch adds masking support for more contraction flavors including those
with any combiner operation (add, mul, min, max, and, or, etc.) and
regular matmul contractions.

Combiner operations that are performing vertical reductions (and,
therefore, they are not represented with a horizontal reduction
operation) can be executed unmasked. However, the previous value of
the accumulator must be propagated for lanes that shouldn't accumulate.
We achieve this goal by introducing a select operation after the
accumulator to choose between the combined and the previous accumulator
value. This design decision is made to avoid introducing masking support
to all the arithmetic and logical operations in the Arith dialect. VP
intrinsics do not support pass-thru values either so we would have to
generate the same sequence when lowering to LLVM. The op + select
pattern is peepholed by some backend with native masking support for those
operations.

Consequently, this patch removes masking support from the vector.fma
operation to follow the same approach for all the combiner operations.

Reviewed By: ThomasRaoux

Differential Revision: https://reviews.llvm.org/D144239

20 months agoRevert "[Clang][OpenMP] Update tests using update_cc_test_checks.py"
Shilei Tian [Wed, 22 Feb 2023 01:28:00 +0000 (20:28 -0500)]
Revert "[Clang][OpenMP] Update tests using update_cc_test_checks.py"

This reverts commit 61faf261506f93819e48f050f318f96452831f92 as it makes buildbot unhappy.

20 months ago[PowerPC][NFC] add const-splat-array-init.ll
Ting Wang [Wed, 22 Feb 2023 01:24:12 +0000 (20:24 -0500)]
[PowerPC][NFC] add const-splat-array-init.ll

Add test case and will show combiner can improve these.

Reviewed By: lkail

Differential Revision: https://reviews.llvm.org/D144235

20 months ago[X86][NFC] Refactor X86InstrArithmetic.td by class
Wang, Xin10 [Wed, 22 Feb 2023 00:41:12 +0000 (08:41 +0800)]
[X86][NFC] Refactor X86InstrArithmetic.td by class

1. Extract the common code of some instructions into a class to reduce duplication
2. Refine some comments to the make the description of the class clearer

By this way, the records defined here will be consistent and easier to
maintain, I think.

Reviewed By: skan

Differential Revision: https://reviews.llvm.org/D144154

20 months ago[Clang][OpenMP] Update tests using update_cc_test_checks.py
Shilei Tian [Wed, 22 Feb 2023 01:09:09 +0000 (20:09 -0500)]
[Clang][OpenMP] Update tests using update_cc_test_checks.py

Make preparation for other patches

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D144320

20 months ago[flang] Avoid linking step for failing LIT tests added in D143301.
Slava Zakharin [Wed, 22 Feb 2023 01:03:53 +0000 (17:03 -0800)]
[flang] Avoid linking step for failing LIT tests added in D143301.

20 months ago[ORC-RT][MachO] Use 'UseCallbackStyleUnwindInfo' flag, rather than re-test.
Lang Hames [Sat, 18 Feb 2023 04:28:07 +0000 (20:28 -0800)]
[ORC-RT][MachO] Use 'UseCallbackStyleUnwindInfo' flag, rather than re-test.

The UseCallbackStyleUnwindInfo flag already captures the original conditional,
and using the flag everywhere makes it easier to switch registration styles
when debugging.

20 months ago[Test] Set gitattributes for line endings in tapi inputs
Justin Bogner [Tue, 21 Feb 2023 22:37:18 +0000 (14:37 -0800)]
[Test] Set gitattributes for line endings in tapi inputs

LLVM's parsing of .tbd files errors out with "unsupported file type"
if the tbd files have CRLF line endings. Add gitattributes for the
inputs to avoid test failures on windows.

Differential Revision: https://reviews.llvm.org/D144527

20 months ago[Test] Invoke GetErrcMessages from config-ix instead of llvm/CMakeLists
Justin Bogner [Tue, 21 Feb 2023 22:28:04 +0000 (14:28 -0800)]
[Test] Invoke GetErrcMessages from config-ix instead of llvm/CMakeLists

Since GetErrcMessages uses cmake's `try_run` mechanism it's sensitive
to changes to CMAKE_CXX_FLAGS, so we move it into config-ix with the
similar flag-sensitive configuration. This makes it run before
HandleLLVMOptions and avoids issues with LLVM_ENABLE_WERROR and other
configuration that manipulate CMAKE_CXX_FLAGS.

Differential Revision: https://reviews.llvm.org/D144526

20 months ago[runtimes] Remove unused functions from Handle{Libcxx,Libunwind}Flags.cmake
Nikolas Klauser [Mon, 20 Feb 2023 16:12:09 +0000 (17:12 +0100)]
[runtimes] Remove unused functions from Handle{Libcxx,Libunwind}Flags.cmake

Reviewed By: phosek, #libunwind, #libc

Spies: libcxx-commits

Differential Revision: https://reviews.llvm.org/D144400

20 months ago[PowerPC] Include vector bool and pixel when emitting lax warning
Maryam Moghadas [Thu, 2 Feb 2023 18:31:44 +0000 (18:31 +0000)]
[PowerPC] Include vector bool and pixel when emitting lax warning

This patch is to fix some missing lax-vector-conversion warnings including
cases that involve vector bool and vector pixel, also to fix the vector
compatibility check for the warnings.

Differential Revision: https://reviews.llvm.org/D143210

20 months ago[flang] Handle unsupported warning flags
Ethan Luis McDonough [Tue, 21 Feb 2023 22:09:34 +0000 (16:09 -0600)]
[flang] Handle unsupported warning flags

This PR makes flang emit a warning when the user passes an unsupported gfortran warning flag in as a CLI arg.  This PR also checks each `-W` argument instead of just looking at the last one passed in.

Reviewed By: awarzynski

Differential Revision: https://reviews.llvm.org/D143301

20 months ago[InstSimplify] enhance simplifyWithOpReplaced() to allow more 'select' removal
Sanjay Patel [Tue, 21 Feb 2023 20:03:32 +0000 (15:03 -0500)]
[InstSimplify] enhance simplifyWithOpReplaced() to allow more 'select' removal

This is a generalization of a suggestion from issue #60799
that allows removing a redundant guard of an input value
via icmp+select. It should also solve issue #60801.

This only comes into play for a select with an equality
condition where we are trying to substitute a constant into
the false arm of a select. (A 'true' select arm substitution
allows "refinement", so it is not on this code path.)

The constant must be the same in the compare and the select,
and it must be a "binop absorber" (X op C = C). That query
currently includes 'or', 'and', and 'mul', so there are tests
for all of those opcodes.

We then use "impliesPoison" on the false arm binop and the
original "Op" to be replaced to ensure that the select is not
actually blocking poison from leaking. That could be
potentially expensive as we recursively test each operand, but
it is currently limited to a depth of 2. That's enough to catch
our motivating cases, but probably nothing more complicated
(although that seems unlikely).

I don't know how to generalize a proof for Alive2 for this, but
here's a positive and negative test example to help illustrate
the subtle logic differences of poison/undef propagation:
https://alive2.llvm.org/ce/z/Sz5K-c

Differential Revision: https://reviews.llvm.org/D144493

20 months ago[InstCombine] Don't fold freeze poison when it's used in shufflevector
ManuelJBrito [Tue, 21 Feb 2023 21:58:27 +0000 (21:58 +0000)]
[InstCombine] Don't fold freeze poison when it's used in shufflevector

With this patch freeze undef/poison will no longer be folded into a constant if it's used as a
vector operand in a shufflevector.

Differential Revision: https://reviews.llvm.org/D143593