Sanjay Patel [Wed, 29 Jun 2022 14:43:55 +0000 (10:43 -0400)]
[InstCombine] add tests for (pow2 >> X) <u C; NFC
Adapted from
af5e64df728f498ea9c6
Corentin Jabot [Tue, 28 Jun 2022 21:48:48 +0000 (23:48 +0200)]
[Clang] Rename StringLiteral::isAscii() => isOrdinary() [NFC]
"Ascii" StringLiteral instances are actually narrow strings
that are UTF-8 encoded and do not have an encoding prefix.
(UTF8 StringLiteral are also UTF-8 encoded strings, but with
the u8 prefix.
To avoid possible confusion both with actuall ASCII strings,
and with future works extending the set of literal encodings
supported by clang, this rename StringLiteral::isAscii() to
isOrdinary(), matching C++ standard terminology.
Reviewed By: aaron.ballman
Differential Revision: https://reviews.llvm.org/
D128762
Walter Erquinigo [Fri, 24 Jun 2022 21:02:47 +0000 (14:02 -0700)]
[trace] Make events first class items in the trace cursor and rework errors
We want to include events with metadata, like context switches, and this
requires the API to handle events with payloads (e.g. information about
such context switches). Besides this, we want to support multiple
similar events between two consecutive instructions, like multiple
context switches. However, the current implementation is not good for this because
we are defining events as bitmask enums associated with specific
instructions. Thus, we need to decouple instructions from events and
make events actual items in the trace, just like instructions and
errors.
- Add accessors in the TraceCursor to know if an item is an event or not
- Modify from the TraceDumper all the way to DecodedThread to support
- Renamed the paused event to disabled.
- Improved the tsc handling logic. I was using an API for getting the tsc from libipt, but that was an overkill that should be used when not processing events manually, but as we are already processing events, we can more easily get the tscs.
event items. Fortunately this simplified many things
- As part of this refactor, I also fixed and long stating issue, which is that some non decoding errors were being inserted in the decoded thread. I changed this so that TraceIntelPT::Decode returns an error if the decoder couldn't be set up proplerly. Then, errors within a trace are actual anomalies found in between instrutions.
All test pass
Differential Revision: https://reviews.llvm.org/
D128576
Craig Topper [Wed, 29 Jun 2022 15:51:10 +0000 (08:51 -0700)]
[RISCV] Select (srl (and X, C2) as (slli (srliw X, C3), C3-C).
If C2 has 32 leading zeros and C3 trailing zeros.
Matt Arsenault [Sat, 25 Jun 2022 14:25:11 +0000 (10:25 -0400)]
AMDGPU: Make packed 32-bit instructions rematerializable
Matt Arsenault [Sat, 25 Jun 2022 17:24:31 +0000 (13:24 -0400)]
AMDGPU: Make 16-bit pk instructions rematerializable
Matt Arsenault [Sat, 25 Jun 2022 16:56:51 +0000 (12:56 -0400)]
AMDGPU: Add rematerialize tests for 16-bit pk instructions
Michał Górny [Wed, 29 Jun 2022 15:47:53 +0000 (17:47 +0200)]
[lldb] [test] Avoid relying on signos in other fork tests
Sponsored by: The FreeBSD Foundation
Florian Hahn [Wed, 29 Jun 2022 15:34:11 +0000 (16:34 +0100)]
[LV] Add test case showing dead recipe blocking region merging.
Matt Arsenault [Sat, 25 Jun 2022 15:39:59 +0000 (11:39 -0400)]
AMDGPU: Mark more instructions as rematerializable
D106023 excluded 16-bit instructions from rematerialization, with the
justification that we can't rematerialize instructions that preserve
the high bits (plus the instructions which do are a confusing mess
between different subtargets). This doesn't make sense to me as a
problem since cases where we would rely on the high bit behavior would
still need to be represented as a register value constraint with a
tied operand. It's not a hidden side effect and should still be
rematerializable.
Matt Arsenault [Sat, 25 Jun 2022 13:53:31 +0000 (09:53 -0400)]
AMDGPU: Add more rematerialization tests for 16-bit instructions
Nikita Popov [Wed, 29 Jun 2022 15:17:20 +0000 (17:17 +0200)]
[VectorCombine] Avoid ConstantExpr::get() (NFC)
Use IRBuilder APIs instead, which will still constant fold.
Guillaume Chatelet [Wed, 29 Jun 2022 15:12:24 +0000 (15:12 +0000)]
Revert "[libc][test] Remove dependency on sstream in algorithm_test.cpp"
This reverts commit
292b281caf8c3750cc0796b971af2ca24236a926.
Guillaume Chatelet [Wed, 29 Jun 2022 12:28:00 +0000 (12:28 +0000)]
[libc][test] Remove dependency on sstream in algorithm_test.cpp
Bots have been failing in full build mode because ´#include <sstream>´ would pull pthread which is not possible when code is compiled with ´-freestanding´.
Differential Revision: https://reviews.llvm.org/
D128809
Nicolas Vasilache [Wed, 29 Jun 2022 14:51:41 +0000 (07:51 -0700)]
[mlir][Tensor] Improve documentation of verification behavior of InsertSliceOp.
Nikita Popov [Wed, 29 Jun 2022 14:41:38 +0000 (16:41 +0200)]
[JumpThreading] Avoid ConstantExpr::get() (NFCI)
This code requires the result to be an UndefValue/ConstantInt
anyway (checked by getKnownConstant), so we are only interested
in the case where this folds.
Nicolas Vasilache [Wed, 29 Jun 2022 11:16:49 +0000 (04:16 -0700)]
[mlir][Tensor][NFC] Better document rank-reducing behavior of ExtractSliceOp and cleanup
Lei Huang [Tue, 21 Jun 2022 16:54:40 +0000 (11:54 -0500)]
[PowerPC] Fix signatures for vec_replace_unaligned builtin
``vec_replace_unaligned`` is meant to return vuc to emphasize that elements
are being inserted on unnatural boundaries.
Reviewed By: amyk, quinnp
Differential Revision: https://reviews.llvm.org/
D128288
Matt Arsenault [Mon, 6 Jun 2022 21:19:03 +0000 (17:19 -0400)]
AMDGPU: Use isMeta flags on pseudoinstructions
Nikita Popov [Wed, 29 Jun 2022 14:06:49 +0000 (16:06 +0200)]
[InstCombine] Avoid some calls to ConstantExpr::get() (NFCI)
Replace some calls to ConstantExpr::get() with IRBuilder APIs
(which will also constant fold if possible).
Michał Górny [Wed, 29 Jun 2022 14:04:50 +0000 (16:04 +0200)]
[lldb] [test] Un-XFAIL fork tests on arm as well
Sponsored by: The FreeBSD Foundation
Stephen Long [Tue, 28 Jun 2022 13:45:18 +0000 (06:45 -0700)]
[clang-cl] Handle some pragma alloc_text corner cases handled by MSVC
MSVC's pragma alloc_text accepts a function that was redeclared in
a non extern-C context if the previous declaration was in an extern-C
context. i.e.
```
extern "C" { static void f(); }
static void f();
```
MSVC's pragma alloc_text also rejects non-functions.
Reviewed By: hans
Differential Revision: https://reviews.llvm.org/
D128649
Jay Foad [Wed, 29 Jun 2022 13:43:01 +0000 (14:43 +0100)]
[AMDGPU] Remove FIXMEs that were resolved by D30134
Michał Górny [Wed, 29 Jun 2022 05:18:54 +0000 (07:18 +0200)]
[lldb] [test] Use raise(SIGSTOP) instead of trap in fork tests
Replace the use of "trap" with a new "stop" command in fork tests,
that maps to `raise(SIGSTOP)`. Since traps do not increment PC on some
architectures (notably ARM), using traps would require special logic
to increment it while testing. Using SIGSTOP avoids the problem
and is probably more logical, given that the purpose of the "trap"s
was to simply stop the inferior at a synchronization point. This fixes
tests on AArch64 (and possibly ARM, I'll update XFAILs when it is
confirmed by the buildbot).
Sponsored by: The FreeBSD Foundation
Differential Revision: https://reviews.llvm.org/
D128780
Joseph Huber [Tue, 28 Jun 2022 19:21:03 +0000 (15:21 -0400)]
[CUDA] Stop adding CUDA features twice
We currently call the `addNVPTXFeatures` function in two places, inside
of the CUDA Toolchain and inside of Clang in the standard entry point.
We normally add features to the job in Clang, so the call inside of the
CUDA toolchain is redundant and results in `+ptx` features being added.
Since we remove this call, we no longer will have a cached CUDA
installation so we will usually create it twice.
Reviewed By: tra
Differential Revision: https://reviews.llvm.org/
D128752
Haojian Wu [Wed, 29 Jun 2022 13:10:39 +0000 (15:10 +0200)]
[pseudo] Update the cxx.bnf path in comments to reflect the new
location, NFC
Corentin Jabot [Fri, 20 Aug 2021 15:52:28 +0000 (17:52 +0200)]
Improve handling of static assert messages.
Instead of dumping the string literal (which
quotes it and escape every non-ascii symbol),
we can use the content of the string when it is a
8 byte string.
Wide, UTF-8/UTF-16/32 strings are still completely
escaped, until we clarify how these entities should
behave (cf https://wg21.link/p2361).
`FormatDiagnostic` is modified to escape
non printable characters and invalid UTF-8.
This ensures that unicode characters, spaces and new
lines are properly rendered in static messages.
This make clang more consistent with other implementation
and fixes this tweet
https://twitter.com/jfbastien/status/
1298307325443231744 :)
Of note, `PaddingChecker` did print out new lines that were
later removed by the diagnostic printing code.
To be consistent with its tests, the new lines are removed
from the diagnostic.
Unicode tables updated to both use the Unicode definitions
and the Unicode 14.0 data.
U+00AD SOFT HYPHEN is still considered a print character
to match existing practices in terminals, in addition of
being considered a formatting character as per Unicode.
Reviewed By: aaron.ballman, #clang-language-wg
Differential Revision: https://reviews.llvm.org/
D108469
Valentin Clement [Wed, 29 Jun 2022 12:46:19 +0000 (05:46 -0700)]
[Flang] fix some types in error message
Pavel Labath [Wed, 29 Jun 2022 12:48:54 +0000 (14:48 +0200)]
[lldb] Replace linux/uio.h with sys/uio.h in NativeRegisterContextLinux_s390x
Fixes PR56280.
Max Kazantsev [Wed, 29 Jun 2022 12:30:52 +0000 (19:30 +0700)]
[Test] Add XFAIL test for PR56243
This test demonstrates how sinking down gc.relocate may lead to breach
of LCSSA form by tokens and, consecutively, end up with SSA breach by
LoopSimplifyCFG which creates fake edges and is unable to update missing
LCSSA phis for tokens used outside of the loop.
Nikita Popov [Wed, 29 Jun 2022 12:16:30 +0000 (14:16 +0200)]
[Bitcode] Restore bitcast expression auto-upgrade
Restore the autoupgrade from bitcast to ptrtoint+inttoptr, which
was lost as part of
D127729.
This fixes the backwards compatibility issue noted in:
https://reviews.llvm.org/
D127729#inline-
1236519
Nikita Popov [Wed, 29 Jun 2022 12:34:02 +0000 (14:34 +0200)]
[SCEV] Don't create udiv constant expression (NFC)
Work on APInts to make it clear that this will not create a
constant expression.
This code path is not reached if the RHS is zero.
Valentin Clement [Wed, 29 Jun 2022 12:21:00 +0000 (14:21 +0200)]
[flang][NFC] Cosmetic changes to make the file more homogenous
This patch is part of the upstreaming effort from fir-dev branch.
Reviewed By: jeanPerier
Differential Revision: https://reviews.llvm.org/
D128799
Valentin Clement [Wed, 29 Jun 2022 12:19:56 +0000 (14:19 +0200)]
[flang] Switch attr name to bindc_name
This patch is part of the upstreaming effort from fir-dev branch.
Reviewed By: jeanPerier
Differential Revision: https://reviews.llvm.org/
D128788
Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>
Mehdi Amini [Sun, 12 Jun 2022 18:12:26 +0000 (18:12 +0000)]
Apply clang-tidy fixes for readability-simplify-boolean-expr in ViewLikeInterface.cpp (NFC)
Mehdi Amini [Sun, 12 Jun 2022 18:08:50 +0000 (18:08 +0000)]
Apply clang-tidy fixes for readability-identifier-naming in Float16bits.cpp (NFC)
Bradley Smith [Fri, 17 Jun 2022 10:45:27 +0000 (10:45 +0000)]
[AArch64][SVE] Match (add x (urshr/srshr y c)) -> ursra/srsra x y c
Differential Revision: https://reviews.llvm.org/
D128447
Kiran Chandramohan [Wed, 29 Jun 2022 11:59:52 +0000 (11:59 +0000)]
[Flang][OpenMP] Remove TODO for shared and two values of default clause
Shared is the default behaviour in the IR, so no handling is required.
Default clause with shared or none do not require any handling since
Shared is the default behaviour in the IR and None is only required
for semantic checks.
This patch is carved out from
D123930 to remove couple of false TODOs.
Reviewed By: peixin, shraiysh
Differential Revision: https://reviews.llvm.org/
D128797
Co-authored-by: Nimish Mishra <neelam.nimish@gmail.com>
Arjun P [Wed, 29 Jun 2022 12:02:52 +0000 (13:02 +0100)]
[MLIR][Preburger] fix typo covertVarKind -> convertVarKind
Also update parameter names in the implementation file to match the header.
Kadir Cetinkaya [Wed, 15 Jun 2022 13:44:09 +0000 (15:44 +0200)]
[clangd] Support multiline semantic tokens
Per LSP, multiline tokens should be handled as if they end at the end
of the line starting the token (there's also a capability to enable them, but
that's an adventure for a different day).
Fixes https://github.com/clangd/clangd/issues/1145
Differential Revision: https://reviews.llvm.org/
D127856
Michał Górny [Wed, 29 Jun 2022 11:33:43 +0000 (13:33 +0200)]
[lldb] [test] Fix variable overwrite in non-stop fork tests
Thanks to Pavel Labath for noticing the mistake in:
https://reviews.llvm.org/
D128638#
3618039
Sponsored by: The FreeBSD Foundation
Arjun P [Wed, 29 Jun 2022 11:21:48 +0000 (12:21 +0100)]
Revert clang-tidy fixes for readability-simplify-boolean-expr and add NOLINT
The original code is more readable because the goal is to check if the given
value does *not* lie in the range. It is harder to understand this by
reading the rewritten code.
Reviewed By: mehdi_amini
Differential Revision: https://reviews.llvm.org/
D128753
Nikita Popov [Wed, 29 Jun 2022 11:04:15 +0000 (13:04 +0200)]
[IRBuilder] Migrate div/rem to use fold infrastructure
Migrate udiv, sdiv, urem, and srem to use the FoldXYZ rather than
the CreateXYZ infrastructure.
isuckatcs [Wed, 29 Jun 2022 10:50:24 +0000 (12:50 +0200)]
[analyzer] Fix BindingDecl evaluation for reference types
The case when the bound variable is reference type in a
BindingDecl wasn't handled, which lead to false positives.
Differential Revision: https://reviews.llvm.org/
D128716
Jay Foad [Tue, 28 Jun 2022 16:05:21 +0000 (17:05 +0100)]
[AMDGPU] Remove FIXME that was resolved by D30212
Benjamin Kramer [Wed, 29 Jun 2022 10:32:32 +0000 (12:32 +0200)]
[Presburger] Cheat around old versions of clang not doing NRVO when there's a derived-to-base cast in the way
Should be NFC. We can just do the base conversion manually and avoid
warnings about it. Clang before Clang 13 didn't implement P1825 and
complains:
mlir/lib/Analysis/Presburger/IntegerRelation.cpp:226:10: warning: local variable 'result' will be copied
despite being returned by name [-Wreturn-std-move]
return result;
^~~~~~
mlir/lib/Analysis/Presburger/IntegerRelation.cpp:226:10: note: call 'std::move' explicitly to avoid copying
return result;
^~~~~~
std::move(result)
Valentin Clement [Wed, 29 Jun 2022 10:21:26 +0000 (03:21 -0700)]
[flang][NFC] Fix some typos
Dmitry Preobrazhensky [Mon, 27 Jun 2022 15:55:53 +0000 (18:55 +0300)]
[AMDGPU][GFX90a][DOC][NFC] Update assembler syntax description
Summary of changes:
- Update MUBUF lds syntax (see https://reviews.llvm.org/
D124485).
- Update SMEM syntax (see https://reviews.llvm.org/
D127314).
- Enable src0=literal for v_madak*, v_madmk* (see https://reviews.llvm.org/
D111067).
- Correct src0 operands of v_accvgpr_write_b32.
- Correct description of s_getreg/s_setreg (add TBA/TMA).
- Remove SYSMSG_OP_HOST_TRAP_ACK message.
- Minor bug fixing and improvements.
Florian Hahn [Wed, 29 Jun 2022 10:29:31 +0000 (11:29 +0100)]
[SCEV] Construct SCEV iteratively.
This patch updates SCEV construction to work iteratively instead of recursively
in most cases. It resolves stack overflow issues when trying to construct SCEVs
for certain inputs, e.g. PR45201.
The basic approach is to to use a worklist to queue operands of V which
need to be created before V. To do so, the current patch adds a
getOperandsToCreate function which collects the operands SCEV
construction depends on for a given value. This is a slight duplication
with createSCEV.
At the moment, SCEVs for phis are still created recursively.
Fixes #32078, #42594, #44546, #49293, #49599, #55333, #55511
Reviewed By: nikic
Differential Revision: https://reviews.llvm.org/
D114650
Anton Afanasyev [Sat, 25 Jun 2022 11:02:32 +0000 (14:02 +0300)]
[test][RISCV][Driver] Precommit tests for
D128625
Florian Hahn [Wed, 29 Jun 2022 10:25:19 +0000 (11:25 +0100)]
[BasicAA] Add test coverage from D76194.
Florian Hahn [Wed, 29 Jun 2022 10:25:13 +0000 (11:25 +0100)]
[BasicAA] Convert test to use opaque pointers.
Using opaque pointers simplifies the tests quite a bit.
Christian Sigg [Wed, 29 Jun 2022 10:24:28 +0000 (12:24 +0200)]
Revert "Add default copy and move c'tor/assignment to PresburgerRelation."
This reverts commit
2cd468ef1561e9f4c888e5dde821f86e427a75a1.
Nicolai Hähnle [Tue, 28 Jun 2022 17:14:09 +0000 (19:14 +0200)]
StandardInstrumentation: print verifier output to errs
Enabling the verifiers is not very helpful if their output is
suppressed beyond the fatal error.
Differential Revision: https://reviews.llvm.org/
D128743
Christian Sigg [Wed, 29 Jun 2022 08:40:34 +0000 (10:40 +0200)]
Add default copy and move c'tor/assignment to PresburgerRelation.
Differential Revision: https://reviews.llvm.org/
D128789
Jolanta Jensen [Wed, 15 Jun 2022 16:11:28 +0000 (17:11 +0100)]
[NFC] Switch FloatModeKind enum class to use bitmask enums
Using bitmask enums simplifies and clarifies the code.
Differential Revision: https://reviews.llvm.org/
D128182
Nikita Popov [Wed, 29 Jun 2022 09:51:22 +0000 (11:51 +0200)]
[SROA] Don't create constant expressions (NFC)
Use IRBuilder instead, which will fold these. Just to clarify
that this does not actually create any udiv expression.
Nikita Popov [Wed, 29 Jun 2022 09:24:16 +0000 (11:24 +0200)]
[SCCP] Regenerate test checks (NFC)
lewuathe [Wed, 29 Jun 2022 08:32:23 +0000 (10:32 +0200)]
[mlir][complex] Canonicalization for consecutive complex.neg
Consecutive complex.neg are redundant so that we can canonicalize them to the original operands.
Reviewed By: pifon2a
Differential Revision: https://reviews.llvm.org/
D128781
Michael Daniels [Wed, 29 Jun 2022 09:00:45 +0000 (11:00 +0200)]
[lldb] fix stepping through POSIX trampolines
The DynamicLoaderPOSIXDYLD::GetStepThroughTrampolinePlan() function was
doing the symbol lookup using the demangled name. This stopped working
with https://reviews.llvm.org/
D118814. To get things working again, just
use the mangled name for the lookup instead.
Reviewed By: labath
Differential Revision: https://reviews.llvm.org/
D127999
Muhammad Omair Javaid [Wed, 29 Jun 2022 08:56:00 +0000 (12:56 +0400)]
[LLDB] Add PDB/calling-conventions.test for Arm/Windows
This patch renames PDB/calling-conventions.test to calling-conventions-x86.test.
Also restrict it to run only for target-x86*.
This patch also adds a arm specific test PDB/calling-conventions-arm.test which
tests that x86 specifc calling convention decorators are ignored by Arm compiler.
Reviewed By: labath
Differential Revision: https://reviews.llvm.org/
D128678
Muhammad Omair Javaid [Wed, 29 Jun 2022 08:50:41 +0000 (12:50 +0400)]
[LLDB] Add Arm64 CodeView to LLDB regnum mapping
This patch writes a mapping structure for converting CodeView Arm64 register numbers to LLDB Arm64 regnums.
This fixes various symbols and variable location test failures on AArch64/Windows.
Reviewed By: DavidSpickett
Differential Revision: https://reviews.llvm.org/
D128221
Muhammad Omair Javaid [Wed, 29 Jun 2022 08:34:56 +0000 (12:34 +0400)]
[LLDB] Fix PDB/pointers.test for 32bit Arm/Windows
PDB/pointers.test was orignally written for 32bit x86 keeping in mind
__cdecl and __stdcall calling conventions which does name mangling for
example like adding "_" underscore before function name.
This is only x86 specific but purpose of pointers.test is NOT to test
calling convention.
I am have made a few minor changes to this test which will make it pass
when run on Windows/Arm platform.
Reviewed By: mstorsjo
Differential Revision: https://reviews.llvm.org/
D128668
Nikita Popov [Wed, 29 Jun 2022 08:42:46 +0000 (10:42 +0200)]
[ConstExpr] Remove more leftovers of extractvalue expression (NFC)
Remove some leftover bits of extractvalue handling after the
removal in
D125795.
Fangrui Song [Wed, 29 Jun 2022 08:06:01 +0000 (01:06 -0700)]
[flang][test] Remove RUN COMMANDS/EXPECTED OUTPUT/INPUT markers from test/Driver
Uses of these markers are contrary to all other llvm-project subprojects
and carry no information for people who know the basics of a lit test.
I understand that there is an argument "this makes beginners get started
quickly" but I am unsure whether this is strong enough to deviate from
all other projects. https://llvm.org/docs/TestingGuide.html covers the
basics. Actually, some contributors were confused by the markers.
Differential Revision: https://reviews.llvm.org/
D128763
Nikita Popov [Thu, 16 Jun 2022 14:39:54 +0000 (16:39 +0200)]
[AsmPrinter] Further restrict expressions supported in global initializers
lowerConstant() currently accepts a number of constant expressions
which have corresponding MC expressions, but which cannot be
evaluated as a relocatable expression (unless the operands are
constant, in which case we'll just fold the expression to a constant).
The motivation here is to clarify which constant expressions are
really needed for https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179,
and in particular clarify that we do not need to support any
division expressions, which are particularly problematic.
Differential Revision: https://reviews.llvm.org/
D127972
Amir Ayupov [Wed, 29 Jun 2022 07:59:01 +0000 (00:59 -0700)]
[TableGen] Add a knob for MCOperandInfo expansion in gen-instr-info
Control the MCOperandInfo expansion with `-instr-info-expand-mi-operand-info`.
For X86, this would make it possible to see memory operand type e.g.:
```
/* MOV8rm */
GR8, i8mem,
/* MOV8rm_NOREX */
GR8_NOREX, i8mem_NOREX,
```
The intended use is a follow-up diff
D126116 (`getMemOperandSize`).
Reviewed By: skan
Differential Revision: https://reviews.llvm.org/
D127932
Andrzej Warzynski [Wed, 22 Jun 2022 10:29:57 +0000 (10:29 +0000)]
[clang][flang] Disable defaulting to `-fpie` for LLVM Flang
In, https://reviews.llvm.org/
D120305, CLANG_DEFAULT_PIE_ON_LINUX was set
to `On` by default. However, neither `-fpie` nor `-fpic` are currently
supported in LLVM Flang. Hence, in this patch the behaviour controlled
with CLANG_DEFAULT_PIE_ON_LINUX is refined not to apply to Flang.
Another way to look at this is that CLANG_DEFAULT_PIE_ON_LINUX is
currently affecting both Clang and Flang. IIUC, the intention for this
CMake variable has always been to only affect Clang. This patch makes
sure that that's the case.
Without this change, you might see errors like this on X86_64:
```
/usr/bin/ld: main.o: relocation R_X86_64_32 against `.bss' can not be used when making a PIE object; recompile with -fPIC
```
I've not experienced any issues on AArch64. That's probably because on
AArch64 some object files happen to be position independent without
needing -fpie or -fpic.
Differential Revision: https://reviews.llvm.org/
D128333
Pavel Samolysov [Tue, 28 Jun 2022 13:25:54 +0000 (16:25 +0300)]
[ArgPromotion] Remove all the getters and ReplaceCallSite (NFC)
AARGetter is an abstraction over a source of the `AAResults` introduced
to support the legacy pass manager as well as the modern one. Since the
Argument Promotion pass doesn't support the legacy pass manager anymore,
the abstraction is not required and `AAResults` may be used directly.
The instance of the `FunctionAnalysisManager` is passed through the
functions to get all the required analyses just wherever they are
required and do not use the awkward getter callbacks.
The `ReplaceCallSite` parameter was required for the legacy pass manager
only and isn't used anymore, so the parameter has been eliminated.
Differential Revision: https://reviews.llvm.org/
D128727
Pavel Samolysov [Wed, 18 May 2022 08:26:35 +0000 (11:26 +0300)]
[ArgPromotion] Move isDenselyPacked static member (NFC)
The `isDenselyPacked` static member of the `ArgumentPromotionPass` class
is not used in the class itself anymore. The single known user of the
function is in the `AttributorAttributes.cpp` file, so the function has
been moved into the file.
Differential Revision: https://reviews.llvm.org/
D128725
Adrian Kuegel [Wed, 29 Jun 2022 07:22:36 +0000 (09:22 +0200)]
Revert "[mlir][Presburger] Fix warning Wreturn-std-move (NFC)"
This reverts commit
a4070a5e77283912eb29e79efd4fb872fdb9ec90.
It introduced another warning instead.
Huang Zhen-Hong [Wed, 29 Jun 2022 07:09:05 +0000 (15:09 +0800)]
[clang-format] Fix misplacement of `*` in declaration of pointer to struct
Fixes #55810
Differential Revision: https://reviews.llvm.org/
D127873
Jose M Monsalve Diaz [Wed, 22 Jun 2022 15:05:34 +0000 (10:05 -0500)]
[OpenMP] Implementing omp_get_device_num()
This patch implements omp_get_device_num() in the host and the device.
It uses the already existing getDeviceNum in the device config for the device.
And in the host it uses the omp_get_num_devices().
Two simple tests added
Differential Revision: https://reviews.llvm.org/
D128347
luxufan [Fri, 24 Jun 2022 05:18:19 +0000 (13:18 +0800)]
[llvm-readobj][RISCV] Support dumping PT_RISCV_ATTRIBUTES
This patch drops the prefix `PT_RISCV_` when dumping `PT_RISCV_ATTRIBUTES`.
GNU readelf dumps it as `RISCV_ATTRIBUT`. Because GNU readelf uses
something like `%-14.14s` so only the first 14 bytes are printed.
Differential Revision: https://reviews.llvm.org/
D128493
Adrian Kuegel [Wed, 29 Jun 2022 07:10:20 +0000 (09:10 +0200)]
[mlir][Presburger] Fix warning Wreturn-std-move (NFC)
Serguei Katkov [Wed, 29 Jun 2022 04:06:34 +0000 (11:06 +0700)]
[GuardWidening] Add tests showing incorrect behavior of GW.
The first test shows that combineRangeChecks may choose to keep only two poison conditions.
And we cannot do simple arithmetic or logical and in guard.
The second test shows that keeping two poison conditions in the widened guard may allow
execution of side-effect instruction even if just freeze these conditions.
The third test shows that even in simple test we can hoist a poison and even logical and does not help here.
Reviewed By: mkazantsev
Subscribers: llvm-commits
Differential Revision: https://reviews.llvm.org/
D128779
Mark de Wever [Tue, 28 Dec 2021 17:48:04 +0000 (18:48 +0100)]
[libc++][format] Improve pointer formatters.
This changes the implementation of the formatter. Instead of inheriting
from a specialized parser all formatters will use the same generic
parser. This reduces the binary size.
The new parser contains some additional fields only used in the chrono
formatting. Since this doesn't change the size of the parser the fields
are in the generic parser. The parser is designed to fit in 128-bit,
making it cheap to pass by value.
The new format function is a const member function. This isn't required
by the Standard yet, but it will be after LWG-3636 is accepted.
Additionally P2286 adds a formattable concept which requires the member
function to be const qualified in C++23. This paper is likely to be
accepted in the 2022 July plenary.
This is based on
D125606. That commit did the groundwork and did similar
changes for the string formatters.
Depends on
D128139.
Reviewed By: #libc, ldionne
Differential Revision: https://reviews.llvm.org/
D128671
lorenzo chelini [Mon, 27 Jun 2022 16:16:39 +0000 (18:16 +0200)]
[MLIR][Math] Improve docs for round op (NFC)
Reviewed By: antiagainst
Differential Revision: https://reviews.llvm.org/
D128662
esmeyi [Wed, 29 Jun 2022 06:30:13 +0000 (02:30 -0400)]
Pass code-model through Module IR to [llc].
Currently, the code-model specified in IR can't be captured by [llc].
This patch fixes that.
Reviewed By: shchenz, MaskRay
Differential Revision: https://reviews.llvm.org/
D128623
Mark de Wever [Tue, 28 Dec 2021 17:48:04 +0000 (18:48 +0100)]
[libc++][format] Improve integral formatters.
This changes the implementation of the formatter. Instead of inheriting
from a specialized parser all formatters will use the same generic
parser. This reduces the binary size.
The new parser contains some additional fields only used in the chrono
formatting. Since this doesn't change the size of the parser the fields
are in the generic parser. The parser is designed to fit in 128-bit,
making it cheap to pass by value.
The new format function is a const member function. This isn't required
by the Standard yet, but it will be after LWG-3636 is accepted.
Additionally P2286 adds a formattable concept which requires the member
function to be const qualified in C++23. This paper is likely to be
accepted in the 2022 July plenary.
This is based on
D125606. That commit did the groundwork and did similar
changes for the string formatters.
Depends on
D125606
Reviewed By: #libc, ldionne
Differential Revision: https://reviews.llvm.org/
D128139
Michał Górny [Wed, 29 Jun 2022 04:56:32 +0000 (06:56 +0200)]
[lldb] [test] Split TestGdbRemoteFork in two
Split the test that's gotten very long in two, in hope that it will
resolve the arm/aarch64 buildbot failures. Even if it does not, it
could help pinpointing where the problem lies.
Sponsored by: The FreeBSD Foundation
Chuanqi Xu [Wed, 29 Jun 2022 04:48:48 +0000 (12:48 +0800)]
[C++20] [Module] Support reachable definition initially/partially
This patch introduces a new kind of ModuleOwnershipKind as
ReachableWhenImported. This intended the status for reachable described
at: https://eel.is/c++draft/module.reach#3.
Note that this patch is not intended to support all semantics about
reachable semantics. For example, this patch didn't implement discarded
declarations in GMF. (https://eel.is/c++draft/module.global.frag#3).
This fixes: https://bugs.llvm.org/show_bug.cgi?id=52281 and
https://godbolt.org/z/81f3ocjfW.
Reviewed By: rsmith, iains
Differential Revision: https://reviews.llvm.org/
D113545
Chuanqi Xu [Wed, 29 Jun 2022 04:43:26 +0000 (12:43 +0800)]
Revert "[C++20] [Modules] Implement Reachable initiallly"
This reverts commit
a223ba0a697c1598b434cf2495c9cd9ec5640fc7.
The previous commit don't contain additional information, which is bad.
Chuanqi Xu [Mon, 21 Feb 2022 09:41:46 +0000 (17:41 +0800)]
[C++20] [Modules] Implement Reachable initiallly
Peixin-Qiao [Wed, 29 Jun 2022 04:20:03 +0000 (12:20 +0800)]
[NFC][OpenMP] Fix worksharing-loop
1. Remove the redundant collapse clause in MLIR OpenMP worksharing-loop
operation.
2. Fix several typos.
3. Refactor the chunk size type conversion since CreateSExtOrTrunc has
both type check and type conversion.
Reviewed By: kiranchandramohan
Differential Revision: https://reviews.llvm.org/
D128338
Chen Zheng [Wed, 29 Jun 2022 04:16:01 +0000 (04:16 +0000)]
[XCOFF] change default program code csect alignment to 32
This is the same with commercial XLC on AIX.
Reviewed By: Esme
Differential Revision: https://reviews.llvm.org/
D114419
Chen Zheng [Wed, 29 Jun 2022 04:08:58 +0000 (04:08 +0000)]
[XCOFF] make sure same number of paddings are added for DWARF sections
We are going to change alignment for DWARF sections. This patch is to
fix functionality issue if the alignment is not the same with
DefaultSectionAlign defined in XCOFFObjectWriter.cpp.
Currently no test for this patch as for now alignment for DWARF sections
and other sections are always the same. This patch will be tested when
patch changing DWARF section is merged in.
Reviewed By: Esme
Differential Revision: https://reviews.llvm.org/
D116092
Peixin-Qiao [Wed, 29 Jun 2022 04:07:29 +0000 (12:07 +0800)]
[NFC][flang] Add the missed `constexpr`
Reviewed By: kiranchandramohan
Differential Revision: https://reviews.llvm.org/
D128598
Petr Hosek [Tue, 22 Mar 2022 01:30:35 +0000 (18:30 -0700)]
[CoverageMapping] Remove dots from paths inside the profile
We already remove dots from collected paths and path mappings. This
makes it difficult to match paths inside the profile which contain
dots. For example, we would never match /path/to/../file.c because
the collected path is always be normalized to /path/file.c. This
change enables dot removal for paths inside the profile to address
the issue.
Differential Revision: https://reviews.llvm.org/
D123164
Peixin-Qiao [Wed, 29 Jun 2022 03:25:19 +0000 (11:25 +0800)]
[flang] Support check for BIND statement entity
As Fortran 2018 8.6.4(1), the BIND statement specifies the BIND attribute
for a list of variables and common blocks.
Reviewed By: klausler
Differential Revision: https://reviews.llvm.org/
D127120
Chris Bieneman [Wed, 29 Jun 2022 02:53:54 +0000 (21:53 -0500)]
[DirectX] Properly fix for MaybeAlign API removal
My last attempt caused a test case failure. This resovles the issue.
Luo, Yuanke [Sat, 25 Jun 2022 14:00:44 +0000 (22:00 +0800)]
[X86][AMX] Split greedy RA for tile register
When we fill the shape to tile configure memory, the shape is gotten
from AMX pseudo instruction. However the register for the shape may be
split or spilled by greedy RA. That cause we fill the shape to config
memory after ldtilecfg is executed, so that the shape configuration
would be wrong.
This patch is to split the tile register allocation from greedy register
allocation, so that after tile registers are allocated the shape
registers are still virtual register. The shape register only may be
redefined or multi-defined by phi elimination pass, two address pass.
That doesn't affect tile register configuration.
Differential Revision: https://reviews.llvm.org/
D128584
Rafael Auler [Wed, 29 Jun 2022 02:23:28 +0000 (19:23 -0700)]
Revert "[BOLT][AArch64] Handle gold linker veneers"
This reverts commit
425dda76e9fac93117289fd68a2abdfb1e4a0ba5.
This commit is currently causing BOLT to crash in one of our
binaries and needs a bit more checking to make sure it is safe
to land.
Guozhi Wei [Wed, 29 Jun 2022 01:37:56 +0000 (01:37 +0000)]
[AArch64] Update test case.
Replace the new generated virtual register number with a macro to avoid
name mismatch due to different configuration of compiler.
Chris Bieneman [Wed, 29 Jun 2022 00:45:11 +0000 (19:45 -0500)]
[DirectX] Fix broken build after API removal
This removes the DirectX backend's usage of the MaybeAlign API removed
in
1baf1fc.
Walter Erquinigo [Wed, 29 Jun 2022 00:38:17 +0000 (17:38 -0700)]
[trace] Fix errors when handling command arguments
https://reviews.llvm.org/
D128453 recently added some safety checks for
command arguments. Unfortunately, some few commands started failing due
to that, and this diff fixes it. But fortunately, the fix is trivial, which is
simply declaring the argument that these commands will receive.
Differential Revision: https://reviews.llvm.org/
D128775
Craig Topper [Tue, 28 Jun 2022 23:48:14 +0000 (16:48 -0700)]
[RISCV] Fold (add X, [-4096, -2049]) or (add X, [2048,4096]) into load/store address during isel.
Previously we iseled this to a pair of ADDIs and relied on a post
isel peephole to fold one of the ADDIs into the load/store. Now
we split the immediate in two parts the same way isel does and fold
one of the pieces. If the add has a non-memory use it will emit
two isels and larger one will CSE with the ADDI we created for the
the memory use.
Reviewed By: reames
Differential Revision: https://reviews.llvm.org/
D128741
Walter Erquinigo [Fri, 24 Jun 2022 18:19:09 +0000 (11:19 -0700)]
[trace] Improve the TraceCursor iteration API
The current way ot traversing the cursor is a bit uncommon and it can't handle empty traces, in fact, its invariant is that it shold always point to a valid item. This diff simplifies the cursor API and allows it to point to invalid items, thus being able to handle empty traces or to know it ran out of data.
- Removed all the granularity functionalities, because we are not actually making use of that. We can bring them back when they are actually needed.
- change the looping logic to the following:
```
for (; cursor->HasValue(); cursor->Next()) {
if (cursor->IsError()) {
.. do something for error
continue;
}
.. do something for instruction
}
```
- added a HasValue method that can be used to identify if the cursor ran out of data, the trace is empty, or the user tried to move to an invalid position via SetId() or Seek()
- made several simplifications to severals parts of the code.
Differential Revision: https://reviews.llvm.org/
D128543
Stanislav Mekhanoshin [Tue, 28 Jun 2022 22:06:41 +0000 (15:06 -0700)]
[AMDGPU] Relax verification of soffset in scalar stores
It must use m0 only on GFX8. Later chips can use ang SGPR.
Differential Revision: https://reviews.llvm.org/
D128765
Fangrui Song [Tue, 28 Jun 2022 23:01:30 +0000 (16:01 -0700)]
[Driver][ARM][AArch64] Use err_drv_unsupported_option_argument for -march=/-mcpu=/-mtune= diagnostics
err_drv_clang_unsupported is for a Clang unsupported option (any value is rejected).
err_drv_unsupported_option_argument is for an unsupported value (other values may be supported).