review.tizen.org Git - platform/upstream/llvm.git/log

projects / platform / upstream / llvm.git / log

Djordje Todorovic [Mon, 6 Dec 2021 10:58:50 +0000 (02:58 -0800)]

[NFC][LICM] Update the comment in the scalar-promote.ll

The comment was stale after the https://reviews.llvm.org/D113289
was committed.

commit | commitdiff | tree

Kazushi (Jam) Marukawa [Fri, 19 Nov 2021 11:43:19 +0000 (20:43 +0900)]

[VE] Support multiple architectures installation

Change C++ header files placement to support multiple LLVM_RUNTIME_TARGETS
build. Also modifies regression test for it.

Reviewed By: simoll

Differential Revision: https://reviews.llvm.org/D114527

commit | commitdiff | tree

Kristina Bessonova [Mon, 6 Dec 2021 10:19:09 +0000 (12:19 +0200)]

[clang][DebugInfo] Allow function-local statics and types to be scoped within a lexical block

This is almost a reincarnation of https://reviews.llvm.org/D15977 originally
implemented by Amjad Aboud. It was discussed on llvm-dev [0], committed
with its backend counterpart [1], but finally reverted [2].

This patch makes clang to emit debug info for function-local static variables,
records (classes, structs and unions) and typdefs correctly scoped if
those function-local entites defined within a lexical (bracketed) block.

Before this patch, clang emits all those entities directly scoped in
DISubprogram no matter where they were really defined, causing
debug info loss (reported several times in [3], [4], [5]).

[0] https://lists.llvm.org/pipermail/llvm-dev/2015-November/092551.html
[1] https://reviews.llvm.org/rG30e7a8f694a19553f64b3a3a5de81ce317b9ec2f
[2] https://reviews.llvm.org/rGdc4531e552af6c880a69d226d3666756198fbdc8
[3] https://bugs.llvm.org/show_bug.cgi?id=19238
[4] https://bugs.llvm.org/show_bug.cgi?id=23164
[5] https://bugs.llvm.org/show_bug.cgi?id=44695

Reviewed By: dblaikie

Differential Revision: https://reviews.llvm.org/D113743

commit | commitdiff | tree

Matthias Springer [Mon, 6 Dec 2021 09:06:41 +0000 (18:06 +0900)]

[mlir][linalg][bufferize][NFC] Utilize isWritable for FuncOps

This is a cleanup of ModuleBufferization. Instead of storing information about writable function arguments in BufferizationAliasInfo, we can use isWritable and make the decision there, based on dialect-specifc bufferization state.

Differential Revision: https://reviews.llvm.org/D114930

commit | commitdiff | tree

Balazs Benics [Mon, 6 Dec 2021 09:20:17 +0000 (10:20 +0100)]

[analyzer] Ignore flex generated files

Some projects [1,2,3] have flex-generated files besides bison-generated
ones.
Unfortunately, the comment `"/* A lexical scanner generated by flex */"`
generated by the tools is not necessarily at the beginning of the file,
thus we need to quickly skim through the file for this needle string.

Luckily, StringRef can do this operation in an efficient way.

That being said, now the bison comment is not required to be at the very
beginning of the file. This allows us to detect a couple more cases
[4,5,6].

Alternatively, we could say that we only allow whitespace characters
before matching the bison/flex header comment. That would prevent the
(probably) unnecessary string search in the buffer. However, I could not
verify that these tools would actually respect this assumption.

Additionally to this, e.g. the Twin project [1] has other non-whitespace
characters (some preprocessor directives) before the flex-generated
header comment. So the heuristic in the previous paragraph won't work
with that.
Thus, I would advocate the current implementation.

According to my measurement, this patch won't introduce measurable
performance degradation, even though we will do 2 linear scans.

I introduce the ignore-bison-generated-files and
ignore-flex-generated-files to disable skipping these files.
Both of these options are true by default.

[1]: https://github.com/cosmos72/twin/blob/master/server/rcparse_lex.cpp#L7
[2]: https://github.com/marcauresoar/make-examples/blob/22362cdcf9dd7c597b5049ce7f176621e2e9ac7a/sandbox/count-words/lexer.c#L6
[3]: https://github.com/vladcmanea/2nd-faculty-year-Formal-Languages---Automata-assignments/blob/11abdf64629d9eb741438ba69f04636769d5a374/lab1/lex.yy.c#L6

[4]: https://github.com/KritikaChoudhary/System-Software-Lab/blob/47f5b2cfe2a2738fd54eae9f8439817f6a22034e/B_yacc/1/y1.tab.h#L2
[5]: https://github.com/VirtualMonitor/VirtualMonitor/blob/71d1bf9b1e7b392a7bd0c73dc217138dc5865651/src/VBox/Additions/x11/x11include/xorg-server-1.8.0/parser.h#L2
[6]: https://github.com/bspaulding/DrawTest/blob/3f773ceb13de14275429036b9cbc5aa19e29bab9/Framework/OpenEars.framework/Versions/A/Headers/jsgf_parser.h#L2

Reviewed By: xazax.hun

Differential Revision: https://reviews.llvm.org/D114510

commit | commitdiff | tree

Matthias Springer [Mon, 6 Dec 2021 08:40:08 +0000 (17:40 +0900)]

[mlir][linalg][bufferize] Remove buffer equivalence from bufferize

Remove all function calls related to buffer equivalence from bufferize implementations.

Add a new PostAnalysisStep for scf.for that ensures that yielded values are equivalent to the corresponding BBArgs. (This was previously checked in `bufferize`.) This will be relaxed in a subsequent commit.

Note: This commit changes two test cases. These were broken by design
and should not have passed. With the new scf.for PostAnalysisStep, this
bug was fixed.

Differential Revision: https://reviews.llvm.org/D114927

commit | commitdiff | tree

Paulo Matos [Sun, 5 Dec 2021 12:50:47 +0000 (13:50 +0100)]

[WebAssembly] Implementation of intrinsic for ref.null and HeapType removal

This patch implements the intrinsic for ref.null.
In the process of implementing int_wasm_ref_null_func() and
int_wasm_ref_null_extern() intrinsics, it removes the redundant
HeapType.

This also causes the textual assembler syntax for ref.null to
change. Instead of receiving an argument: `func` or `extern`, the
instruction mnemonic is either ref.null_func or ref.null_extern,
without the need for a further operand.

Reviewed By: tlively

Differential Revision: https://reviews.llvm.org/D114979

commit | commitdiff | tree

MaheshRavishankar [Mon, 6 Dec 2021 08:35:47 +0000 (08:35 +0000)]

[mlir] Add default implementations for methods in `TilingInterface`.

Adding the default implementation of `getLoopIteratorTypes` and
`getLoopBounds` allows ExternalModels to override these methods.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D115101

commit | commitdiff | tree

Matthias Springer [Mon, 6 Dec 2021 08:25:13 +0000 (17:25 +0900)]

[mlir][linalg][bufferize][NFC] Collect equivalent FuncOp BBArgs in PostAnalysisStep

Collect equivalent BBArgs right after the equivalence analysis of the FuncOp and before bufferizing. This is in preparation of decoupling bufferization from aliasInfo.

Also gather equivalence info for CallOps, which was missing in the
previous commit.

Differential Revision: https://reviews.llvm.org/D114847

commit | commitdiff | tree

Nikita Popov [Sun, 4 Jul 2021 16:14:09 +0000 (18:14 +0200)]

[llvm-c] Add header deprecations

This adds support for header deprecation using
LLVM_ATTRIBUTE_C_DEPRECATED (note that we can't use
LLVM_ATTRIBUTE_DEPRECATED, which is C++ specific). This will not
help people using the FFI interface, but may help people using the
C headers.

Differential Revision: https://reviews.llvm.org/D114936

commit | commitdiff | tree

Michal Terepeta [Mon, 6 Dec 2021 07:59:49 +0000 (07:59 +0000)]

[mlir][Vector] Support 0-D vectors in `ConstantMaskOp`

To support creating both a mask with just a single `true` and `false` values,
I had to relax the restriction in the verifier that the rank is always equal to
the length of the attribute array, in other words, we now allow:

- `vector.constant_mask [0] : vector<i1>` which gets lowered to
`arith.constant dense<false> : vector<i1>`
- `vector.constant_mask [1] : vector<i1>` which gets lowered to
`arith.constant dense<true> : vector<i1>`

(the attribute list for the 0-D case must be a singleton containing
either `0` or `1`)

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D115023

commit | commitdiff | tree

gysit [Mon, 6 Dec 2021 07:25:24 +0000 (07:25 +0000)]

[mlir][linalg] Pad independent of application order (NFC).

This revision makes the padding pattern independent of the application order. It addresses the concern that we cannot rely on the execution order of the greedy rewriter (https://reviews.llvm.org/D114689). Instead, the pattern is updated to apply repeatedly till all operations are padded.

Reviewed By: mehdi_amini

Differential Revision: https://reviews.llvm.org/D114851

commit | commitdiff | tree

Tan S. B [Mon, 6 Dec 2021 06:32:48 +0000 (22:32 -0800)]

[clang-format] Adjust braced list detection

This avoids mishandling nested compound statements that are followed by another compound statement.

Fixes https://llvm.org/PR38314 and https://llvm.org/PR48305.

Differential Revision: https://reviews.llvm.org/D114583

commit | commitdiff | tree

Zi Xuan Wu [Mon, 6 Dec 2021 05:36:20 +0000 (13:36 +0800)]

[CSKY] Add compressed instruction mapping between 32-bit and 16-bit instruction

Add all CompressPat to map instructions between 16-bit and 32-bit with using the CompressInstEmitter infra.
Although it's only used in asm printer, also enable it in asm parser to debug mapping when -enable-csky-asm-compressed-inst is on.

Differential Revision: https://reviews.llvm.org/D115026

commit | commitdiff | tree

Noah Shutty [Mon, 6 Dec 2021 05:39:05 +0000 (05:39 +0000)]

Revert "[llvm] [Debuginfo] Debuginfod client library."

This reverts commit af69947e7028274573cfc927aabead8326b63367 because it
caused buildbot failures.

commit | commitdiff | tree

Noah Shutty [Mon, 6 Dec 2021 04:27:53 +0000 (04:27 +0000)]

[llvm] [Debuginfo] Debuginfod client library.

This adds a Debuginfod library containing the `fetchDebuginfo` function which queries servers specified by the `DEBUGINFOD_URLS` environment variable for the debuginfo, executable, or a specified source file associated with a given build id.

This diff was split out from D111252.

Reviewed By: dblaikie

Differential Revision: https://reviews.llvm.org/D112758

commit | commitdiff | tree

Shivam Gupta [Mon, 6 Dec 2021 03:57:08 +0000 (09:27 +0530)]

[Docs] Fix a link

current link is pointing to https://llvm.org/docs/CodeGenerator.html#segmented-stacks while it point to https://llvm.org/docs/CodeGenerator.html#tail-call-optimization or id81.

Differential Revision: https://reviews.llvm.org/D115119

commit | commitdiff | tree

Qiu Chaofan [Mon, 6 Dec 2021 02:15:05 +0000 (10:15 +0800)]

[PowerPC] Implement general back2back fusion

Implement 'back-to-back' FX fusion according to Power10 User Manual
'19.1.5.4 Fusion', not enabled by default.

Reviewed By: nemanjai

Differential Revision: https://reviews.llvm.org/D114345

commit | commitdiff | tree

Nikolas Klauser [Thu, 2 Dec 2021 20:09:38 +0000 (21:09 +0100)]

[libc++][NFC] Disable clang-tidy checks

Disable clang-tidy checks as discussed in D114915

Reviewed By: ldionne, #libc

Spies: cjdb, aheejin, libcxx-commits

Differential Revision: https://reviews.llvm.org/D114985

commit | commitdiff | tree

Arthur O'Dwyer [Sun, 5 Dec 2021 23:56:58 +0000 (18:56 -0500)]

[libc++] Remove space-alignment of trailing braces in module.modulemap. NFC.

As discussed on the Discord, 2021-12-01 through 2021-12-05.
Our new consistent style for this is "don't align the right-braces"
(but still align the left-braces, as shown).

commit | commitdiff | tree

Jack Andersen [Sun, 5 Dec 2021 19:55:20 +0000 (14:55 -0500)]

[GlobalISel] Allow DBG_VALUE to use undefined vregs before LiveDebugValues.

Expanding on D109750.

Since `DBG_VALUE` instructions have final register validity determined in
`LDVImpl::handleDebugValue`, there is no apparent reason to immediately prune
unused register operands as their defs are erased. Consequently, this renders
`MachineInstr::eraseFromParentAndMarkDBGValuesForRemoval` moot; gaining a
substantial performance improvement.

The only necessary changes involve making relevant passes consider invalid
DBG_VALUE vregs uses as valid.

Reviewed By: MatzeB

Differential Revision: https://reviews.llvm.org/D112852

commit | commitdiff | tree

Jez Ng [Sat, 4 Dec 2021 02:26:32 +0000 (21:26 -0500)]

[lld-macho] Unreferenced weak dylib symbols shouldn't fetch archive symbols

We were fetching archive symbols too eagerly, bloating binary size as well as
just screwing up binaries that expected to look up certain symbols only at
runtime.

Reviewed By: #lld-macho, oontvoo

Differential Revision: https://reviews.llvm.org/D115092

commit | commitdiff | tree

Jack Andersen [Sun, 5 Dec 2021 19:45:33 +0000 (14:45 -0500)]

[CMake] Installable find modules for terminfo and libffi

Improves cross-distro portability of LLVM cmake package by resolving paths for
terminfo and libffi via import targets.

When LLVMExports.cmake is generated for installation, it contains absolute
library paths which are likely to be a common cause of portability issues. To
mitigate this, the discovery logic for these dependencies is refactored into
find modules which get installed alongside LLVMConfig.cmake. The result is
cleaner, cmake-friendly management of these dependencies that respect the
environment of the LLVM package importer.

Reviewed By: JDevlieghere

Differential Revision: https://reviews.llvm.org/D114327

commit | commitdiff | tree

Jack Andersen [Sun, 5 Dec 2021 19:35:33 +0000 (14:35 -0500)]

Test commit to check access.

commit | commitdiff | tree

Mehdi Amini [Sun, 5 Dec 2021 19:16:54 +0000 (19:16 +0000)]

Fix TOSA verifier to emit verbose errors

Also as a test for invalid ops which was missing.

commit | commitdiff | tree

Michael Liao [Sun, 5 Dec 2021 18:39:48 +0000 (13:39 -0500)]

Fix `-Wunused-variable` warning. NFC.

commit | commitdiff | tree

Arthur O'Dwyer [Sun, 5 Dec 2021 18:21:01 +0000 (13:21 -0500)]

[libc++] Adjust space-alignment in module.modulemap. NFC.

commit | commitdiff | tree

Arthur O'Dwyer [Sun, 5 Dec 2021 18:08:36 +0000 (13:08 -0500)]

[libc++] Add missing `#pragma GCC system_header` in a few headers. NFCI.

commit | commitdiff | tree

Arthur O'Dwyer [Sun, 5 Dec 2021 18:05:21 +0000 (13:05 -0500)]

[libc++] Fix an include-guard comment. NFC.

commit | commitdiff | tree

Nico Weber [Sun, 5 Dec 2021 18:15:56 +0000 (13:15 -0500)]

[gn build] port a8025e06fc0f more

src/ryu/*.cpp includes files relative to src, so src/ needs
to be passes as -I flag now.

commit | commitdiff | tree

Mark de Wever [Sun, 5 Dec 2021 16:44:24 +0000 (17:44 +0100)]

[libc++][doc] Update format implementation status.

commit | commitdiff | tree

Kazu Hirata [Sun, 5 Dec 2021 16:33:02 +0000 (08:33 -0800)]

[llvm] Use range-based for loops (NFC)

commit | commitdiff | tree

Sanjay Patel [Sun, 5 Dec 2021 14:47:17 +0000 (09:47 -0500)]

[InstSimplify] fix logic fold of 'or' for vectors

Reduce code duplication for commutative pattern matching
and fix a miscompile.

We can't safely propagate an undef element in this transform:
https://alive2.llvm.org/ce/z/s5xy55

commit | commitdiff | tree

Sanjay Patel [Sun, 5 Dec 2021 14:28:11 +0000 (09:28 -0500)]

[InstSimplify] add/adjust tests for 'or' logic fold; NFC

The last test shows a miscompile:
https://alive2.llvm.org/ce/z/s5xy55

commit | commitdiff | tree

Sanjay Patel [Fri, 3 Dec 2021 14:14:41 +0000 (09:14 -0500)]

[InstCombine] add tests for icmp with mul op with known bits; NFC

D114962

commit | commitdiff | tree

Kristina Bessonova [Sun, 5 Dec 2021 13:31:25 +0000 (15:31 +0200)]

Follow-up for D113741: fix DebugInfo/Generic/lexical_block_static.ll on MachO

commit | commitdiff | tree

Nilay Vaish [Sun, 5 Dec 2021 13:02:32 +0000 (14:02 +0100)]

Remove duplicate comment

The same comment appears in the very next line.

Reviewed By: #libc, ldionne

Differential Revision: https://reviews.llvm.org/D115018

commit | commitdiff | tree

Nico Weber [Sun, 5 Dec 2021 12:52:43 +0000 (07:52 -0500)]

[gn build] (semiautomaticallly) port a8025e06fc0f (libc++ ryu)

commit | commitdiff | tree

Mark de Wever [Tue, 9 Feb 2021 16:52:41 +0000 (17:52 +0100)]

Microsoft's floating-point to_chars powered by Ryu and Ryu Printf

Microsoft would like to contribute its implementation of floating-point to_chars to libc++. This uses the impossibly fast Ryu and Ryu Printf algorithms invented by Ulf Adams at Google. Upstream repos: https://github.com/microsoft/STL and https://github.com/ulfjack/ryu .

Licensing notes: MSVC's STL is available under the Apache License v2.0 with LLVM Exception, intentionally chosen to match libc++. We've used Ryu under the Boost Software License.

This patch contains minor changes from Jorg Brown at Google, to adapt the code to libc++. He verified that it works in Google's Linux-based environment, but then I applied more changes on top of his, so any compiler errors are my fault. (I haven't tried to build and test libc++ yet.) Please tell me if we need to do anything else in order to follow https://llvm.org/docs/DeveloperPolicy.html#attribution-of-changes .

Notes:

* libc++'s integer charconv is unchanged (except for a small refactoring). MSVC's integer charconv hasn't been tuned for performance yet, so you're not missing anything.
* Floating-point from_chars isn't part of this patch because Jorg found that MSVC's implementation (derived from our CRT's strtod) was slower than Abseil's. If you're unable to use Abseil or another implementation due to licensing or technical considerations, Microsoft would be delighted if you used MSVC's from_chars (and you can just take it, or ask us to provide a patch like this). Ulf is also working on a novel algorithm for from_chars.
* This assumes that float is IEEE 32-bit, double is IEEE 64-bit, and long double is also IEEE 64-bit.
* I have added MSVC's charconv tests (the whole thing: integer/floating from_chars/to_chars), but haven't adapted them to libcxx's harness at all. (These tests will be available in the microsoft/STL repo soon.)
* Jorg added int128 codepaths. These were originally present in upstream Ryu, and I removed them from microsoft/STL purely for performance reasons (MSVC doesn't support int128; Clang on Windows does, but I found that x64 intrinsics were slightly faster).
* The implementation is split into 3 headers. In MSVC's STL, charconv contains only Microsoft-written code. xcharconv_ryu.h contains code derived from Ryu (with significant modifications and additions). xcharconv_ryu_tables.h contains Ryu's large lookup tables (they were sufficiently large to make editing inconvenient, hence the separate file). The xmeow.h convention is MSVC's for internal headers; you may wish to rename them.
* You should consider separately compiling the lookup tables (see https://github.com/microsoft/STL/issues/172 ) for compiler throughput and reduced object file size.
* See https://github.com/StephanTLavavej/llvm-project/commits/charconv for fine-grained history. (If necessary, I can perform some rebase surgery to show you what Jorg changed relative to the microsoft/STL repo; currently that's all fused into the first commit.)

Differential Revision: https://reviews.llvm.org/D70631

commit | commitdiff | tree

Mark de Wever [Sun, 5 Dec 2021 12:22:58 +0000 (13:22 +0100)]

[libc++][ci] Disable generating debug information.

In the bootstrap build generating debug information causes an ICE.
This is a work-around for llvm.org/PR52584

commit | commitdiff | tree

Florian Hahn [Sun, 5 Dec 2021 12:14:58 +0000 (12:14 +0000)]

[VPlan] Separate ctors for VPWidenIntOrFpInduction. (NFC)

VPWidenIntOrFpInductionRecipes can either be constructed with a PHI and
an optional cast or a PHI and a trunc instruction. Reflect this in 2
separate constructors. This also simplifies a follow-up change.

commit | commitdiff | tree

Kristina Bessonova [Sat, 4 Dec 2021 15:12:47 +0000 (17:12 +0200)]

Reland [DwarfDebug] Support emitting function-local declaration for a lexical block

This is another attempt to make function-local declarations
(like static variables, structs/classes and other) be correctly
emitted within a lexical (bracketed) block.

Fixes https://bugs.llvm.org/show_bug.cgi?id=19238.

Reviewed By: dblaikie

Differential Revision: https://reviews.llvm.org/D113741

commit | commitdiff | tree

Kristina Bessonova [Sat, 4 Dec 2021 12:08:10 +0000 (14:08 +0200)]

Reland [DwarfDebug] Move emission of global vars, types and imports to endModule()

This patch proposes to move emission of global variables, types,
imported entities, etc from DwarfDebug::beginModule() to DwarfDebug::endModule().
Effectively, this changes nothing but the order of debug entities which
will be as follows:
* subprograms (including related context, local variables/labels,
  local imported entities; related types can be created as a part of
  the emission of local entities of an abstract subprogram);
* global variables (including related context and types);
* retained types and enums;
* non-local-scoped imported entities;
* basic types;
* other types left (as a part of local variables attributes emission).

Note that the order of emitted compile units may also be changed as now we emit
units that contain subprograms first and then all other non-empty units.

The motivation behind this change is the following:
(1) DwarfDebug::beginModule() is run at the very beginning of backend's pipeline,
    from this time IR can be significantly changed by target-specific passes.
    If it happens for debug metadata of global entities, those changes will not
    be reflected in the emitted DWARF.
(2) imported subprogram names should refer to an abstract subprogram if it exists,
    but it isn't known in DwarfDebug::beginModule() (it's possible to make some
    guesses based on location info, but it's not quite reliable);
(3) aforementioned entities if they are scoped within a bracketed block
    (subject of D113741) couldn't be emitted in DwarfDebug::beginModule()
    (they need parent emitted first). Another problem is if to try to gather
    some information about local entities and defer their emission
    (till subprogram's processing or DwarfDebug::endModule()) all the gathered
    details might be irrelevant / invalid by the time the entities are being
    emitted (because of (1)).

Reviewed By: dblaikie

Differential Revision: https://reviews.llvm.org/D114705

commit | commitdiff | tree

Phoebe Wang [Sun, 5 Dec 2021 11:17:12 +0000 (19:17 +0800)]

[X86][FP16] Replace vXi16 to vXf16 instead of v8f16

Fixes pr52561

Reviewed By: LuoYuanke

Differential Revision: https://reviews.llvm.org/D114304

commit | commitdiff | tree

Florian Hahn [Sun, 5 Dec 2021 11:12:44 +0000 (11:12 +0000)]

[MemoryLocation] Use getForArgument in getForSource/getForDest. (NFC)

getForArgument already knows how to extract a memory location for all
memory intrinsics. Use it instead of duplicating the logic.

commit | commitdiff | tree

Lang Hames [Sun, 5 Dec 2021 09:31:03 +0000 (20:31 +1100)]

[JITLink][ELF][x86-64] Adjust addends for R_X86_64_PLT32 relocations.

R_X86_64_PLT32 explicitly represents the '-4' PC-adjustment in the relocation's
addend, but JITLink's x86_64::Branch32PCRel includes the PC-adjustment
implicitly. We have been zeroing the addend to account for the difference, but
this breaks for branches to non-zero offsets past labels. This patch updates the
relocation parsing code to unconditionally adjust the offset by '+4' instead.
For branches directly to labels the result is still 0, for branches to offsets
past labels the result is the correct addend for x86_64::Branch32PCRel.

commit | commitdiff | tree

David Green [Sun, 5 Dec 2021 09:25:52 +0000 (09:25 +0000)]

[DAG] Create fptoui.sat from clamped fptosi

As an extension to D111976, this converts clamp fptosi, clamped between
0 and (2^n)-1 to a fptoui.sat. This can greatly help on targets with
conversions that naturally saturate, such as Arm.

X86 disables the transform as some of the test cases increases in size.
A fptoui.sat necessitates a fp clamp without native support, so there is
little use in converting if the instruction is just going to be
expanded.

Differential Revision: https://reviews.llvm.org/D112428

commit | commitdiff | tree

Lang Hames [Sun, 5 Dec 2021 06:12:39 +0000 (17:12 +1100)]

[JITLink][ELF][x86-64] Use the right edge-naming function for debugging output.

Graph edges use the generic x86-64 edge set (the ELF specific edges are only
used during parsing).

commit | commitdiff | tree

Lang Hames [Fri, 3 Dec 2021 23:15:15 +0000 (10:15 +1100)]

[llvm-jitlink] Allow -entry option to find hidden symbols.

This is useful when debugging failures in object files compiled with
visibility=hidden.

commit | commitdiff | tree

Michael Liao [Sat, 4 Dec 2021 23:20:21 +0000 (18:20 -0500)]

Fix `-Wunused-variable` warning. NFC.

commit | commitdiff | tree

Nico Weber [Sun, 5 Dec 2021 03:29:05 +0000 (22:29 -0500)]

[gn build] port f1585a4b47cc

commit | commitdiff | tree

Leonard Grey [Sun, 5 Dec 2021 03:24:58 +0000 (22:24 -0500)]

[Support] Use final filename for Caching buffer identifier

Mach-O LLD uses the buffer identifier of the memory buffer backing an object
file to generate stabs which are used by `dsymutil` to find the object file for
dSYM generation.

When using thinLTO, these buffers are provided by the cache which initially
saves them to disk as temporary files beginning with "Thin-" but renames them
to persistent files beginning with "llvmcache-" before the buffer is provided
to the cache user.

However, the buffer is created before the file is renamed and is given the temp
file's name as an identifier. This causes the generated stabs to point to
nonexistent files.

This change names the buffer with the eventual persistent filename. I think
this is safe because failing to rename the temp file is a fatal error.

Differential Revision: https://reviews.llvm.org/D115055

commit | commitdiff | tree

Kazu Hirata [Sun, 5 Dec 2021 02:34:29 +0000 (18:34 -0800)]

[lldb] Fix a warning

This patch fixes:

  lldb/source/Plugins/Platform/Windows/PlatformWindows.cpp:386:13:
  error: comparison between NULL and non-pointer ('lldb::addr_t' (aka
  'unsigned long') and NULL) [-Werror,-Wnull-arithmetic]

commit | commitdiff | tree

Peter Klausler [Fri, 3 Dec 2021 00:36:09 +0000 (16:36 -0800)]

[flang] OPEN(RECL=) handling for sequential formatted I/O

RECL= is required for direct access I/O, but is permitted
as well for sequential I/O, where it is defined by the
standard to specify a maximum record (line) length.
The standard does not say what should happen when an
sequential formatted input record appears whose length is
unequal to RECL= when it is specified.

Precedents from other compilers are unclear: one raises an error,
some honor RECL= as an effective truncation, and a few ignore the
situation. On output, all other compilers tested raised an
error when an attempt is made to emit a record longer than RECL=.

This patch treats RECL= as effective truncation on input and
as a hard limit with error on output, and also ensures that
RECL= can be set *longer* than the actual input record lengths.

Differential Revision: https://reviews.llvm.org/D115102

commit | commitdiff | tree

Zhihao Yuan [Wed, 24 Nov 2021 00:38:53 +0000 (16:38 -0800)]

[PowerPC] Drop stdlib paths in freestanding tests

When targeting FreeBSD on a Linux host with a copy
of system libc++, Clang prepends /usr/include/c++/v1
to the search paths even with -ffreestanding, and
fails to compile a program with a
single #include <xmmintrin.h>

Dropping the path with -nostdlibinc.

Differential Revision: https://reviews.llvm.org/D114497

commit | commitdiff | tree

Florian Hahn [Sat, 4 Dec 2021 22:18:38 +0000 (22:18 +0000)]

[MemoryLocation] Support missing atomic intrinsics in getForArg.

getForArgument is missing support for atomic memory transfer
intrinsics. In terms of accessed locations they behave like regular
memory transfer intrinsics and we already support them as such in
getForSource/getForDest.

commit | commitdiff | tree

Butygin [Thu, 28 Oct 2021 16:04:35 +0000 (19:04 +0300)]

[mlir] Add InlinerInterface to bufferization dialect

Differential Revision: https://reviews.llvm.org/D115080

commit | commitdiff | tree

Björn Schäpers [Fri, 3 Dec 2021 15:59:34 +0000 (16:59 +0100)]

[clang-format][NFC] Use member directly

Instead of passing it as argument to the member function.

Differential Revision: https://reviews.llvm.org/D115072

commit | commitdiff | tree

Björn Schäpers [Fri, 3 Dec 2021 15:40:52 +0000 (16:40 +0100)]

[clang-format][NFC] Use range based for for fake l parens

Differential Revision: https://reviews.llvm.org/D115071

commit | commitdiff | tree

Björn Schäpers [Fri, 3 Dec 2021 15:38:10 +0000 (16:38 +0100)]

[clang-format][NFC] Early return when nothing to do

Do not compute SkipFirstExtraIndent just to see that there are no fake l
parens.

Differential Revision: https://reviews.llvm.org/D115070

commit | commitdiff | tree

Björn Schäpers [Fri, 3 Dec 2021 08:25:45 +0000 (09:25 +0100)]

[clang-format][NFC] Move static variable in scope

Let only the JS/TS users pay for the initialistation.

Differential Revision: https://reviews.llvm.org/D115068

commit | commitdiff | tree

Björn Schäpers [Fri, 3 Dec 2021 07:47:38 +0000 (08:47 +0100)]

[clang-format][NFC] Use range based for

That's much easier to read.

Differential Revision: https://reviews.llvm.org/D115067

commit | commitdiff | tree

Björn Schäpers [Fri, 3 Dec 2021 07:45:56 +0000 (08:45 +0100)]

[clang-format][NFC] Reorder conditions

Prefer to check the local variables first before dereferencing the
pointer.

Differential Revision: https://reviews.llvm.org/D115066

commit | commitdiff | tree

Björn Schäpers [Fri, 3 Dec 2021 07:29:47 +0000 (08:29 +0100)]

[clang-format][NFC] Merge two calls of isOneOf

Differential Revision: https://reviews.llvm.org/D115065

commit | commitdiff | tree

Björn Schäpers [Fri, 3 Dec 2021 07:25:23 +0000 (08:25 +0100)]

[clang-format][NFC] Rename variable so no shadowing happens

In the loop there is also a Node.

Differential Revision: https://reviews.llvm.org/D115063

commit | commitdiff | tree

Björn Schäpers [Fri, 3 Dec 2021 07:24:02 +0000 (08:24 +0100)]

[clang-format][NFC] Prefer pass by reference

Differential Revision: https://reviews.llvm.org/D115061

commit | commitdiff | tree

Mehrnoosh Heidarpour [Fri, 3 Dec 2021 15:04:43 +0000 (10:04 -0500)]

[InstSimplify] Add logic 'or' fold to -1

Adding the following folding opportunity:
(~A | B) | (A ^ B) --> -1

https://alive2.llvm.org/ce/z/PMtdYB

Differential revision: https://reviews.llvm.org/D114996

commit | commitdiff | tree

Peter Klausler [Thu, 2 Dec 2021 20:34:37 +0000 (12:34 -0800)]

[flang] Fix folding of EXPONENT() intrinsic function

The definition of the EXPONENT() intrinsic function differs by one
from the real arithmetic folding templates concept of an unbiased
exponent, and also needs special handling for zero. Fix, and add
more tests.

Differential Revision: https://reviews.llvm.org/D115084

commit | commitdiff | tree

Saleem Abdulrasool [Mon, 29 Nov 2021 04:05:31 +0000 (20:05 -0800)]

Windows: support `DoLoadImage`

This implements `DoLoadImage` and `UnloadImage` in the Windows platform
plugin modelled after the POSIX platform plugin.  This was previously
unimplemented and resulted in a difficult to decipher error without any
logging.

This implementation is intended to support enables the use of LLDB's
Swift REPL on Windows.

Paths which are added to the library search path are persistent and
applied to all subsequent loads.  This can be adjusted in the future by
storing all the cookies and restoring the path prior to returning from
the helper.  However, the dynamic path count makes this a bit more
challenging.

Reviewed By: @JDevlieghere
Differential Revision: https://reviews.llvm.org/D77287

commit | commitdiff | tree

Dimitry Andric [Tue, 23 Nov 2021 20:21:02 +0000 (21:21 +0100)]

[XRay] fix more -Wformat warnings

Building xray with recent clang on a 64-bit system results in a number
of -Wformat warnings:

    compiler-rt/lib/xray/xray_allocator.h:70:11: warning: format specifies type 'int' but the argument has type '__sanitizer::uptr' (aka 'unsigned long') [-Wformat]
              RoundedSize, B);
              ^~~~~~~~~~~
    compiler-rt/lib/xray/xray_allocator.h:119:11: warning: format specifies type 'int' but the argument has type '__sanitizer::uptr' (aka 'unsigned long') [-Wformat]
              RoundedSize, B);
              ^~~~~~~~~~~

Since `__sanitizer::uptr` has the same size as `size_t`, these can be
fixed by using the printf specifier `%zu`.

    compiler-rt/lib/xray/xray_basic_logging.cpp:348:46: warning: format specifies type 'int' but the argument has type '__sanitizer::tid_t' (aka 'unsigned long long') [-Wformat]
          Report("Cleaned up log for TID: %d\n", GetTid());
                                          ~~     ^~~~~~~~
                                          %llu
    compiler-rt/lib/xray/xray_basic_logging.cpp:353:62: warning: format specifies type 'int' but the argument has type '__sanitizer::tid_t' (aka 'unsigned long long') [-Wformat]
          Report("Skipping buffer for TID: %d; Offset = %llu\n", GetTid(),
                                           ~~                    ^~~~~~~~
                                           %llu

Since `__sanitizer::tid_t` is effectively declared as `unsigned long
long`, these can be fixed by using the printf specifier `%llu`.

    compiler-rt/lib/xray/xray_basic_logging.cpp:354:14: warning: format specifies type 'unsigned long long' but the argument has type 'size_t' (aka 'unsigned long') [-Wformat]
                 TLD.BufferOffset);
                 ^~~~~~~~~~~~~~~~

Since `BufferOffset` is declared as `size_t`, this one can be fixed by
using `%zu` as a printf specifier.

    compiler-rt/lib/xray/xray_interface.cpp:172:50: warning: format specifies type 'int' but the argument has type 'uint64_t' (aka 'unsigned long') [-Wformat]
        Report("Unsupported sled kind '%d' @%04x\n", Sled.Address, int(Sled.Kind));
                                       ~~            ^~~~~~~~~~~~
                                       %lu

Since ``xray::SledEntry::Address` is declared as `uint64_t`, this one
can be fixed by using `PRIu64`, and adding `<cinttypes>`.

    compiler-rt/lib/xray/xray_interface.cpp:308:62: warning: format specifies type 'long long' but the argument has type 'size_t' (aka 'unsigned long') [-Wformat]
        Report("System page size is not a power of two: %lld\n", PageSize);
                                                        ~~~~     ^~~~~~~~
                                                        %zu
    compiler-rt/lib/xray/xray_interface.cpp:359:64: warning: format specifies type 'long long' but the argument has type 'size_t' (aka 'unsigned long') [-Wformat]
        Report("Provided page size is not a power of two: %lld\n", PageSize);
                                                          ~~~~     ^~~~~~~~
                                                          %zu

Since `PageSize` is declared as `size_t`, these can be fixed by using
`%zu` as a printf specifier.

Reviewed By: vitalybuka

Differential Revision: https://reviews.llvm.org/D114469

commit | commitdiff | tree

Nikita Popov [Sat, 4 Dec 2021 17:54:36 +0000 (18:54 +0100)]

[llvm-c] Avoid deprecated APIs in tests

Avoid the use of deprecated (opaque pointer incompatible) APIs
in C API tests, in preparation for header deprecation. Add a
LLVMGetGEPSourceElementType() to cover a bit of functionality
that is necessary for the echo test.

This change is split out from https://reviews.llvm.org/D114936.

commit | commitdiff | tree

Kazu Hirata [Sat, 4 Dec 2021 16:48:04 +0000 (08:48 -0800)]

[CodeGen] Use range-based for loops (NFC)

commit | commitdiff | tree

Nikita Popov [Sat, 4 Dec 2021 16:24:07 +0000 (17:24 +0100)]

[NewPM] Test more options in pipeline test (NFC)

As suggested on D115098, this tests the positioning of
HotColdSplitting, IROutliner and MergeFunctions in the optimization
pipeline.

commit | commitdiff | tree

Nikita Popov [Sat, 4 Dec 2021 12:14:15 +0000 (13:14 +0100)]

[NewPM] Fix MergeFunctions scheduling

MergeFunctions (as well as HotColdSplitting an IROutliner) are
incorrectly scheduled under the new pass manager. The code makes
it look like they run towards the end of the module optimization
pipeline (as they should), while in reality the run at the start.
This is because the OptimizePM populated around them is only
scheduled later.

I'm fixing this by moving these three passes until after OptimizePM
to avoid splitting the function pass pipeline. It doesn't seem
important to me that some of the function passes run after these
late module passes.

Differential Revision: https://reviews.llvm.org/D115098

commit | commitdiff | tree

Matt Arsenault [Sat, 4 Dec 2021 16:17:39 +0000 (11:17 -0500)]

Attributor: Fix typo in function name

commit | commitdiff | tree

Matt Arsenault [Sat, 4 Dec 2021 16:24:28 +0000 (11:24 -0500)]

OpenMP: Un-xfail tests that pass now

729bf9b26b657df8ddad2e5a63377e6afb349a18 should have fixed these

commit | commitdiff | tree

Kristina Bessonova [Sat, 4 Dec 2021 16:03:46 +0000 (18:03 +0200)]

Revert "[DwarfDebug] Support emitting function-local declaration for a lexical block"

This reverts commits
* ee691970a9a85470948ada623c31f0ab8773617c (D113741),
* 79d3132998b2828be8f7d2ec411f91fb11b3e01f (D114705)

due to lldb and dexter test failures.

commit | commitdiff | tree

Matt Arsenault [Sat, 14 Aug 2021 23:10:46 +0000 (19:10 -0400)]

AMDGPU: Enable fixed function ABI by default

Code using indirect calls is broken without this, and there isn't
really much value in supporting the old attempt to vary the argument
placement based on uses. This resulted in more argument shuffling code
anyway.

Also have the option stop implying all inputs need to be passed. This
will no rely on the amdgpu-no-* attributes to avoid passing
unnecessary values.

commit | commitdiff | tree

Florian Hahn [Sat, 4 Dec 2021 15:20:03 +0000 (15:20 +0000)]

[BasicAA] Add atomic mem intrinsic tests.

commit | commitdiff | tree

Matt Arsenault [Tue, 26 Oct 2021 01:30:42 +0000 (21:30 -0400)]

AMDGPU: Assume all amdhsa kernarg passed implicit arguments by default

Previously we would require adding an attribute to kernels to enable
the inputs passed in the kernarg segment, accessed by
llvm.amdgcn.implicitarg.ptr. This violates the principle of being
correct by default. Some OpenMP testcases were broken recently since
it wasn't correctly setting this attribute, and no known frontends are
setting this to anything other than the maximum.

Most of the test changes are from load widening of argument loads
since there now more implied dereferenceable bytes.

commit | commitdiff | tree

Matt Arsenault [Mon, 25 Oct 2021 19:30:55 +0000 (15:30 -0400)]

AMDGPU: Optimize out implicit kernarg argument allocation if unused

We already annotate whether llvm.amdgcn.implicitarg.ptr is known to be
unused. Start using it to avoid allocating the implicit arguments if
unneeded.

commit | commitdiff | tree

Kristina Bessonova [Sat, 4 Dec 2021 15:12:47 +0000 (17:12 +0200)]

[DwarfDebug] Support emitting function-local declaration for a lexical block

This is another attempt to make function-local declarations
(like static variables, structs/classes and other) be correctly
emitted within a lexical (bracketed) block.

Fixes https://bugs.llvm.org/show_bug.cgi?id=19238.

Differential Revision: https://reviews.llvm.org/D113741

commit | commitdiff | tree

Hugo Pompougnac [Sat, 4 Dec 2021 01:42:23 +0000 (07:12 +0530)]

Apply the permutation map on each affine nest

When using -test-loop-permutation="permutation-map=...", applies the
permutation map on each affine nest in the function (and not only the
first one). If the size of the permutation map and the size of a nest
are not consistent, do nothing on this particular nest (instead of
making MLIR crash).

Differential Revision: https://reviews.llvm.org/D112947

commit | commitdiff | tree

Kristina Bessonova [Sat, 4 Dec 2021 12:08:10 +0000 (14:08 +0200)]

[DwarfDebug] Move emission of global vars, types and imports to endModule()

This patch proposes to move emission of global variables, types,
imported entities, etc from DwarfDebug::beginModule() to DwarfDebug::endModule().
Effectively, this changes nothing but the order of debug entities which
will be as follows:
* subprograms (including related context, local variables/labels,
  local imported entities; related types can be created as a part of
  the emission of local entities of an abstract subprogram);
* global variables (including related context and types);
* retained types and enums;
* non-local-scoped imported entities;
* basic types;
* other types left (as a part of local variables attributes emission).

Note that the order of emitted compile units may also be changed as now we emit
units that contain subprograms first and then all other non-empty units.

The motivation behind this change is the following:
(1) DwarfDebug::beginModule() is run at the very beginning of backend's pipeline,
    from this time IR can be significantly changed by target-specific passes.
    If it happens for debug metadata of global entities, those changes will not
    be reflected in the emitted DWARF.
(2) imported subprogram names should refer to an abstract subprogram if it exists,
    but it isn't known in DwarfDebug::beginModule() (it's possible to make some
    guesses based on location info, but it's not quite reliable);
(3) aforementioned entities if they are scoped within a bracketed block
    (subject of D113741) couldn't be emitted in DwarfDebug::beginModule()
    (they need parent emitted first). Another problem is if to try to gather
    some information about local entities and defer their emission
    (till subprogram's processing or DwarfDebug::endModule()) all the gathered
    details might be irrelevant / invalid by the time the entities are being
    emitted (because of (1)).

Reviewed By: dblaikie

Differential Revision: https://reviews.llvm.org/D114705

commit | commitdiff | tree

Dmitry Vyukov [Sat, 4 Dec 2021 11:23:37 +0000 (12:23 +0100)]

tsan: disable dlopen_static_tls.cpp test on aarch64

Fails on bots: https://lab.llvm.org/buildbot#builders/184/builds/1580

Differential Revision: https://reviews.llvm.org/D115095

commit | commitdiff | tree

Anton Afanasyev [Mon, 1 Nov 2021 13:48:52 +0000 (16:48 +0300)]

[Passes] Move AggressiveInstCombine after InstCombine

Swap AIC and IC neighbouring in pipeline. This looks more natural and even
almost has no effect for now (three slightly touched tests of test-suite). Also
this could be the first step towards merging AIC (or its part) to -O2 pipeline.

After several changes in AIC (like D108091, D108201, D107766, D109515, D109236)
there've been observed several regressions (like PR52078, PR52253, PR52289)
that were fixed in different passes (see D111330, D112721) by extending their
functionality, but these regressions were exposed since changed AIC prevents IC
from making some of early optimizations.

This is common problem and it should be fixed by just moving AIC after IC
which looks more logically by itself: make aggressive instruction combining
only after failed ordinary one.

Fixes PR52289

Reviewed By: spatel, RKSimon

Differential Revision: https://reviews.llvm.org/D113179

commit | commitdiff | tree

Jay Foad [Thu, 2 Dec 2021 12:26:59 +0000 (12:26 +0000)]

[AMDGPU] Change llvm.amdgcn.image.bvh.intersect.ray to take vec3 args

The ray_origin, ray_dir and ray_inv_dir arguments should all be vec3 to
match how the hardware instruction works.

Don't change the API of the corresponding OpenCL builtins.

Differential Revision: https://reviews.llvm.org/D115032

commit | commitdiff | tree

Jay Foad [Thu, 2 Dec 2021 12:25:00 +0000 (12:25 +0000)]

[IR,TableGen] Add support for vec3 intrinsic arguments

Add generic support for vec3 types, and in particular define
llvm_v3f32_ty which will be used by AMDGPU's
llvm.amdgcn.image.bvh.intersect.ray intrinsic.

Differential Revision: https://reviews.llvm.org/D114956

commit | commitdiff | tree

Jay Foad [Thu, 2 Dec 2021 13:17:14 +0000 (13:17 +0000)]

[AMDGPU] Generate checks for llvm.amdgcn.image.bvh.intersect.ray

Differential Revision: https://reviews.llvm.org/D114955

commit | commitdiff | tree

Nikita Popov [Fri, 3 Dec 2021 22:26:54 +0000 (23:26 +0100)]

[PhaseOrdering] Add test for incorrect merge function scheduling

Add an -enable-merge-functions option to allow testing of function
merging as it will actually happen in the optimization pipeline.
Based on that add a test where we currently produce two identical
functions without merging them due to incorrect pass scheduling
under the new pass manager.

commit | commitdiff | tree

Carlos Galvez [Sat, 4 Dec 2021 08:36:50 +0000 (08:36 +0000)]

[clang-tidy][NFC] Move CachedGlobList to GlobList.h

Currently it's hidden inside ClangTidyDiagnosticConsumer,
so it's hard to know it exists.

Given that there are multiple uses of globs in clang-tidy,
it makes sense to have these classes publicly available
for other use cases that might benefit from it.

Also, add unit test by converting the existing tests
for GlobList into typed tests.

Reviewed By: salman-javed-nz

Differential Revision: https://reviews.llvm.org/D113422

commit | commitdiff | tree

Anton Afanasyev [Sat, 4 Dec 2021 07:58:16 +0000 (10:58 +0300)]

[Test][PhaseOrdering] Precommit test for PR52289

commit | commitdiff | tree

Vitaly Buka [Tue, 23 Nov 2021 05:23:46 +0000 (21:23 -0800)]

[sanitizer] Hook up LZW into stack store

Depends on D114503.

Reviewed By: morehouse

Differential Revision: https://reviews.llvm.org/D114924

commit | commitdiff | tree

Kazu Hirata [Sat, 4 Dec 2021 04:45:59 +0000 (20:45 -0800)]

[CodeGen] Use range-based for loops (NFC)

commit | commitdiff | tree

Tee KOBAYASHI [Sat, 4 Dec 2021 04:23:09 +0000 (23:23 -0500)]

[Sparc] Create an error when `__builtin_longjmp` is used

Support for builtin setjmp/longjmp was removed by https://reviews.llvm.org/D51487. An
error should be created when compiling C code using __builtin_setjmp or __builtin_longjmp.

Reviewed By: dcederman

Differential Revision: https://reviews.llvm.org/D108901

commit | commitdiff | tree

Chia-hung Duan [Sat, 4 Dec 2021 04:35:24 +0000 (04:35 +0000)]

[mlir] Support collecting logs from notifyMatchFailure().

Let the user registers their own handler to processing the matching
failure information.

Reviewed By: rriddle

Differential Revision: https://reviews.llvm.org/D110896

commit | commitdiff | tree

Mehdi Amini [Sat, 4 Dec 2021 04:23:21 +0000 (04:23 +0000)]

Use LLVM_ATTRIBUTE_UNUSED to silent warning for static function used in assert only (NFC)

commit | commitdiff | tree

Mehdi Amini [Fri, 3 Dec 2021 21:51:55 +0000 (21:51 +0000)]

Split the locking of the queue and the threads vector in the ThreadPool implementation

This allows to release the QueueLock early and create Thread
independently of the queue processing.

Differential Revision: https://reviews.llvm.org/D115078

commit | commitdiff | tree

Matthias Springer [Sat, 4 Dec 2021 02:49:07 +0000 (11:49 +0900)]

[mlir][linalg][bufferize] Implement equivalence analysis

Instead of checking buffer equivalence during bufferization, gather buffer equivalence information right after the analysis. This is in preparation of decoupling bufferization from BufferizationAliasInfo.

This change also fixes equivalence analysis for scf.if op results, which was not fully implemented. scf.if op results are equivalent to their corresponding yield values if both yield values are equivalent.

Differential Revision: https://reviews.llvm.org/D114774

commit | commitdiff | tree

Mehdi Amini [Sat, 4 Dec 2021 02:19:53 +0000 (02:19 +0000)]

Fix build for ThreadPool when using -DLLVM_ENABLE_THREADS=OFF

Differential Revision: https://reviews.llvm.org/D115019

Domain: System / Toolchain;

RSS Atom