review.tizen.org Git - platform/upstream/llvm.git/log

projects / platform / upstream / llvm.git / log

Jay Foad [Wed, 7 Sep 2022 08:56:55 +0000 (09:56 +0100)]

[AMDGPU] Add a comment for a missing fold

Nikita Popov [Wed, 7 Sep 2022 08:46:58 +0000 (10:46 +0200)]

[Reassociate] Avoid ConstantExpr::getFNeg() (NFCI)

Use ConstantFoldUnaryOpOperand() instead. Also make the code below
robust against non-instruction users, just in case it doesn't fold.

commit | commitdiff | tree

Alexander Belyaev [Wed, 7 Sep 2022 08:10:48 +0000 (10:10 +0200)]

[mlir] Remove `materializeOpFoldResult` functions.

We can use `getValueOrCreateConstantIndexOp` instead.

Differential Revision: https://reviews.llvm.org/D133403

commit | commitdiff | tree

chenglin.bi [Wed, 7 Sep 2022 08:07:26 +0000 (16:07 +0800)]

[AArch64] add tests for non-power2 int types; NFC

commit | commitdiff | tree

serge-sans-paille [Tue, 6 Sep 2022 12:22:40 +0000 (14:22 +0200)]

[clang][doc] Do not keep a copy of ClangCommandLineReference.rst in tree

This file is auto-generated, it's a bit confusing for the maintainers to have it
in tree while it shouldn't be modified *and* already have a rule to be
generated.

Differential Revision: https://reviews.llvm.org/D133349

commit | commitdiff | tree

Uday Bondhugula [Wed, 7 Sep 2022 04:39:47 +0000 (10:09 +0530)]

[MLIR] NFC. Introduce mlir::hasEffect and refactor usages dialect util

Introduce mlir::hasEffect and refactor existing usage to use utility.
NFC.

Reviewed By: rriddle, mehdi_amini

Differential Revision: https://reviews.llvm.org/D132117

commit | commitdiff | tree

Zi Xuan Wu (Zeson) [Wed, 7 Sep 2022 06:40:01 +0000 (14:40 +0800)]

[CSKY] Fix the compiling error about missing Log2 function with Log2_64

commit | commitdiff | tree

Xiang1 Zhang [Wed, 31 Aug 2022 03:16:09 +0000 (11:16 +0800)]

[X86][NFC] Refine load/store reg to StackSlot for extensibility

Reviewed By: LuoYuanke

Differential Revision: https://reviews.llvm.org/D133078

commit | commitdiff | tree

Jan Ole Hüser [Wed, 7 Sep 2022 06:17:10 +0000 (09:17 +0300)]

[LLD][COFF] Fix writing a map file when range extension thunks are inserted

Bug: An assertion fails:

    Assertion failed: isa<To>(Val) && "cast<Ty>() argument of incompatible type!",
    file C:\Users\<user>\prog\llvm\llvm-git-lld-bug\llvm\include\llvm/Support/Casting.h, line 578

Bug is triggered, if

    - a map file is requested with /MAP, and
    - Architecture is ARMv7, Thumb, and
    - a relative jump (branch instruction) is greater than 16 MiB (2^24)

The reason for the Bug is:

    - a Thunk is created for the jump
    - a Symbol for the Thunk is created
        - of type `DefinedSynthetic`
        - in file `Writer.cpp`
        - in function `getThunk`
    - the Symbol has no name
    - when creating the map file, the name of the Symbol is queried
    - the function `Symbol::computeName` of the base class `Symbol`
      casts the `this` pointer to type `DefinedCOFF` (a derived type),
      but the acutal type is `DefinedSynthetic`
    - The in the llvm::cast an assertion fails

Changes:

- Modify regression test to trigger this bug
- Give the symbol pointing to the thunk a name, to fix the bug
- Add assertion, that only DefinedCOFF symbols are allowed to have an
  empty name, when the constructor of the base class Symbol is executed

Reviewed By: rnk

Differential Revision: https://reviews.llvm.org/D133201

commit | commitdiff | tree

Michał Górny [Tue, 6 Sep 2022 13:40:10 +0000 (15:40 +0200)]

[compiler-rt] [test] Handle missing ld.gold gracefully

Fix the is_binutils_lto_supported() function to handle missing
executables gracefully.  Currently, the function does not catch
exceptions from subprocess.Popen() and therefore causes lit to crash
if config.gold_executable does not specify a valid executable:

```
lit: /usr/lib/python3.11/site-packages/lit/TestingConfig.py:136: fatal: unable to parse config file '/tmp/portage/sys-libs/compiler-rt-
15.0.0/work/compiler-rt/test/lit.common.cfg.py', traceback: Traceback (most recent call last):
  File "/usr/lib/python3.11/site-packages/lit/TestingConfig.py", line 125, in load_from_path
    exec(compile(data, path, 'exec'), cfg_globals, None)
  File "/tmp/portage/sys-libs/compiler-rt-15.0.0/work/compiler-rt/test/lit.common.cfg.py", line 561, in <module>
    if is_binutils_lto_supported():
       ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/tmp/portage/sys-libs/compiler-rt-15.0.0/work/compiler-rt/test/lit.common.cfg.py", line 543, in is_binutils_lto_supported
    ld_cmd = subprocess.Popen([exe, '--help'], stdout=subprocess.PIPE, env={'LANG': 'C'})
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.11/subprocess.py", line 1022, in __init__
    self._execute_child(args, executable, preexec_fn, close_fds,
  File "/usr/lib/python3.11/subprocess.py", line 1899, in _execute_child
    raise child_exception_type(errno_num, err_msg, err_filename)
FileNotFoundError: [Errno 2] No such file or directory: 'GOLD_EXECUTABLE-NOTFOUND'
```

Differential Revision: https://reviews.llvm.org/D133358

commit | commitdiff | tree

Vitaly Buka [Wed, 7 Sep 2022 03:55:39 +0000 (20:55 -0700)]

[NFC][sancov] Rename ModuleSanitizerCoveragePass

commit | commitdiff | tree

Vitaly Buka [Tue, 6 Sep 2022 05:30:23 +0000 (22:30 -0700)]

[NFC][msan] Rename ModuleMemorySanitizerPass

commit | commitdiff | tree

jacquesguan [Tue, 6 Sep 2022 09:11:20 +0000 (17:11 +0800)]

[mlir][Math] Add constant folder for RoundEvenOp.

This patch uses roundeven/roundevenf of libm to fold RoundEvenOp of constant.

Differential Revision: https://reviews.llvm.org/D133344

commit | commitdiff | tree

Sinan Lin [Mon, 5 Sep 2022 14:00:00 +0000 (22:00 +0800)]

[cmake] do not set execution permission to regular files.

some regular files(e.g. files have no shebang and no execute
bit in source dir) are wrongly assigned an execution permission,
such as scanview.css and ear.c from libscanbuild, which is
unnecessary and introduces warnings in some envs.

Reviewed By: MaskRay, phosek

Differential Revision: https://reviews.llvm.org/D133308

commit | commitdiff | tree

jacquesguan [Fri, 5 Aug 2022 03:03:20 +0000 (11:03 +0800)]

[mlir][Math] Add constant folder for CosOp.

This patch adds constant folder for CosOp which only supports single and double precision floating-point.

Differential Revision: https://reviews.llvm.org/D131233

commit | commitdiff | tree

Chuanqi Xu [Wed, 7 Sep 2022 02:35:54 +0000 (10:35 +0800)]

[NFC] [Frontend] Correct the use of 'auto' in SemaCoroutine and CGCoroutine

We should only use 'auto' in case we can know the type from the right
hand side of the expression. Also we need keep '*' around if the type is
a pointer actually. Few uses of 'auto' in SemaCoroutine.cpp and
CGCoroutine.cpp violates the rule. This commit tries to fix it.

commit | commitdiff | tree

Xiang1 Zhang [Tue, 30 Aug 2022 05:58:35 +0000 (13:58 +0800)]

[CodeGen] Limit building time in CodeGenPrepare for huge function

Details:

Currently CodeGenPrepare is very time consuming in handling big functions.

Old Algorithm :
It iterate each BB in function, and go on handle very instructions in BB.
Due to some instruction optimizations may affect the BBs' dominate tree.
The old logic will re-iterate and try optimize for each BB.

Suppose we have a big function with 20000 BBs, If we handled the last BB
with fine tuning the dominate tree. We need totally re-iterate and try optimize
the 20000 BBs from the beginning.

The Complex is near N!

And we really encounter somes big tests (> 20000 BBs) that cost more than 30
mins in this pass. (Debug version compiler will cost 2 hours here)

What this patch do for huge function ?
It mainly changes the iteration way for optimization.

1 We do optimizeBlock for each BB (that is same with old way).
And, in the meaning time, If BB is changed/updated in the optimization, it will
be put into FreshBBs (try do optimizeBlock again).
The new created BB at previous iteration will also put into FreshBBs.

2 For the BBs which not updated at previous iteration, we directly skip it.
Strictly speaking, here may miss some opportunity, but the probability is very
small.

3 For Instructions in single BB, we do optimizeInst for each instruction.
If optimizeInst change the instruction dominator in this BB, rather than break
and go back to optimize the first BB (the old way), we directly iterate
instructions (to do optimizeInst) in this updated BB again (the new way).

What this patch do for small/normal (not huge) function ?
It is same with the Old Algorithm. (NFC)

Reviewed By: LuoYuanke

Differential Revision: https://reviews.llvm.org/D129352

commit | commitdiff | tree

Jordan Rupprecht [Wed, 7 Sep 2022 01:59:03 +0000 (18:59 -0700)]

[test] Fix typo in setting the wrong Context test field

This manifests as an msan error because we check the value of `es`, but it's actually uninitialized because we mistakenly set `ss` twice.

commit | commitdiff | tree

Joseph Huber [Wed, 7 Sep 2022 01:31:27 +0000 (20:31 -0500)]

[CUDA] Actually fix the test correctly this time

commit | commitdiff | tree

Joseph Huber [Wed, 7 Sep 2022 01:14:20 +0000 (20:14 -0500)]

[CUDA] Fix test failing when using the new driver

Summary:
Previously the new driver crashed when using `-fsyntax-only` which
required a work-around in one of the test files. This was not properly
updated when it was fixed for the new driver. This patch fixes the test
and also adjusts a missing boolean check.

commit | commitdiff | tree

Joseph Huber [Thu, 1 Sep 2022 22:04:49 +0000 (17:04 -0500)]

[Clang] Fix the new driver crashing when using '-fsyntax-only'

The new driver currently crashses when attempting to use the
'-fsyntax-only' option. This is because the option causes all output to
be given the `TY_Nothing' type which should signal the end of the
pipeline. The new driver was not treating this correctly and attempting
to use empty input. This patch fixes the handling so we do not attempt
to continue when the input is nothing.

One concession is that we must now check when generating the arguments
for Clang if the input is of 'TY_Nothing'. This is because the new
driver will only create code if the device code is a dependency on the
host, creating the output without the dependency would require a
complete rewrite of the logic as we do not maintain any state between
calls to 'BuildOffloadingActions' so I believe this is the most
straightforward method.

Reviewed By: tra

Differential Revision: https://reviews.llvm.org/D133161

commit | commitdiff | tree

natashaknk [Wed, 7 Sep 2022 00:13:12 +0000 (17:13 -0700)]

[mlir][tosa] Support non-batch dynamic dims for tosa.rescale to linalg

Reviewed By: rsuderman

Differential Revision: https://reviews.llvm.org/D133383

commit | commitdiff | tree

Richard Smith [Wed, 7 Sep 2022 00:10:55 +0000 (17:10 -0700)]

Enforce module decl-use restrictions and private header restrictions in textual headers

Per the documentation, these restrictions were intended to apply to textual headers but previously this didn't work because we decided there was no requesting module when the `#include` was in a textual header.

A `-cc1` flag is provided to restore the old behavior for transitionary purposes.

Reviewed By: aaron.ballman

Differential Revision: https://reviews.llvm.org/D132779

commit | commitdiff | tree

Florian Mayer [Tue, 6 Sep 2022 22:18:28 +0000 (15:18 -0700)]

[HWASan] Show memory rather than tag addresses in tag dump

Reviewed By: eugenis

Differential Revision: https://reviews.llvm.org/D133380

commit | commitdiff | tree

Fangrui Song [Tue, 6 Sep 2022 23:48:16 +0000 (16:48 -0700)]

[test] Remove PowerPC/aix-xcoff-exported-nondefault.ll

This is not asserted by IR verifier.

commit | commitdiff | tree

Craig Topper [Tue, 6 Sep 2022 23:03:45 +0000 (16:03 -0700)]

[RISCV] Use llvm::none_of to replace a loop. NFC

commit | commitdiff | tree

Ruobing Han [Sun, 4 Sep 2022 16:56:25 +0000 (12:56 -0400)]

[SimpleLoopUnswitch] Skip non-trivial unswitching of cold functions

In the current main branch, all cold loops will not be applied non-trivial unswitch. As reported in D129599, skipping these cold loops will incur regression in SPEC benchmark.
Thus, instead of skipping cold loops, now only skipping loops in cold functions.

Reviewed By: alexgatea, aeubanks

Differential Revision: https://reviews.llvm.org/D133275

commit | commitdiff | tree

River Riddle [Fri, 2 Sep 2022 02:31:31 +0000 (19:31 -0700)]

[mlir] Flip default value of emitAccessorPrefix to kEmitAccessorPrefix_Prefixed

Most dialects have already flipped to prefixed, and the intention to switch
has been telegraphed for a while.

Differential Revision: https://reviews.llvm.org/D133179

commit | commitdiff | tree

Vitaly Buka [Tue, 6 Sep 2022 22:42:13 +0000 (15:42 -0700)]

[test][hwasan] Update test for D132622

commit | commitdiff | tree

Vitaly Buka [Tue, 6 Sep 2022 03:31:43 +0000 (20:31 -0700)]

[pipelines] OptimizerEarlyEPCallbacks for ThinLTO prelink

Similar to OptimizerLastEPCallbacks workaround
added D96320.

Probably NFC as-is, I don't see anything hooked with this callbacks yet,
but I we are looking to move sanitizers.

Reviewed By: aeubanks, MaskRay

Differential Revision: https://reviews.llvm.org/D133333

commit | commitdiff | tree

Anastasia Stulova [Tue, 6 Sep 2022 22:07:29 +0000 (15:07 -0700)]

[mlir][tosa] Fix dynamic shape inference in conv2d

The comment in the code correctly states the equation for the shape inference as follows:

```
H = ((IH+pad_top+pad_bottom-(dilation_y*(KH-1)+1))/stride_y)+1
```

However the final operation is generated as `-` instead of `+`. I believe `+`
is indeed correct. For example if we have an image with dimension 6 and kernel
of dimension 3 (assuming padding is 0 and stride and dilation are both 1) we
are expecting 4 elements in the output (computed for image elements `(0, 1, 2)
x kernel`, `(1, 2, 3) x kernel`, `(2, 3, 4) x kernel` and `(3, 4, 5) x kernel`.
However currently only 2 elements are produced in the output.

Reviewed By: NatashaKnk

Differential Revision: https://reviews.llvm.org/D133208

commit | commitdiff | tree

Peiming Liu [Tue, 6 Sep 2022 22:25:41 +0000 (22:25 +0000)]

[mlir][tensor] Fix a typo in the example code for UnrankedTensorType

The syntax for unrank tensor type is defined as tensor-type ::= `tensor` `<` `*` `x` type `>`, the example code missed the `x` in between.

Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D133381

commit | commitdiff | tree

Fangrui Song [Tue, 6 Sep 2022 22:28:29 +0000 (15:28 -0700)]

[Verifier] Allow dllexport protected after D133267

I have noticed that this combo makes sense (D133266) but rejected it for
simplicity. It turns out to be used by PlayStation, so let's allow it.

commit | commitdiff | tree

Vitaly Buka [Tue, 6 Sep 2022 05:37:45 +0000 (22:37 -0700)]

[NFC][asan] Rename ModuleAddressSanitizerPass

commit | commitdiff | tree

Vitaly Buka [Tue, 6 Sep 2022 04:28:02 +0000 (21:28 -0700)]

[msan] Convert Msan to ModulePass

MemorySanitizerPass function pass violatied requirement 4 of function
pass to do not insert globals. Msan nees to insert globals for origin
tracking, and paramereters tracking.

https://llvm.org/docs/WritingAnLLVMPass.html#the-functionpass-class

Reviewed By: kstoimenov, fmayer

Differential Revision: https://reviews.llvm.org/D133336

commit | commitdiff | tree

River Riddle [Sat, 3 Sep 2022 01:52:43 +0000 (18:52 -0700)]

[mlir:vscode] Add support for loading big bytecode files

VSCode doesn't let our extension manage files >50mb. This commit
adds a proper diagnostic in this case, and also gives the user an option
to open as a temporary .mlir file instead.

Differential Revision: https://reviews.llvm.org/D133242

commit | commitdiff | tree

Arthur Eubanks [Tue, 6 Sep 2022 21:35:14 +0000 (14:35 -0700)]

[ThinLTOBitcodeWriter] Mark pass as required

Or else with -opt-bisect-limit we don't write ThinLTO bitcode.

Reviewed By: asbirlea

Differential Revision: https://reviews.llvm.org/D133378

commit | commitdiff | tree

Craig Topper [Tue, 6 Sep 2022 21:45:17 +0000 (14:45 -0700)]

[RISCV] Add '32bit' feature to rv32 only builtins.

The backend now has a 32bit feature as part of the recent mtune
patch. We can now use that make our rv32-only builtin error checking
work the same way as rv64-only errors.

Reviewed By: kito-cheng

Differential Revision: https://reviews.llvm.org/D132192

commit | commitdiff | tree

River Riddle [Sat, 3 Sep 2022 04:23:47 +0000 (21:23 -0700)]

[mlir] Allow passing AsmState when printing Attributes and Types

This allows for extracting assembly information when printing an attribute
or type, such as the dialect resources referenced. This functionality is used in
a followup that adds resource support to the bytecode. This change also results
in a nice cleanup of AsmPrinter now that we don't need to awkwardly workaround
optional AsmStates.

Differential Revision: https://reviews.llvm.org/D132728

commit | commitdiff | tree

Krzysztof Drewniak [Tue, 6 Sep 2022 20:25:20 +0000 (20:25 +0000)]

[mlir] Improve BitEnumAttr, update documentation

- Add new operators to BitEnumAttr, mainly not (which only inverts
bits that can be valid bits for the attribute) and xor
- Add new bit enum utility functions: bitEnumClear(bits, bit) and
bitEnumSet(bits, bit, value=true) as they've come up in code I've been
writing that makes use of such enums
- Add rudimentary tests for the enum generator
- Update the OpDefinition documentation to make it contain a correct
example and to have it account for the changes mentioned above.

Reviewed By: antiagainst

Differential Revision: https://reviews.llvm.org/D133374

commit | commitdiff | tree

Simon Pilgrim [Tue, 6 Sep 2022 21:05:27 +0000 (22:05 +0100)]

[CostModel][X86] getIntrinsicInstrCost - convert to CostKindTblEntry

Begin the refactoring to use CostKindTblEntry and return real latency/codesize/sizelatency costs instead of reusing the throughput numbers

This should allow us to merge getTypeBasedIntrinsicInstrCost into getIntrinsicInstrCost and remove all remaining references

commit | commitdiff | tree

Peiming Liu [Tue, 6 Sep 2022 18:13:55 +0000 (18:13 +0000)]

[mlir][sparse] Add lowering rules for sparse_tensor.storage Op

Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D133368

commit | commitdiff | tree

bzcheeseman [Fri, 2 Sep 2022 17:23:43 +0000 (10:23 -0700)]

[LLVM][Support/ADT] Add assert for isPresent to dyn_cast.

This change adds an assert to dyn_cast that the value passed-in is present. In the past, this relied on the isa_impl assertion (which still works in many cases) but which we can tighten up for a better QoI.

The PointerUnion change is because it seems like (based on the call sites) the semantics of the member dyn_cast are actually dyn_cast_if_present.

Reviewed By: rriddle

Differential Revision: https://reviews.llvm.org/D133221

commit | commitdiff | tree

Florian Mayer [Wed, 24 Aug 2022 23:40:26 +0000 (16:40 -0700)]

[HWASan] rename left/right to before/after.

left/right is a weird way to refer to address ordering.

Reviewed By: eugenis, vitalybuka

Differential Revision: https://reviews.llvm.org/D132622

commit | commitdiff | tree

Aaron Ballman [Tue, 6 Sep 2022 20:47:40 +0000 (16:47 -0400)]

Fix the Clang sphinx build bot; NFC

This should address the failure introduced in:
https://lab.llvm.org/buildbot/#/builders/92/builds/32377

commit | commitdiff | tree

raghavmedicherla [Tue, 6 Sep 2022 20:41:00 +0000 (16:41 -0400)]

[AMDGPU/Metadata] Rename HSAMD::MetadataStreamer classes

Renamed all HSAMD::MetadataStreamer classes to improve readability of the code.

Differential Revision: https://reviews.llvm.org/D133156

commit | commitdiff | tree

raghavmedicherla [Tue, 6 Sep 2022 18:45:25 +0000 (14:45 -0400)]

Support: Add mapped_file_region::sync(), equivalent to msync

Add mapped_file_region::sync(), equivalent to POSIX msync,
synchronizing written content to disk without unmapping the region.
Asserts if the mode is not mapped_file_region::readwrite.

Note that I don't have access to a Windows machine, so I can't
easily run those unit tests.

Change by dexonsmith

Differential Revision: https://reviews.llvm.org/D95494

commit | commitdiff | tree

Vitaly Buka [Tue, 6 Sep 2022 20:42:57 +0000 (13:42 -0700)]

[test][libcxx] Mark ubsan test as UNSUPPORTED

It inconsistently fails on bots.

commit | commitdiff | tree

Florian Mayer [Tue, 6 Sep 2022 20:37:35 +0000 (13:37 -0700)]

[ASan] Fix libfuzzer test

commit | commitdiff | tree

Philip Reames [Tue, 6 Sep 2022 20:35:39 +0000 (13:35 -0700)]

[docs][RISCV] Document status of Zicpo(m,p,z) extensions

Unless I missed something when checking, we only support these in assembly. Given that, documenting them is a good opportunity to define support levels which will be useful for other extensions as well.

Differential Revision: https://reviews.llvm.org/D133372

commit | commitdiff | tree

Alexander Shaposhnikov [Tue, 6 Sep 2022 20:28:17 +0000 (20:28 +0000)]

[AArch64] Increase AddedComplexity of BIC

This diff adjusts AddedComplexity of BIC to bump its position
in the list of patterns to make LLVM pick it instead of MVN + AND.
MVN + AND requires 2 cycles, so does e.g. MOV + BIC, but the latter
outperforms the former if the instructions producing the operands of
BIC can be issued in parallel.

One may consider the following example:

ldur x15, [x0, #2] # 4 cycles
mvn x10, x15 # 1 cycle (depends on ldur)
and x9, x10, #0x8080808080808080

vs.

ldur x15, [x0, #2] # 4 cycles
mov x9, #0x8080808080808080 # 1 cycle (can be executed in parallel with ldur)
bic x9, x9, x15. # 1 cycle

Test plan: ninja check-all

Differential revision: https://reviews.llvm.org/D133345

commit | commitdiff | tree

Joseph Huber [Tue, 6 Sep 2022 20:25:32 +0000 (15:25 -0500)]

[Libomptarget][NFC] Clean up CUDA plugin and address warnings

commit | commitdiff | tree

Simon Pilgrim [Tue, 6 Sep 2022 20:27:46 +0000 (21:27 +0100)]

[CostModel][X86] Fix check prefixes on cttz cost kinds tests

Add missing SSE2 + BMI/NOBMI coverage

commit | commitdiff | tree

Florian Mayer [Wed, 24 Aug 2022 21:30:14 +0000 (14:30 -0700)]

[ASan] rename left/right to before/after.

left/right is a weird way to refer to address ordering.

Reviewed By: eugenis, MaskRay

Differential Revision: https://reviews.llvm.org/D132611

commit | commitdiff | tree

Jim Ingham [Tue, 6 Sep 2022 20:24:21 +0000 (13:24 -0700)]

Fix another place were we suggest lldb.target in a Frame Recognizer.

commit | commitdiff | tree

Jim Ingham [Tue, 6 Sep 2022 20:20:03 +0000 (13:20 -0700)]

You can't use lldb.target in a frame recognizer.

commit | commitdiff | tree

Alexander Shaposhnikov [Tue, 6 Sep 2022 20:19:47 +0000 (20:19 +0000)]

[clang-tidy] Skip union-like classes in use-equals-default

Skip unions/union-like classes since in this case constructors
with empty bodies behave differently in comparison with regular
structs/classes.

Test plan: ninja check-clang-tools

Differential revision: https://reviews.llvm.org/D132713

commit | commitdiff | tree

Vitaly Buka [Sun, 4 Sep 2022 05:24:33 +0000 (22:24 -0700)]

[tsan] Replace mem intrinsics with calls to interceptors

After https://reviews.llvm.org/rG463aa814182a23 tsan replaces llvm
intrinsics with calls to glibc functions. However this approach is
fragile, as slight changes in pipeline can return llvm intrinsics back.
In particular InstCombine can do that.

Msan/Asan already declare own version of these memory
functions for the similar purpose.

KCSAN, or anything that uses something else than compiler-rt, needs to
implement this callbacks.

Reviewed By: melver

Differential Revision: https://reviews.llvm.org/D133268

commit | commitdiff | tree

Vitaly Buka [Tue, 6 Sep 2022 20:00:57 +0000 (13:00 -0700)]

[tsan][go] Fix buildgo.sh when GOAMD64 undefined

commit | commitdiff | tree

Chenguang Wang [Tue, 6 Sep 2022 19:28:23 +0000 (21:28 +0200)]

[mlir] Fix DenseElementsAttr::mapValues(i1, splat).

Splat of bool is encoded as a byte with all-ones in it [1]. Without this
change, this piece of code:

    auto xs = builder.getI32TensorAttr({42, 42, 42, 42});
    auto xs2 = xs.mapValues(builder.getI1Type(), [](const llvm::APInt &x) {
      return x.isZero() ? llvm::APInt::getZero(1) : llvm::APInt::getAllOnes(1);
    });
    xs2.dump();

Prints:

    dense<[true, false, false, false]> : tensor<4xi1>

Because only the first bit is set. This applies to both
DenseIntElementsAttr::mapValues() and DenseFPElementsAttr::mapValues().

[1]: https://github.com/llvm/llvm-project/blob/e877b42e2c70813352c1963ea33e992f481d5cba/mlir/lib/IR/BuiltinAttributes.cpp#L984

Reviewed By: rriddle

Differential Revision: https://reviews.llvm.org/D132767

commit | commitdiff | tree

Philip Reames [Tue, 6 Sep 2022 19:06:53 +0000 (12:06 -0700)]

[docs] Add a RISC-V Usage page

This follows the pattern set by e.g. SPIR-V, AMDGPU, and WebAssembly.

For the moment, this page is fairly minimal stub. I mostly want a place to record which extensions we support, and when that support is not complete, a central place to note that. I'm deliberately starting with a subset of extensions which are pretty straight forward with only one case needing a detail note.

I figure the partial support cases are going to need a bunch of discussion, and having each of them in their own review seemed worthwhile. Similarly, the experimental extensions probably fall into the same bucket of triggering a bunch of review discussion.

Differential Revision: https://reviews.llvm.org/D133063

commit | commitdiff | tree

Roy Jacobson [Tue, 6 Sep 2022 18:59:27 +0000 (21:59 +0300)]

[DOC][Clang] Update cxx_status.html after the 15 release and fix a broken link in release notes

commit | commitdiff | tree

Joseph Huber [Tue, 6 Sep 2022 18:56:24 +0000 (13:56 -0500)]

[OpenMP] Fix logic error when building offloading applications

Summary:
A previous patch removed support for the `-fopenmp-new-driver` and
accidentally used the `isHostOffloading` flag instead of
`isDeviceOffloading` which lead to some build errors when compiling for
the offloading device. This patch addresses that.

commit | commitdiff | tree

Joseph Huber [Tue, 6 Sep 2022 17:51:43 +0000 (12:51 -0500)]

[OpenMP] Remove use of removed '-f[no-]openmp-new-driver' flag

The changes in D130020 removed all support for the old method of
compiling OpenMP offloading programs. This means that
`-fopenmp-new-driver` has no effect and `-fno-openmp-new-driver` does
not work. This patch removes the use and documentation of this flag.
Note that the `--offload-new-driver` flag still exists for using the new
driver optionally with CUDA and HIP.

Reviewed By: tra

Differential Revision: https://reviews.llvm.org/D133367

commit | commitdiff | tree

Markus Böck [Tue, 6 Sep 2022 18:20:40 +0000 (20:20 +0200)]

[MC] Emit Stackmaps before debug info

This patch is essentially an alternative to https://reviews.llvm.org/D75836 and was mentioned by @lhames in a comment.

The gist of the issue is that Mach-O has restrictions on which kind of sections are allowed after debug info has been emitted, which is also properly asserted within LLVM. Problem is that stack maps are currently emitted as one of the last sections in each target-specific AsmPrinter so far, which would cause the assertion to trigger. The current approach of special casing for the `__LLVM_STACKMAPS` section is not viable either, as downstream users can overwrite the stackmap format using plugins, which may want to use different sections.

This patch fixes the issue by emitting the stack map earlier, right before debug info is emitted. The way this is implemented is by taking the choice when to emit the StackMap away from the target AsmPrinter and doing so in the base class. The only disadvantage of this approach is that the `StackMaps` member is now part of the base class, even for targets that do not support them. This is functionaly not a problem however, as emitting an empty `StackMaps` is a no-op.

Differential Revision: https://reviews.llvm.org/D132708

commit | commitdiff | tree

Joseph Huber [Tue, 6 Sep 2022 15:22:25 +0000 (10:22 -0500)]

[OpenMP] Fix `omp_get_wtime` function being marked incorrectly as readonly

OpenMP has a list of of optimistic attributes that can be attached to
known runtime functions to aid some analysis. The `omp_get_wtime`
function incorrectly used the `readonly` attribute. This is not correct
at the `omp_get_wtime` function changes values depending on some
external state. This is more correctly modeled with
`inaccessiblememonly` meaning that the value does not depend on anything
within the module, but can not be removes as it depends on external
state.

Fixes #57578

Reviewed By: tianshilei1992

Differential Revision: https://reviews.llvm.org/D133360

commit | commitdiff | tree

Jakub Kuderski [Tue, 6 Sep 2022 17:48:47 +0000 (13:48 -0400)]

[ADT] Remove is_splat

`is_splat` is superseded by `all_equal` and marked as deprecated.
See the discussion thread for more details:
https://discourse.llvm.org/t/adt-is-splat-and-empty-ranges/64692

Reviewed By: dblaikie

Differential Revision: https://reviews.llvm.org/D132336

commit | commitdiff | tree

Dave Lee [Tue, 6 Sep 2022 17:20:27 +0000 (10:20 -0700)]

[lldb] Fix CommunicationKDP following D133251

Add `m_bytes` and `m_bytes_mutex` to `CommunicationKDP`, following refactoring
in D133251.

Differential Revision: https://reviews.llvm.org/D133365

commit | commitdiff | tree

Krzysztof Drewniak [Thu, 1 Sep 2022 21:46:54 +0000 (21:46 +0000)]

[mlir] Make bit enum operators constexpr

This allows using the | operator on the values of enum attributes
in complie-time constants.

Reviewed By: antiagainst

Differential Revision: https://reviews.llvm.org/D133159

commit | commitdiff | tree

Jakub Kuderski [Tue, 6 Sep 2022 17:12:24 +0000 (13:12 -0400)]

[mlir][spirv] Change vendor op mnemonics to `spv.VENDOR.name`

Make vendor ops more consistent with the naming scheme within the SPIR-V
dialect.

Issue: https://github.com/llvm/llvm-project/issues/56863

Reviewed By: antiagainst

Differential Revision: https://reviews.llvm.org/D133247

commit | commitdiff | tree

Jakub Kuderski [Fri, 2 Sep 2022 18:59:42 +0000 (14:59 -0400)]

[mlir][spirv] Refactor vendor op definitions

Use dedicated vendor op classes/categories. This is so that we can later
change the mnemonics of all vendor ops by changing the base class: `SPV_VendorOp`.

Issue: https://github.com/llvm/llvm-project/issues/56863

commit | commitdiff | tree

Jakub Kuderski [Fri, 2 Sep 2022 18:15:51 +0000 (14:15 -0400)]

[mlir][spirv] Add base classes for vendor ops

This is the first patch in the series to rename vendor ops from
`spv.NameVENDOR` to `spv.VENDOR.Name`. The goal is to make the SPIR-V
dialect more internally consistent.

Issue: https://github.com/llvm/llvm-project/issues/56863

commit | commitdiff | tree

Matheus Izvekov [Thu, 1 Sep 2022 02:17:06 +0000 (04:17 +0200)]

[clang] Implement setting crash_diagnostics_dir through env variable

This implements setting the equivalent of `-fcrash-diagnostics-dir`
through the environment variable `CLANG_CRASH_DIAGNOSTICS_DIR`.
If present, the flag still takes precedence.

This helps integration with test frameworks and pipelines.

With this feature, we change the libcxx bootstrapping build
pipeline to produce clang crash reproducers as artifacts.

Signed-off-by: Matheus Izvekov <mizvekov@gmail.com>
Differential Revision: https://reviews.llvm.org/D133082

commit | commitdiff | tree

Florian Hahn [Tue, 6 Sep 2022 17:20:16 +0000 (18:20 +0100)]

Revert "[SCCP] convert signed div/rem to unsigned for non-negative operands"

This reverts commit fe1f3cfc2669aca387a45c8ce615b45c1db50fc6.

It looks like this commit breaks building llvm-test-suite.

To reproduce, run `opt -passes=ipsccp` on the IR below.

    @g = internal global i32 256, align 4

    define void @test() {
    entry:
      %0 = load i32, ptr @g, align 4
      %div = sdiv i32 %0, undef
      ret void
    }

commit | commitdiff | tree

Guozhi Wei [Tue, 6 Sep 2022 17:18:01 +0000 (17:18 +0000)]

[AArch64] Add an option to reserve physical registers from RA

This patch adds an option --reserve-regs-for-regalloc, so we can reserve a list
of physical registers. These registers will not be used by register allocator,
but can still be used as ABI requests such as passing arguments to function
call.

Its main purpose is simulating high register pressure by reserving many physical
registers. So it will be much easier to test and debug register allocation
changes.

Differential Revision: https://reviews.llvm.org/D132717

commit | commitdiff | tree

Nikolas Klauser [Mon, 5 Sep 2022 22:33:34 +0000 (00:33 +0200)]

[libc++] Avoid instantiating type_trait classes

Use `using` aliases to avoid instantiating lots of types

Reviewed By: ldionne, #libc

Spies: libcxx-commits, miyuki

Differential Revision: https://reviews.llvm.org/D132785

commit | commitdiff | tree

Florian Hahn [Tue, 6 Sep 2022 17:03:37 +0000 (18:03 +0100)]

[ConstraintElimination] Replace pair with named struct (NFC).

This slightly improves the readability and allows further extensions in
follow-ups.

commit | commitdiff | tree

Peiming Liu [Tue, 6 Sep 2022 16:57:09 +0000 (16:57 +0000)]

[mlir][sparse] Refactoring: renaming StorageNewOp to StorageOp

To address comment in https://reviews.llvm.org/D133241

Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D133363

commit | commitdiff | tree

Vitaly Buka [Tue, 6 Sep 2022 16:46:46 +0000 (09:46 -0700)]

Revert "[tsan] Replace mem intrinsics with calls to interceptors"

Breaks
http://45.33.8.238/macm1/43944/step_4.txt
https://lab.llvm.org/buildbot/#/builders/70/builds/26926

This reverts commit 77654a65a373da9c4829de821e7b393ea811ee40.

commit | commitdiff | tree

Aart Bik [Sat, 3 Sep 2022 00:54:17 +0000 (17:54 -0700)]

[mlir][sparse] codegen for sparse alloc

Reviewed By: Peiming

Differential Revision: https://reviews.llvm.org/D133241

commit | commitdiff | tree

Craig Topper [Tue, 6 Sep 2022 16:33:13 +0000 (09:33 -0700)]

[RISCV] Improve vector fround lowering by changing FRM.

This is a follow up to D133238 which did this for ceil/floor.

Reviewed By: arcbbb, frasercrmck

Differential Revision: https://reviews.llvm.org/D133335

commit | commitdiff | tree

Matheus Izvekov [Wed, 31 Aug 2022 23:44:38 +0000 (01:44 +0200)]

[clang] fix profiling of template arguments of template and declaration kind

Template arguments of template and declaration kind were being profiled
only by their canonical properties, which would cause incorrect
uniquing of constrained AutoTypes, leading to a crash in some cases.

This exposed some places in CheckTemplateArgumentList where non-canonical
arguments where being pushed into the resulting converted list.

We also throw in some asserts to catch early and explain the crashes.

Note that the fix for the 'declaration' kind is untestable at this point,
because there should be no cases right now in the AST where we try
to unique a non-canonical converted template argument.

This fixes GH55567.

Signed-off-by: Matheus Izvekov <mizvekov@gmail.com>
Differential Revision: https://reviews.llvm.org/D133072

commit | commitdiff | tree

Simon Pilgrim [Tue, 6 Sep 2022 16:27:15 +0000 (17:27 +0100)]

[CostModel][X86] Add CostKinds handling for ctpop ops

This was achieved with an updated version of the 'cost-tables vs llvm-mca' script D103695 (although it still struggles with avx512 predicate numbers which had to be done manually)

Some of the pre-AVX values still aren't great - atom/slm worst case numbers for ctpop expansion really affect these (especially throughput/latency), so we need to clean them up in a more consistent way - its a pity we don't have models for more older cpus (merom/nehalem etc.) as other examples.

commit | commitdiff | tree

Michael Eisel [Tue, 6 Sep 2022 16:18:23 +0000 (12:18 -0400)]

Add docs for Mach-O lld

I wasn't able to find any docs for Mach-O in `lld/docs`, so here's an attempt at adding basic docs. One of my goals here is to make it easy for users who are unfamiliar with linkers to successfully use lld.

Reviewed By: #lld-macho, int3

Differential Revision: https://reviews.llvm.org/D132893

commit | commitdiff | tree

Sanjay Patel [Tue, 6 Sep 2022 15:55:53 +0000 (11:55 -0400)]

[InstCombine] remove dead code for add (select cond, (sub), 0); NFC

This pattern is handled more generally in SimplifySelectsFeedingBinaryOp().
Tests to confirm that added to the add.ll test file in the previous commit.

commit | commitdiff | tree

Sanjay Patel [Tue, 6 Sep 2022 15:54:16 +0000 (11:54 -0400)]

[InstCombine] add/move tests for add with select operands that simplify; NFC

commit | commitdiff | tree

Sanjay Patel [Tue, 6 Sep 2022 14:19:18 +0000 (10:19 -0400)]

[InstCombine] add tests for add of select with 0 and negate arms; NFC

commit | commitdiff | tree

Doru Bercea [Thu, 1 Sep 2022 16:04:21 +0000 (16:04 +0000)]

Fix OpenMP Opt for target without a parallel region.

Remove ctx redeclaration.

Format code.

Remove parallel check. Modify tests. Clean-up code.

Fix another test.

Move code to helper functions.

Format file.

Minor fixes.

commit | commitdiff | tree

Stella Stamenova [Tue, 6 Sep 2022 15:57:28 +0000 (08:57 -0700)]

Revert "[lldb][bindings] Fix module_access handling of regex"

This reverts commit 75f05fccbbdd91393bdc7b6183b9dd2b1e859f8e.

This commit broke the windows lldb bot: https://lab.llvm.org/buildbot/#/builders/83/builds/23284

commit | commitdiff | tree

Benjamin Kramer [Tue, 6 Sep 2022 15:42:51 +0000 (17:42 +0200)]

[bazel] port 5dbc7cf7cac44

commit | commitdiff | tree

Nico Weber [Tue, 6 Sep 2022 15:39:02 +0000 (11:39 -0400)]

[gn build] port 5dbc7cf7cac44

commit | commitdiff | tree

Matthias Gehre [Tue, 6 Sep 2022 15:32:29 +0000 (16:32 +0100)]

Fix remaining test failures for "[llvm/CodeGen] Enable the ExpandLargeDivRem pass for X86, Arm and AArch64"

commit | commitdiff | tree

Vitaly Buka [Sun, 4 Sep 2022 05:24:33 +0000 (22:24 -0700)]

commit | commitdiff | tree

Matthias Gehre [Tue, 6 Sep 2022 15:18:14 +0000 (16:18 +0100)]

Fix AMDGPU test failures due to "[llvm/CodeGen] Enable the ExpandLargeDivRem pass for X86, Arm and AArch64"

commit | commitdiff | tree

Matthias Gehre [Tue, 6 Sep 2022 14:55:28 +0000 (15:55 +0100)]

Fix "[llvm/CodeGen] Enable the ExpandLargeDivRem pass for X86, Arm and AArch64" compilation on Windows

commit | commitdiff | tree

David Green [Tue, 6 Sep 2022 15:04:28 +0000 (16:04 +0100)]

[AArch64] Additional tests for sinking splats to muls. NFC

commit | commitdiff | tree

Amara Emerson [Tue, 6 Sep 2022 14:17:40 +0000 (15:17 +0100)]

Add parantheses to silence warning.

commit | commitdiff | tree

Marco Elver [Tue, 6 Sep 2022 13:48:58 +0000 (15:48 +0200)]

[SelectionDAG] Properly copy ExtraInfo on RAUW

During SelectionDAG legalization SDNodes with associated extra info may
be replaced with a new SDNode. Preserve associated extra info on
ReplaceAllUsesWith and remove entries in DeallocateNode.

Reviewed By: vitalybuka

Differential Revision: https://reviews.llvm.org/D130881

commit | commitdiff | tree

Marco Elver [Tue, 6 Sep 2022 13:48:52 +0000 (15:48 +0200)]

[SelectionDAG] Rename CallSiteDbgInfo to NodeExtraInfo

For information infrequently attached to SDNodes, it is useful to
provide a way to add this information out-of-line. This is already done
for call-site specific information.

Rename CallSiteDbgInfo to NodeExtraInfo in preparation of adding
additional information not necessarily related to call sites only.

Reviewed By: vitalybuka

Differential Revision: https://reviews.llvm.org/D130880

commit | commitdiff | tree

Matthias Gehre [Tue, 19 Jul 2022 10:28:54 +0000 (11:28 +0100)]

[llvm/CodeGen] Enable the ExpandLargeDivRem pass for X86, Arm and AArch64

This adds the ExpandLargeDivRem to the default pass pipeline.
The limit at which it expands div/rem instructions is configured
via a new TargetTransformInfo hook (default: no expansion)
X86, Arm and AArch64 backends implement this hook to expand div/rem
instructions with more than 128 bits.

Differential Revision: https://reviews.llvm.org/D130076

Domain: System / Toolchain;

RSS Atom