platform/upstream/llvm.git
19 months agoRemove the dependency between lib/DebugInfoDWARF and MC.
Shubham Sandeep Rastogi [Thu, 6 Oct 2022 19:15:07 +0000 (12:15 -0700)]
Remove the dependency between lib/DebugInfoDWARF and MC.

Differential Revision: https://reviews.llvm.org/D134817

19 months ago[mlir][sparse] Added new SparseTensorEncodingAttr::withoutOrdering factory
wren romano [Thu, 15 Dec 2022 22:32:58 +0000 (14:32 -0800)]
[mlir][sparse] Added new SparseTensorEncodingAttr::withoutOrdering factory

Reviewed By: aartbik, Peiming

Differential Revision: https://reviews.llvm.org/D140171

19 months ago[NFC][bazel] Update comment in llvm-config.h.cmake so headers match
Jordan Rupprecht [Fri, 16 Dec 2022 01:41:48 +0000 (17:41 -0800)]
[NFC][bazel] Update comment in llvm-config.h.cmake so headers match

19 months ago[flang] Check for polymorphism in DEALLOCATE statements in pure procedures
Peter Klausler [Wed, 7 Dec 2022 23:41:47 +0000 (15:41 -0800)]
[flang] Check for polymorphism in DEALLOCATE statements in pure procedures

Semantic checking for DEALLOCATE statements omitted checks for
polymorphic objects and ultimate allocatable components in a pure
procedure, which if not caught would allow execution of an impure
FINAL subroutine defined on a type extension.

Differential Revision: https://reviews.llvm.org/D140129

19 months ago[mlir][tosa] Make tosa.resize to linalg avoid redundant loads for unit width
Rob Suderman [Fri, 16 Dec 2022 00:07:31 +0000 (16:07 -0800)]
[mlir][tosa] Make tosa.resize to linalg avoid redundant loads for unit width

When using a tosa resize for ?x1x1x? to ?x1x?x? we should avoid doing a 2D
interpolation as only two unique values are loaded. As the extract operation
performance numerical computation on its values the superfluous extracts may
fail to be coalesced. Instead we only interpolate between the values if there
are multiple values to interpolate between.

For the integer case we also perform scaling by the scaling-factor to apply
the same integer scaling behavior as interpolation.

Reviewed By: jpienaar, NatashaKnk

Differential Revision: https://reviews.llvm.org/D139979

19 months agoImplement LWG-3646 std::ranges::view_interface::size returns a signed type
Igor Zhukov [Tue, 13 Dec 2022 00:07:26 +0000 (07:07 +0700)]
Implement LWG-3646 std::ranges::view_interface::size returns a signed type

Reviewed By: Mordante, philnik, #libc

Differential Revision: https://reviews.llvm.org/D139791

19 months ago[mlir][sparse] add dim level type toString convenience method
Aart Bik [Thu, 15 Dec 2022 21:50:20 +0000 (13:50 -0800)]
[mlir][sparse] add dim level type toString convenience method

Reviewed By: wrengr, bixia

Differential Revision: https://reviews.llvm.org/D140165

19 months ago[clang][deps] Implement `dump()` for the VFS
Jan Svoboda [Thu, 15 Dec 2022 23:03:07 +0000 (15:03 -0800)]
[clang][deps] Implement `dump()` for the VFS

19 months ago[mlir][sparse] add folder to sparse_tensor.storage.get operation.
Peiming Liu [Thu, 15 Dec 2022 23:04:17 +0000 (23:04 +0000)]
[mlir][sparse] add folder to sparse_tensor.storage.get operation.

Reviewed By: aartbik, wrengr

Differential Revision: https://reviews.llvm.org/D140172

19 months ago[SelectionDAG] Give all the target specific subclasses of SelectionDAGISel their...
Craig Topper [Thu, 15 Dec 2022 23:32:55 +0000 (15:32 -0800)]
[SelectionDAG] Give all the target specific subclasses of SelectionDAGISel their own pass ID.

Previously we had a shared ID in SelectionDAGISel. AMDGPU has an
initializePass function for its subclass of SelectionDAGISel. No
other target does.

This causes all target specific SelectionDAGISel passes to be known
as "amdgpu-isel".

I'm not sure what would happen if another target tried to implement
an initializePass function too since the ID is already claimed.

This patch gives all targets their own ID and passes it down to
SelectionDAGISel constructor to MachineFunctionPass's constructor.

Unfortunately, I think this causes most targets to lose
print-before/after-all support for their SelectionDAGISel pass.
And they probably no longer support start/stop-before/after. We
can add initializePass functions to fix this as a follow up. NOTE:
This was probably also broken if the AMDGPU target isn't compiled in.

Step 1 to fixing PR59538.

Reviewed By: arsenm

Differential Revision: https://reviews.llvm.org/D140161

19 months agoRevert "[lld-macho] Private label aliases to weak symbols should not retain section...
Jez Ng [Wed, 7 Dec 2022 16:17:51 +0000 (11:17 -0500)]
Revert "[lld-macho] Private label aliases to weak symbols should not retain section data"

This reverts commit 6736bce6db5fe15bcb765b976c99fff34500d1eb.

It's causing Swift-related crashes in e.g.
https://bugs.chromium.org/p/chromium/issues/detail?id=1400716 and
elsewhere.

19 months ago[AMDGPU] Lower VGPR to physical SGPR COPY to S_MOV_B32 if VGPR contains the compile...
Alexander Timofeev [Mon, 12 Dec 2022 20:18:19 +0000 (21:18 +0100)]
[AMDGPU] Lower VGPR to physical SGPR COPY to S_MOV_B32 if VGPR contains the compile time constant

Sometimes we have a constant value loaded to VGPR. In case we further
need to rematrerialize it in the physical scalar register we may avoid VGPR to
SGPR copy replacing it with S_MOV_B32.

Reviewed By: JonChesterfield, arsenm

Differential Revision: https://reviews.llvm.org/D139874

19 months agoReland "Delete sanitizer_common-based ('old') scudo: o7"
Mitch Phillips [Thu, 15 Dec 2022 23:34:18 +0000 (15:34 -0800)]
Reland "Delete sanitizer_common-based ('old') scudo: o7"

This reverts commit 78f786d02a464ae69594ee8f678e4cee495642f5.

Internal use cases have been migrated.

This has been on life support for a long time. Now that -fsanitize=scudo
is scudo_standalone, this can be removed.

Tests are sticking around for now to be reused for scudo_standalone
later on.

19 months ago[OpenMP][libomptarget] Add AMDGPU NextGen plugin with asynchronous behavior
Kevin Sala [Mon, 12 Dec 2022 17:13:48 +0000 (18:13 +0100)]
[OpenMP][libomptarget] Add AMDGPU NextGen plugin with asynchronous behavior

This commit adds the AMDGPU NextGen plugin inheriting from PluginInterface's classes.
It also implements the asynchronous behavior in the plugin operations: kernel launches
and memory transfers. To this end, it implements the concept of streams of asynchronous
operations. The streams are implemented using the HSA signals to define input and output
dependencies between asynchronous operations.

Missing features:
  - Retrieve the maximum number of threads per group that a kernel can run. This requires
    reading the image.
  - Implement __tgt_rtl_sync_event, not used on the libomptarget side.

Differential Revision: https://reviews.llvm.org/D138389

19 months ago[mlir][tensor][linalg] Add a pattern that generalizes tensor.pack op.
Hanhan Wang [Wed, 14 Dec 2022 22:36:37 +0000 (14:36 -0800)]
[mlir][tensor][linalg] Add a pattern that generalizes tensor.pack op.

The pattern generalizes a tensor::PackOp into a sequence of tensor +
Linalg ops, when the outer dims are all 1s. It uses the trick of
rank-reduced tensor.extract_slice to get the tile; transpose the tile;
use tensor.insert_slice to insert it to the destination of inner tile.

Reviewed By: pifon2a, tyb0807

Differential Revision: https://reviews.llvm.org/D140060

19 months ago[flang] When resolving a binding, allow for overriding
Peter Klausler [Thu, 15 Dec 2022 21:56:17 +0000 (13:56 -0800)]
[flang] When resolving a binding, allow for overriding

In type-bound generic resolution, when the actual argument
is monomorphic, resolve the call to the target of the
most recent (i.e., least deeply inherited) override of
the binding, if any.

Differential Revision: https://reviews.llvm.org/D140127

19 months ago[mlir] Convert tests to check 'target=...'
Paul Robinson [Thu, 15 Dec 2022 22:49:20 +0000 (14:49 -0800)]
[mlir] Convert tests to check 'target=...'

Part of the project to eliminate special handling for triples in lit
expressions.

19 months ago[MachOYAML] Improve section offset misalignment error message
Michael Buch [Thu, 15 Dec 2022 17:57:53 +0000 (17:57 +0000)]
[MachOYAML] Improve section offset misalignment error message

Before:
```
wrote too much data somewhere, section offsets don't line up
```

After:
```
wrote too much data somewhere, section offsets in section __debug_str for segment __DWARF don't line up: [cursor=0x581], [fileStart=0x0], [sectionOffset=0x579]
```

Differential Revision: https://reviews.llvm.org/D140116

19 months ago[lldb][Test] Propagate llvm::yaml error message in TestFile::fromYaml
Michael Buch [Thu, 15 Dec 2022 16:26:20 +0000 (16:26 +0000)]
[lldb][Test] Propagate llvm::yaml error message in TestFile::fromYaml

Currently the test-suite would swallow the error message
on `llvm::yaml::convertYAML` failures.

This patch simply propagates the error string up to the caller.

Before patch:
```
[ RUN      ] DWARFASTParserClangTests.TestDefaultTemplateParamParsing
/Users/michaelbuch/Git/llvm-worktrees/playground/lldb/unittests/TestingSupport/Symbol/YAMLModuleTester.cpp:19: Failure
Value of: llvm::detail::TakeExpected(File)
Expected: succeeded
  Actual: failed  (convertYAML() failed: )
Assertion failed: (!HasError && "Cannot get value when an error exists!"), function getStorage, file Error.h, line 671.
```

After patch:
```
[ RUN      ] DWARFASTParserClangTests.TestDefaultTemplateParamParsing
/Users/michaelbuch/Git/llvm-worktrees/playground/lldb/unittests/TestingSupport/Symbol/YAMLModuleTester.cpp:19: Failure
Value of: llvm::detail::TakeExpected(File)
Expected: succeeded
  Actual: failed  (convertYAML() failed: wrote too much data somewhere, section offsets don't line up)
Assertion failed: (!HasError && "Cannot get value when an error exists!"), function getStorage, file Error.h, line 671.
```

Differential Revision: https://reviews.llvm.org/D140112

19 months ago[llvm] Convert tests to check 'target=...'
Paul Robinson [Thu, 15 Dec 2022 22:40:14 +0000 (14:40 -0800)]
[llvm] Convert tests to check 'target=...'

Part of the project to eliminate special handling for triples in lit
expressions.

19 months ago[mlgo] Retire LLVM_HAVE_TF_API
Kazu Hirata [Thu, 15 Dec 2022 22:40:16 +0000 (14:40 -0800)]
[mlgo] Retire LLVM_HAVE_TF_API

I've eliminated all uses of LLVM_HAVE_TF_API except a couple of them
being removed in llvm/lib/CodeGen/CMakeLists.txt.  This patch removes
remaining definitions and uses of LLVM_HAVE_TF_API.

Differential Revision: https://reviews.llvm.org/D140169

19 months ago[LoopRotate] Clear block and loop dispo cache when merging blocks.
Florian Hahn [Thu, 15 Dec 2022 22:22:20 +0000 (22:22 +0000)]
[LoopRotate] Clear block and loop dispo cache when merging blocks.

Merging blocks in LoopRotate may remove blocks reference in the block
disposition cache. Clear the cache.

Fixes #59534.

19 months ago[MCJIT] Convert tests to check 'target=...'
Paul Robinson [Thu, 15 Dec 2022 22:08:36 +0000 (14:08 -0800)]
[MCJIT] Convert tests to check 'target=...'

Part of the project to eliminate special handling for triples in lit
expressions.

19 months ago[lld] Convert tests to check 'target=...'
Paul Robinson [Thu, 15 Dec 2022 21:57:50 +0000 (13:57 -0800)]
[lld] Convert tests to check 'target=...'

Part of the project to eliminate special handling for triples in lit
expressions.

19 months ago[mlgo] Use have_tflite instead of have_tf_api
Kazu Hirata [Thu, 15 Dec 2022 21:54:25 +0000 (13:54 -0800)]
[mlgo] Use have_tflite instead of have_tf_api

We are in the process of retiring LLVM_HAVE_TF_API in favor of
LLVM_HAVE_TFLITE.  This patch takes care of the transition in
llvm/test.

Differential Revision: https://reviews.llvm.org/D140133

19 months ago[flang] Correct folding of SAME_TYPE_AS()
Peter Klausler [Mon, 5 Dec 2022 23:26:28 +0000 (15:26 -0800)]
[flang] Correct folding of SAME_TYPE_AS()

The result can't be known to be true at compilation time when
either operand is polymorphic.

Differential Revision: https://reviews.llvm.org/D140126

19 months agoHeaders: tweak inclusion condition for stdatomic.h
Saleem Abdulrasool [Thu, 15 Dec 2022 20:34:04 +0000 (20:34 +0000)]
Headers: tweak inclusion condition for stdatomic.h

MSVC requires that C++23 be available (_HAS_CXX23) else the entire
content is elided. Conditionalise the inclusion properly so that C/C++
code using stdatomic.h for memory_order_* constants are able to do
so without changing the C++ standard. This repairs builds of Swift and
libdispatch after ba49d39b20cc5358da28af2ac82bd336028780bc.

Differential Revision: https://reviews.llvm.org/D139266
Reviewed By: aaron.ballman, Mordante, fsb4000

19 months ago[mailmap] Add my entry
Tommy Chiang [Thu, 15 Dec 2022 21:30:29 +0000 (05:30 +0800)]
[mailmap] Add my entry

19 months ago[PowerPC][GIsel] Materialize i64 constants.
Kai Nacke [Thu, 15 Dec 2022 20:26:32 +0000 (20:26 +0000)]
[PowerPC][GIsel] Materialize i64 constants.

Adds support for i64 constant. It uses the same pattern-based
approach as in SDAG (see PPCISelDAGToDAG::selectI64ImmDirect(),
PPCISelDAGToDAG::selectI64Imm()). It does not support the
prefixed instructions.

Reviewed By: arsenm, tschuett

Differential Revision: https://reviews.llvm.org/D140119

19 months ago[NFC] Cleanup: Remove instances of Function::getBasicBlockList()
Vasileios Porpodas [Thu, 15 Dec 2022 03:07:24 +0000 (19:07 -0800)]
[NFC] Cleanup: Remove instances of Function::getBasicBlockList()

This is part of a series of patches that aim at making Function::getBasicBlockList() private.

Differential Revision: https://reviews.llvm.org/D140121

19 months ago[libc] Handle allocation failures in the dirent API gracefully.
Siva Chandra Reddy [Wed, 14 Dec 2022 22:37:28 +0000 (22:37 +0000)]
[libc] Handle allocation failures in the dirent API gracefully.

Along the way, setting of errno has been moved out of the internal code.

Reviewed By: lntue

Differential Revision: https://reviews.llvm.org/D140078

19 months ago[libc] fix shifting exact multiples of 64 in uint
Michael Jones [Thu, 15 Dec 2022 00:51:23 +0000 (16:51 -0800)]
[libc] fix shifting exact multiples of 64 in uint

The internal uint class had a bug introduced recently when optimizing
the shift routines. When calculating the value of a block, it would
shift an adjacent block by 64 - the shift amount. If the shift amount
was 0, this would be a shift of 64, which is undefined for a 64 bit
integer. This patch fixes this by adding a conditional to catch this
case.

Reviewed By: lntue

Differential Revision: https://reviews.llvm.org/D140070

19 months ago[WindowsDriver] Improve VSInstallPath check for IDE subdirectory
Dimitry Andric [Thu, 15 Dec 2022 20:38:07 +0000 (21:38 +0100)]
[WindowsDriver] Improve VSInstallPath check for IDE subdirectory

This avoids potential memory allocation failures, if VSInstallPath is
not empty, but also does not contain the string "\Common7\IDE".

Fixes: https://github.com/llvm/llvm-project/issues/59434

Reviewed By: mstorsjo

Differential Revision: https://reviews.llvm.org/D140019

19 months agoRevert "[WindowsDriver] Improve VSInstallPath check for IDE subdirectory"
Dimitry Andric [Thu, 15 Dec 2022 20:36:32 +0000 (21:36 +0100)]
Revert "[WindowsDriver]  Improve VSInstallPath check for IDE subdirectory"

This reverts commit cc40dacbd0b736522d0254fb5525cde22bd5f166.

19 months ago[WindowsDriver] Improve VSInstallPath check for IDE subdirectory
Dimitry Andric [Sun, 11 Dec 2022 18:18:21 +0000 (19:18 +0100)]
[WindowsDriver]  Improve VSInstallPath check for IDE subdirectory

This avoids potential memory allocation failures, if VSInstallPath is
not empty, but also does not contain the string "\Common7\IDE".

Fixes: https://github.com/llvm/llvm-project/issues/59434

Reviewed By: mstorsjo

Differential Revision: https://reviews.llvm.org/D140019

19 months ago[NFC] Rename Instruction::insertAt() to Instruction::insertInto(), to be consistent...
Vasileios Porpodas [Thu, 15 Dec 2022 01:42:26 +0000 (17:42 -0800)]
[NFC] Rename Instruction::insertAt() to Instruction::insertInto(), to be consistent with BasicBlock::insertInto()

Differential Revision: https://reviews.llvm.org/D140085

19 months ago[clang] Convert tests to check 'target=...'
Paul Robinson [Thu, 15 Dec 2022 20:25:59 +0000 (12:25 -0800)]
[clang] Convert tests to check 'target=...'

Part of the project to eliminate special handling for triples in lit
expressions.

19 months agoRevert "[NFC][DebugInfo] Autogenerate check lines in assignment-tracking/sroa/*"
Roman Lebedev [Thu, 15 Dec 2022 20:20:37 +0000 (23:20 +0300)]
Revert "[NFC][DebugInfo] Autogenerate check lines in assignment-tracking/sroa/*"

As per post-commit feedback, DebugInfo owners are not receptive of the
idea of having a single source of truth instead of relying on everyone
to know how to update these tests manually.

This reverts commit 4ff8d1e315bdd26daae8a6928aa46e347fe0d6eb.
This reverts commit a33b40d61c64b95d736f1a0dd537e9087c330304.

19 months ago[mlir][sparse] performs a tab cleanup (NFC)
Aart Bik [Thu, 15 Dec 2022 19:38:55 +0000 (11:38 -0800)]
[mlir][sparse] performs a tab cleanup (NFC)

Reviewed By: Peiming

Differential Revision: https://reviews.llvm.org/D140142

19 months ago[AArch64] Minor changes and sanity checks in relation to https://reviews.llvm.org...
Nilanjana Basu [Thu, 15 Dec 2022 19:35:34 +0000 (01:05 +0530)]
[AArch64] Minor changes and sanity checks in relation to https://reviews.llvm.org/D135229

19 months ago[clang] Support for read-only types
MalavikaSamak [Thu, 15 Dec 2022 19:02:46 +0000 (11:02 -0800)]
[clang] Support for read-only types

The main goal of this work is to allow developers to express the need to place instances
of a class or structure in the read-only part of the program memory. Such a placement is
desirable to prevent any further modifications to the instances of a given structure, by
leveraging the read-only run time protection.

To achieve this, we are introducing a new attribute that can be attached to any record
definition or a declaration. The compiler enforces that every instance of this type can
be placed in the read-only segment of the program memory, provided the target triplet
supports such a placement. If an instance of a given type bearing this attribute doesn’t
satisfy such a placement, the compiler attaches an appropriate warning at suitable program
locations. In other words, adding this attribute to a type requires every instance of this
type to be a global const, which are placed in the read-only segments for most target
triplets. However, this is *not a language feature* and it *need not* be true for
*all target triplets*.

The current patch emits a warning at global variable declaration sites of types bearing
the attribute without const qualification and corresponding note attached to the type
definition/declaration.

Differential Revision: https://reviews.llvm.org/D135851

19 months ago[OpenMP][libomptarget] Add utility header for AMDGPU plugins
Kevin Sala [Sun, 11 Dec 2022 15:28:02 +0000 (16:28 +0100)]
[OpenMP][libomptarget] Add utility header for AMDGPU plugins

This patch prepares the PluginInterface for the new AMDGPU NextGen plugin. The original and the
NextGen plugin will share some structures and functionalities. We use this header for defining
them and avoiding code duplication.

Differential Revision: https://reviews.llvm.org/D139792

19 months ago[libc][NFC] Rename "loader" to "startup".
Siva Chandra Reddy [Wed, 14 Dec 2022 20:38:45 +0000 (20:38 +0000)]
[libc][NFC] Rename "loader" to "startup".

Reviewed By: lntue

Differential Revision: https://reviews.llvm.org/D140049

19 months ago[mlir][sparse] remove unintended file and trailing white space
Peiming Liu [Thu, 15 Dec 2022 19:32:25 +0000 (19:32 +0000)]
[mlir][sparse] remove unintended file and trailing white space

Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D140139

19 months ago[mlir][VectorToGPU] Support transposed+broadcasted 2D MMA load
Lei Zhang [Thu, 15 Dec 2022 19:34:14 +0000 (19:34 +0000)]
[mlir][VectorToGPU] Support transposed+broadcasted 2D MMA load

This is loading from 2-D memref, in addition to D139655 where we
load from 1-D memref cases.

Reviewed By: ThomasRaoux

Differential Revision: https://reviews.llvm.org/D140136

19 months agoFix: use "using namespace" in a header file.
Emmmer [Thu, 15 Dec 2022 19:30:55 +0000 (03:30 +0800)]
Fix: use "using namespace" in a header file.

19 months ago[mlir][VectorToGPU] Support transposed+broadcasted 1D MMA load
Lei Zhang [Thu, 15 Dec 2022 19:16:01 +0000 (19:16 +0000)]
[mlir][VectorToGPU] Support transposed+broadcasted 1D MMA load

This is now possible with transpose semantics on subgroup MMA
load ops.

Reviewed By: ThomasRaoux

Differential Revision: https://reviews.llvm.org/D139655

19 months ago[AutoFDO] Use getHeadSamplesEstimate instead of getTotalSamples to compute profile...
wlei [Wed, 14 Dec 2022 22:47:37 +0000 (14:47 -0800)]
[AutoFDO] Use getHeadSamplesEstimate instead of getTotalSamples to compute profile callsite staleness

Fix two issues for profile staleness report.

1) It should be more accurate to use the sum of all entry count(`getHeadSamplesEstimate`) for the callsite samples than the total samples, since even the top-level callsite is mismatched, it does affect the inlining but it can still be merged into base profile and used later.

2) I accidentally missed to persist the num of mismatched callsite into binary.

Also added the asm testing to test the decoding of the section.

Reviewed By: hoy, wenlei

Differential Revision: https://reviews.llvm.org/D140063

19 months agoRevert "Revert "[AArch64][GlobalISel][Legalizer] Legalize G_SHUFFLE_VECTOR with diffe...
Kevin Athey [Thu, 15 Dec 2022 19:19:24 +0000 (11:19 -0800)]
Revert "Revert "[AArch64][GlobalISel][Legalizer] Legalize G_SHUFFLE_VECTOR with different lengths""

This reverts commit 192cc76e0be688106492989cd845ba786a7ae36d.

Reverted Revert, as build was fixed while I was examining.

19 months agoRevert "[AArch64][GlobalISel][Legalizer] Legalize G_SHUFFLE_VECTOR with different...
Kevin Athey [Thu, 15 Dec 2022 17:21:37 +0000 (09:21 -0800)]
Revert "[AArch64][GlobalISel][Legalizer] Legalize G_SHUFFLE_VECTOR with different lengths"

This reverts commit 4c52fb1a5ee20846627d16e38f5dec08c08f8884.

Breaks sanitizer ubsan buildbot:
https://lab.llvm.org/buildbot/#/builders/85/builds/12983

19 months ago[mlgo] Use LLVM_HAVE_TFLITE instead of LLVM_HAVE_TF_API
Kazu Hirata [Thu, 15 Dec 2022 19:11:24 +0000 (11:11 -0800)]
[mlgo] Use LLVM_HAVE_TFLITE instead of LLVM_HAVE_TF_API

This patch replaces uses of LLVM_HAVE_TF_API with LLVM_HAVE_TFLITE in
a couple of CMakeLists.txt.

Now that 842b0d0fe2dd142305a9461e50cdce9aff7f86bc has landed,
we now have:

  LLVM_HAVE_TF_API is defined if and only if LLVM_HAVE_TFLITE
  evaluates to true

in the CMake variable world (assuming that you do not set
LLVM_HAVE_TF_API on the cmake invocation).

FWIW, the story is a little different in the C++ macro world, where:

  LLVM_HAVE_TF_API is defined if and only if LLVM_HAVE_TFLITE is
  defined

This is why edc83a15b45e6b91fce3f35622a6b0a6d34e5211 consisted only of
mechanical replacements.

Differential Revision: https://reviews.llvm.org/D140061

19 months ago[RISCV][InsertVSETVL] Incorporate demanded fields into compatibility interface [nfc]
Philip Reames [Thu, 15 Dec 2022 19:09:49 +0000 (11:09 -0800)]
[RISCV][InsertVSETVL] Incorporate demanded fields into compatibility interface [nfc]

This reworks the API to explicitly pass in the demanded fields instead of requering them internally.  At the moment, this is NFC, but it will stop being so in future changes which adjust the demanded bits in the caller.

19 months agoAPFloat: Add isSmallestNormalized predicate function
Matt Arsenault [Thu, 1 Dec 2022 14:12:20 +0000 (09:12 -0500)]
APFloat: Add isSmallestNormalized predicate function

It was annoying to write the check for this in the one case I added,
and I'm planning on adding another, so add a convenient PatternMatch
like for other special case values.

I have no idea what is going on in the DoubleAPFloat case, I reversed
this from the makeSmallestNormalized test. Also could implement this
as *this == getSmallestNormalized() for less code, but this avoids the
construction of a temporary APFloat copy and follows the style of the
other functions.

19 months ago[RISCV] Teach RISCVSExtWRemoval to remove sext.w whose upper bits aren't demanded.
Craig Topper [Thu, 15 Dec 2022 19:01:20 +0000 (11:01 -0800)]
[RISCV] Teach RISCVSExtWRemoval to remove sext.w whose upper bits aren't demanded.

SelectionDAG aggressively creates sext_inreg operations after
promoting an i32 add. If the add is later matched to a sh1add,
sh2add or sh3add, a sext.w from the sext_inreg will get left behind.
In many cases we can prove this sext.w is unnecessary by checking
if its upper bits are ever used.

19 months ago[APInt] Fix a bug in the unit testing introduced in 55968109d
Peter Rong [Thu, 15 Dec 2022 18:51:33 +0000 (10:51 -0800)]
[APInt] Fix a bug in the unit testing introduced in 55968109d

Signed-off-by: Peter Rong <PeterRong96@gmail.com>
19 months ago[flang] Semantic-check for procedure pointers with assumed character length
Kelvin Li [Thu, 15 Dec 2022 16:59:08 +0000 (11:59 -0500)]
[flang] Semantic-check for procedure pointers with assumed character length

Fixes: https://github.com/llvm/llvm-project/issues/59496

Committed on behalf of tislam

Differential Revision: https://reviews.llvm.org/D139333

19 months ago[NFC][DebugInfo] assignment-tracking/sroa: do check !DIAssignID, and the metadata
Roman Lebedev [Thu, 15 Dec 2022 18:20:17 +0000 (21:20 +0300)]
[NFC][DebugInfo] assignment-tracking/sroa: do check !DIAssignID, and the metadata

Based on post-commit feedback for the original change.

19 months ago[UpdateTestChecks] Handle !DIAssignID metadata
Roman Lebedev [Thu, 15 Dec 2022 18:19:07 +0000 (21:19 +0300)]
[UpdateTestChecks] Handle !DIAssignID metadata

19 months ago[NFC][UpdateTestChecks] Add coverage for missing !DIAssignID handling
Roman Lebedev [Thu, 15 Dec 2022 18:05:47 +0000 (21:05 +0300)]
[NFC][UpdateTestChecks] Add coverage for missing !DIAssignID handling

19 months agoFix buildbot out of memory
Emmmer [Sun, 30 Oct 2022 17:35:16 +0000 (01:35 +0800)]
Fix buildbot out of memory

https://lab.llvm.org/buildbot#builders/17/builds/31659

19 months ago[AMDGPU] Test fixup
Christudasan Devadasan [Thu, 15 Dec 2022 17:46:53 +0000 (23:16 +0530)]
[AMDGPU] Test fixup

Changing cast_lds_gv into a kernel function to
lower the LDS usage appropriately. The LDS lowering
is currently won't happen for orphan device functions.

19 months ago[APInt] provide a safe API for zext value and sext value.
Peter Rong [Fri, 9 Dec 2022 00:59:22 +0000 (16:59 -0800)]
[APInt] provide a safe API for zext value and sext value.

Currently, APInt::getSExtValue and getZExtValue crashes on values with more than 64 bits.
Users may accidently crash the compiler with this setting when the integer may be i128.
As shown in https://github.com/llvm/llvm-project/issues/59316

In this patch we provide a trySExtValue and tryZExtValue to return an Optional, the user
needs to explictly unwrap it and condsier the possibility where there my no value in it.

Reviewed By: RKSimon

Differential Revision: https://reviews.llvm.org/D139683

19 months ago[NFC] Rename Function::insertBasicBlockAt() to Function::insert().
Vasileios Porpodas [Wed, 14 Dec 2022 23:57:08 +0000 (15:57 -0800)]
[NFC] Rename Function::insertBasicBlockAt() to Function::insert().

I think this is a better name because it is what STL uses.

Differential Revision: https://reviews.llvm.org/D140068

19 months ago[clang-tools-extra] Convert tests to check 'target=...'
Paul Robinson [Thu, 15 Dec 2022 17:50:21 +0000 (09:50 -0800)]
[clang-tools-extra] Convert tests to check 'target=...'

Part of the project to eliminate special handling for triples in lit
expressions.

19 months ago[libc] Templatize ato* tests
Alex Brachet [Thu, 15 Dec 2022 17:30:00 +0000 (17:30 +0000)]
[libc] Templatize ato* tests

This catches missing corner cases from atol tests.

Differential Revision: https://reviews.llvm.org/D140076

19 months ago[ADT] Remove llvm::Optional::transform
Kazu Hirata [Thu, 15 Dec 2022 17:24:54 +0000 (09:24 -0800)]
[ADT] Remove llvm::Optional::transform

I've removed the last uses of transform on Dec 14, 2022 in commit
230df792e17519071a9ef4dc0fb10132540dfbb8.

Differential Revision: https://reviews.llvm.org/D140082

19 months agoApply clang-tidy fixes for llvm-qualified-auto in ElementwiseOpFusion.cpp (NFC)
Mehdi Amini [Sat, 10 Dec 2022 10:57:20 +0000 (10:57 +0000)]
Apply clang-tidy fixes for llvm-qualified-auto in ElementwiseOpFusion.cpp (NFC)

19 months agoApply clang-tidy fixes for bugprone-argument-comment in BufferizableOpInterfaceImpl...
Mehdi Amini [Sat, 10 Dec 2022 10:50:43 +0000 (10:50 +0000)]
Apply clang-tidy fixes for bugprone-argument-comment in BufferizableOpInterfaceImpl.cpp (NFC)

19 months ago[lldb] Fix a warning
Kazu Hirata [Thu, 15 Dec 2022 17:18:55 +0000 (09:18 -0800)]
[lldb] Fix a warning

This patch fixes:

  lldb/source/Plugins/Instruction/RISCV/EmulateInstructionRISCV.cpp:1402:18:
  error: default label in switch which covers all enumeration values
  [-Werror,-Wcovered-switch-default]

19 months ago[InstCombine] add nuw to any (1<<x)
Sanjay Patel [Thu, 15 Dec 2022 16:41:22 +0000 (11:41 -0500)]
[InstCombine] add nuw to any (1<<x)

https://alive2.llvm.org/ce/z/9EjDKE

This was mentioned as a missing fold in D139598.

It can unlock follow-on folds in some cases.
This verifies one of the changed tests:
https://alive2.llvm.org/ce/z/B_btDM

19 months ago[mlir][bufferize][NFC] Add statistics to OneShotBufferizePass
Matthias Springer [Thu, 15 Dec 2022 16:55:39 +0000 (17:55 +0100)]
[mlir][bufferize][NFC] Add statistics to OneShotBufferizePass

Print statistics about the number of alloc/deallocs and in-place/out-of-place bufferization.

Differential Revision: https://reviews.llvm.org/D139538

19 months ago[LLDB][RISCV] Add RVD instruction support for EmulateInstructionRISCV
Emmmer [Sat, 10 Dec 2022 08:59:13 +0000 (16:59 +0800)]
[LLDB][RISCV] Add RVD instruction support for EmulateInstructionRISCV

RVD extension is a double-precision floating-point instruction-set extension, which adds double-precision floating-point computational instructions compliant with the IEEE 754-2008 arithmetic standard.

This patch:
- Reuse most of the functions in the "F extension" to impl the"D extension"
- corresponding unittests.

Reviewed By: DavidSpickett

Differential Revision: https://reviews.llvm.org/D140032

19 months agoRevert "[SelectionDAG] Do not second-guess alignment for alloca"
Ron Lieberman [Thu, 15 Dec 2022 16:54:41 +0000 (10:54 -0600)]
Revert "[SelectionDAG] Do not second-guess alignment for alloca"

Breaks amdgpu buildbot https://lab.llvm.org/buildbot/#/builders/193
 23491

This reverts commit ffedf47d8b793e07317f82f9c2a5f5425ebb71ad.

19 months ago[mlir][memref] Fold subview(subview(x))
Matthias Springer [Thu, 15 Dec 2022 16:49:46 +0000 (17:49 +0100)]
[mlir][memref] Fold subview(subview(x))

Folding of rank-reduced subviews is also supported.

Differential Revision: https://reviews.llvm.org/D140110

19 months ago[mlir][cf] Print message in cf.assert to LLVM lowering
Matthias Springer [Thu, 15 Dec 2022 16:37:49 +0000 (17:37 +0100)]
[mlir][cf] Print message in cf.assert to LLVM lowering

The assert message was previously ignored. The lowered IR now calls `puts` it in case of a failed assertion.

Differential Revision: https://reviews.llvm.org/D138647

19 months ago[X86] combineBitcastvxi1 - handle boolmask sign-extension through vselect
Simon Pilgrim [Thu, 15 Dec 2022 16:40:39 +0000 (16:40 +0000)]
[X86] combineBitcastvxi1 - handle boolmask sign-extension through vselect

See if we can freely sign-extend both sources of a vselect operand, also handle allones constant build vectors (easily rematerializable and uses in the test case).

Fixes #59526

19 months ago[LoopVersioningLICM] Regenerate test checks (NFC)
Nikita Popov [Thu, 15 Dec 2022 16:33:13 +0000 (17:33 +0100)]
[LoopVersioningLICM] Regenerate test checks (NFC)

19 months ago[RISCV] Bugfix for 90f91683 noticed in follow up work
Philip Reames [Thu, 15 Dec 2022 16:26:25 +0000 (08:26 -0800)]
[RISCV] Bugfix for 90f91683 noticed in follow up work

I went to extend this locally, and then promptly tripped across a bug which is possible with the landed patch.  The problematic case is:
vsetvli zero, 4, <some vtype>
vmv.x.s x1, v0
vsetvli a0, zero, <same type>

In this case, the naive rewrite - what I had implemented - would form:
vsetvli zero, zero, <same vtype>
vmv.x.s x1, v0

This is, amusingly, correct for the vmv.x.s, but is incorrect for the instructions which follow the sequence and probably rely on VL=VLMAX.  (The VL before the sequence is unknown, and thus doesn't have to be VLMAX.)

I plan to rework the rewrite code to be more robust here, but I wanted to directly fix the bug first.  Sorry for the lack of test; I didn't manage to reproduce this without an additional optimization change after a few minutes of trying.

19 months ago[UpdateTestChecks][PowerPC] Add ppc64 triple support
Kai Nacke [Thu, 15 Dec 2022 15:44:38 +0000 (15:44 +0000)]
[UpdateTestChecks][PowerPC] Add ppc64 triple support

Newer tests use ppc64le-linux triple. It is identical to ppc32.
Some of the PPC GIsel tests were generated by this change.

Reviewed by: RKSimon

Differential Revision: https://reviews.llvm.org/D140107

19 months ago[X86] Add test case for Issue #59526
Simon Pilgrim [Thu, 15 Dec 2022 16:19:35 +0000 (16:19 +0000)]
[X86] Add test case for Issue #59526

19 months ago[ProgressMeter] Remove unnecessary semicolon (NFC)
Nikita Popov [Thu, 15 Dec 2022 16:04:37 +0000 (17:04 +0100)]
[ProgressMeter] Remove unnecessary semicolon (NFC)

Fix compiler warning.

19 months ago[clang][PPC] Supporting -mcpu=405
Qiongsi Wu [Thu, 15 Dec 2022 15:03:05 +0000 (10:03 -0500)]
[clang][PPC] Supporting -mcpu=405

The  ClangBuiltLinux  project relies on `-mcpu=405`. Before https://reviews.llvm.org/D139720, `clang` treated `-mcpu=405` implicitly in the same way as `-mcpu=generic`, because `405` was an unknown value and `clang` did not validate unknown input values. https://reviews.llvm.org/D139720 added the validation of `-mcpu` input value, and `clang` now generates an error with `-mcpu=405`. For further details of the problem, see https://github.com/ClangBuiltLinux/linux/issues/1771.

This patch adds support of `-mcpu=405` explicitly, and treats it as an equivalent of `-mcpu=generic`.

Reviewed By: nemanjai

Differential Revision: https://reviews.llvm.org/D140080

19 months ago[NFC][DebugInfo] Autogenerate check lines in assignment-tracking/sroa/*
Roman Lebedev [Thu, 15 Dec 2022 15:43:36 +0000 (18:43 +0300)]
[NFC][DebugInfo] Autogenerate check lines in assignment-tracking/sroa/*

Manual checklines are generally not welcomed in LLVM tests.
They are really brittle, and really hard to update.
*Please* use scripts!

19 months ago[GVN] reportMayClobberedLoad - avoid repeated cast<> calls. NFCI.
Simon Pilgrim [Thu, 15 Dec 2022 15:44:27 +0000 (15:44 +0000)]
[GVN] reportMayClobberedLoad - avoid repeated cast<> calls. NFCI.

Just perform each cast<Instruction> once - we can make OtherAccess a Instruction* type as we only ever assign it from a known LoadInst/StoreInst

19 months ago[clang-tidy] Add configuration option to bugprone-unchecked-optional-access check.
Yitzhak Mandelbaum [Wed, 14 Dec 2022 14:05:38 +0000 (14:05 +0000)]
[clang-tidy] Add configuration option to bugprone-unchecked-optional-access check.

The underlying model already supports ignoring accesses to optionals through
smart pointers. This patch exposes that option through ClangTidy's configuration
options.

Differential Revision: https://reviews.llvm.org/D140021

19 months ago[MRI] Print more debug infor in clearVirtRegs() (NFC)
Nikita Popov [Thu, 15 Dec 2022 15:42:23 +0000 (16:42 +0100)]
[MRI] Print more debug infor in clearVirtRegs() (NFC)

19 months ago[clang][dataflow] Improve optional model's support for ignoring smart pointers.
Yitzhak Mandelbaum [Wed, 14 Dec 2022 13:54:38 +0000 (13:54 +0000)]
[clang][dataflow] Improve optional model's support for ignoring smart pointers.

The optional model has an option to ignore optionals accessed through smart
pointer types (other than optional itself). This patch improves this feature in
two ways:

1. We extend support to optionals accessed directly through the smart pointer,
like `ptr->value()`. Previously, support was limited to cases that went through
an intermediate field.

2. We clean up the implementation by removing the option from the analysis,
leaving it only in the diagnostic phase (where it is relevant).

3. Adjusts a test which was misleading in what it was testing.

Differential Revision: https://reviews.llvm.org/D140020

19 months agoFix failing test in presence of exceptions D140098
Guillaume Chatelet [Thu, 15 Dec 2022 15:32:21 +0000 (15:32 +0000)]
Fix failing test in presence of exceptions D140098

There is no point in testing this behavior since this is std::optional's semantic.

19 months ago[RISCV][InsertVSETVLI] Mutate prior vsetvli AVL if doing so allows us to remove a...
Philip Reames [Wed, 14 Dec 2022 20:27:53 +0000 (12:27 -0800)]
[RISCV][InsertVSETVLI] Mutate prior vsetvli AVL if doing so allows us to remove a toggle

This extends the backwards walk to allow mutating the previous vsetvl's AVL value if it was not used by any instructions in between. In practice, this mostly benefits vmv.x.s and fvmv.f.s patterns since vector instructions which ignore VL are rare.

Differential Revision: https://reviews.llvm.org/D140048

19 months ago[AArch64] Extending lowering of 'trunc <(8|16) x i64> %x to <(8|16) x i8>' to use...
Nilanjana Basu [Tue, 22 Nov 2022 20:04:26 +0000 (12:04 -0800)]
[AArch64] Extending lowering of 'trunc <(8|16) x i64> %x to <(8|16) x i8>' to use tbl instructions

[AArch64] Patch for lowering trunc instructions to 'tbl' for (8|16)xi32 -> (8|16)xi8 conversions in https://reviews.llvm.org/D133495 is extended to support trunc to tbl lowering for (8|16) x i64 to (8|16) x i8.

A microbenchmark for runtime for these transformations is added in https://reviews.llvm.org/D136274

Reviewed by: fhahn, t.p.northover

Differential Revision: https://reviews.llvm.org/D135229

19 months ago[AArch64] Extra unit tests for trunc lowering of vectors
Nilanjana Basu [Wed, 23 Nov 2022 00:16:44 +0000 (16:16 -0800)]
[AArch64] Extra unit tests for trunc lowering of vectors

These tests show code generation for vectorized trunc lowering from i16 to i8 in AArch64.

Reviewed By: fhahn

Differential Revision: https://reviews.llvm.org/D137293

19 months ago[Hexagon] Record original result type in wide multiplication
Krzysztof Parzyszek [Thu, 15 Dec 2022 00:23:45 +0000 (16:23 -0800)]
[Hexagon] Record original result type in wide multiplication

19 months ago[SelectionDAG] Do not second-guess alignment for alloca
Andrew Savonichev [Thu, 15 Dec 2022 15:16:45 +0000 (18:16 +0300)]
[SelectionDAG] Do not second-guess alignment for alloca

Alignment of an alloca in IR can be lower than the preferred alignment
on purpose, but this override essentially treats the preferred
alignment as the minimum alignment.

The patch changes this behavior to always use the specified
alignment. If alignment is not set explicitly in LLVM IR, it is set to
DL.getPrefTypeAlign(Ty) in computeAllocaDefaultAlign.

Tests are changed as well: explicit alignment is increased to match
the preferred alignment if it changes output, or omitted when it is
hard to determine the right value (e.g. for pointers, some structs, or
weird types).

Differential Revision: https://reviews.llvm.org/D135462

19 months ago[NFC] Use std::optional over llvm::Optional to implement MaybeAlign
Guillaume Chatelet [Thu, 15 Dec 2022 13:52:00 +0000 (13:52 +0000)]
[NFC] Use std::optional over llvm::Optional to implement MaybeAlign

Differential Revision: https://reviews.llvm.org/D140098

19 months agoRevert "[clang][CodeGen] Use base subobject type layout for potentially-overlapping...
Vladislav Dzhidzhoev [Thu, 15 Dec 2022 15:09:06 +0000 (18:09 +0300)]
Revert "[clang][CodeGen] Use base subobject type layout for potentially-overlapping fields"

This reverts commit 731abdfdcc33d813e6c3b4b89eff307aa5c18083.

This commit breaks some tests in libcxx, e.g.
`std/utilities/expected/expected.expected/ctor/ctor.inplace.pass.cpp`

19 months ago[CodeComplete] Complete members of dependent `auto` variables
Sam McCall [Wed, 14 Dec 2022 19:53:18 +0000 (20:53 +0100)]
[CodeComplete] Complete members of dependent `auto` variables

When the initializer of an `auto` variable is dependent, clang doesn't give the
DeclRefExpr a useful dependent type that we can apply heuristics to.
However we can dig one up by looking at the initializer.

Differential Revision: https://reviews.llvm.org/D140044

19 months ago[PowerPC] Simplify PPCSubtarget
Kai Nacke [Wed, 14 Dec 2022 15:55:53 +0000 (15:55 +0000)]
[PowerPC] Simplify PPCSubtarget

The flags, initialization of the flags, and the getter methods for
features defined in PPC.td can be generated by TableGen.

Reviewed By: shchenz

Differential Revision: https://reviews.llvm.org/D140028

19 months ago[PPC] Convert tests to check 'target=<triple>'
Paul Robinson [Tue, 29 Nov 2022 22:41:36 +0000 (14:41 -0800)]
[PPC] Convert tests to check 'target=<triple>'

Two tests checked 'ppc64be' which appears not to exist; the tests
pass on clang-ppc64be-linux-multistage so I assume they are fine
and just removed those UNSUPPORTED lines. All others were converted
to the new target= format, and get the same results on ppc bots as
before.

Part of the project to eliminate special handling for triples in lit
expressions.

Differential Revision: https://reviews.llvm.org/D138954

19 months agoReland "[TargetLowering] Teach DemandedBits about VSCALE"
Benjamin Maxwell [Mon, 12 Dec 2022 16:26:20 +0000 (16:26 +0000)]
Reland "[TargetLowering] Teach DemandedBits about VSCALE"

Reland with a fixup to avoid converting APInts to int64_t which allowed for
overflows (UB) with sufficiently high/low multiplier values.

This allows DemandedBits to see the result of VSCALE will be at most
VScaleMax * some compile-time constant. This relies on the vscale_range()
attribute being present on the function, with a max set. (This is done by
default when clang is targeting AArch64+SVE).

Using this various redundant operations (zexts, sexts, ands, ors, etc)
can be eliminated.

Differential Revision: https://reviews.llvm.org/D138508

19 months ago[MachineCombiner][RISCV] Support inverse instructions reassociation
Anton Sidorenko [Thu, 24 Nov 2022 12:30:01 +0000 (15:30 +0300)]
[MachineCombiner][RISCV] Support inverse instructions reassociation

This patch adds reassociation of FADD/FSUB instruction pairs.

Differential Revision: https://reviews.llvm.org/D138660