review.tizen.org Git - platform/upstream/llvm.git/log

projects / platform / upstream / llvm.git / log

commit | commitdiff | tree

Michael Kuperstein [Thu, 19 Jan 2017 23:22:55 +0000 (23:22 +0000)]

Revert r292530 since it breaks buildbots.

llvm-svn: 292534

commit | commitdiff | tree

Dehao Chen [Thu, 19 Jan 2017 23:20:31 +0000 (23:20 +0000)]

clang-format SampleProfile.cpp (NFC)

llvm-svn: 292533

commit | commitdiff | tree

Peter Collingbourne [Thu, 19 Jan 2017 23:10:14 +0000 (23:10 +0000)]

LTO: Flush the resolution file after writing to it.

Without this the file could be truncated if the linker crashes.

llvm-svn: 292532

commit | commitdiff | tree

Davide Italiano [Thu, 19 Jan 2017 23:07:51 +0000 (23:07 +0000)]

[SCCP] Teach the pass how to handle `div` with overdefined operands.

This can prove that:

extern int f;
int g() {
    int x = 0;
    for (int i = 0; i < 365; ++i) {
        x /= f;
    }
    return x;
}

always returns zero. Thanks to Sanjoy for confirming this
transformation actually made sense (bugs are mine).

llvm-svn: 292531

commit | commitdiff | tree

Michael Kuperstein [Thu, 19 Jan 2017 22:55:46 +0000 (22:55 +0000)]

[PM] Make default pipeline test for the new PM strict

Use CHECK-NEXT to verify that a test breaks whenever unexpected passes,
analyses, or invalidations show up in default pipelines. The test case
is constructed so that we don't expect to invalidate anything, and needs
to be kept that way.

(Right now it does show some invalidations - all of those are intentional
and temporary.)

Differential Revision: https://reviews.llvm.org/D28887

llvm-svn: 292530

commit | commitdiff | tree

Haicheng Wu [Thu, 19 Jan 2017 22:51:03 +0000 (22:51 +0000)]

Revert "[InlineCost] Use TTI to check if GEP is free."

This reverts commit r292526. The test case has problem.

llvm-svn: 292529

commit | commitdiff | tree

Simon Pilgrim [Thu, 19 Jan 2017 22:41:22 +0000 (22:41 +0000)]

[SelectionDAG] Improve knownbits handling of UMIN/UMAX (PR31293)

This patch improves the knownbits logic for unsigned integer min/max opcodes.

For UMIN we know that the result will have the maximum of the inputs' known leading zero bits in the result, similarly for UMAX the maximum of the inputs' leading one bits.

This is particularly useful for simplifying clamping patterns,. e.g. as SSE doesn't have a uitofp instruction we want to use sitofp instead where possible and for that we need to confirm that the top bit is not set.

Differential Revision: https://reviews.llvm.org/D28853

llvm-svn: 292528

commit | commitdiff | tree

Haicheng Wu [Thu, 19 Jan 2017 22:28:34 +0000 (22:28 +0000)]

[InlineCost] Use TTI to check if GEP is free.

Currently, a GEP is considered free only if its indices are all constant.
TTI::getGEPCost() can give target-specific more accurate analysis. TTI is
already used for the cost of many other instructions.

Differential Revision: https://reviews.llvm.org/D28693

llvm-svn: 292526

commit | commitdiff | tree

Alex Shlyapnikov [Thu, 19 Jan 2017 22:15:54 +0000 (22:15 +0000)]

Whenever reasonable, merge ASAN quarantine batches to save memory.

Summary:
There are cases when thread local quarantine drains almost empty
quarantine batches into the global quarantine. The current approach leaves
them almost empty, which might create a huge memory overhead (each batch
is 4K/8K, depends on bitness).

Reviewers: eugenis

Subscribers: kubabrecka, llvm-commits

Differential Revision: https://reviews.llvm.org/D28068

llvm-svn: 292525

commit | commitdiff | tree

Hans Wennborg [Thu, 19 Jan 2017 21:33:13 +0000 (21:33 +0000)]

Don't inline dllimport functions referencing non-imported methods

This is another follow-up to r246338. I had assumed methods were already
handled by the AST visitor, but turns out they weren't.

llvm-svn: 292522

commit | commitdiff | tree

Stanislav Mekhanoshin [Thu, 19 Jan 2017 21:26:22 +0000 (21:26 +0000)]

[AMDGPU] Add exec copy to LiveIntervals in SILowerControlFlow::emitElse

This instruction is missing from LiveIntervals.
I'm not aware of any problems because of this though.

Differential Revision: https://reviews.llvm.org/D28879

llvm-svn: 292521

commit | commitdiff | tree

Kostya Serebryany [Thu, 19 Jan 2017 21:14:47 +0000 (21:14 +0000)]

[libFuzzer] ensure that entries in PersistentAutoDictionary are not empty

llvm-svn: 292520

commit | commitdiff | tree

Davide Italiano [Thu, 19 Jan 2017 21:07:42 +0000 (21:07 +0000)]

[SCCP] Update comment in visitBinaryOp() after recent changes.

llvm-svn: 292519

commit | commitdiff | tree

Richard Smith [Thu, 19 Jan 2017 21:00:13 +0000 (21:00 +0000)]

PR13403 (+duplicates): implement C++ DR1310 (wg21.link/cwg1310).

Under this defect resolution, the injected-class-name of a class or class
template cannot be used except in very limited circumstances (when declaring a
constructor, in a nested-name-specifier, in a base-specifier, or in an
elaborated-type-specifier). This is apparently done to make parsing easier, but
it's a pain for us since we don't know whether a template-id using the
injected-class-name is valid at the point when we annotate it (we don't yet
know whether the template-id will become part of an elaborated-type-specifier).

As a tentative resolution to a perceived language defect, mem-initializer-ids
are added to the list of exceptions here (they generally follow the same rules
as base-specifiers).

When the reference to the injected-class-name uses the 'typename' or 'template'
keywords, we permit it to be used to name a type or template as an extension;
other compilers also accept some cases in this area. There are also a couple of
corner cases with dependent template names that we do not yet diagnose, but
which will also get this treatment.

llvm-svn: 292518

commit | commitdiff | tree

Serge Rogatch [Thu, 19 Jan 2017 20:27:11 +0000 (20:27 +0000)]

[XRay][Arm] Enable back XRay testing on Arm32 and fix the failing tests

Summary:
Testing of XRay was occasionally disabled on 32-bit Arm targets (someone assumed that XRay was supported on 64-bit targets only). This patch should fix that problem. Also here the instruction&data cache incoherency problem is fixed, because it may be causing a test to fail.
This patch is one of a series: see also
- https://reviews.llvm.org/D28624

Reviewers: dberris, rengolin

Reviewed By: rengolin

Subscribers: llvm-commits, aemerson, rengolin, dberris, iid_iunknown

Differential Revision: https://reviews.llvm.org/D28623

llvm-svn: 292517

commit | commitdiff | tree

Serge Rogatch [Thu, 19 Jan 2017 20:24:23 +0000 (20:24 +0000)]

[XRay][Arm] Repair XRay table emission on Arm32 and add tests to identify such problem earlier

Summary:
Emission of XRay table was occasionally disabled for Arm32, but this bug was not then detected because earlier (also by mistake) testing of XRay was occasionally disabled on 32-bit Arm targets. This patch should fix that problem and detect such problems in the future.
This patch is one of a series, see also
- https://reviews.llvm.org/D28623

Reviewers: rengolin, dberris

Reviewed By: dberris

Subscribers: llvm-commits, aemerson, rengolin, dberris, iid_iunknown

Differential Revision: https://reviews.llvm.org/D28624

llvm-svn: 292516

commit | commitdiff | tree

Chad Rosier [Thu, 19 Jan 2017 20:06:32 +0000 (20:06 +0000)]

[Assembler] Improve error when unable to evaluate expression.

Add a SMLoc to MCExpr. Most code does not generate or consume the SMLoc (yet).

Patch by Sanne Wouda <sanne.wouda@arm.com>!
Differential Revision: https://reviews.llvm.org/D28861

llvm-svn: 292515

commit | commitdiff | tree

Evgeniy Stepanov [Thu, 19 Jan 2017 20:04:11 +0000 (20:04 +0000)]

Fix aliases to thumbfunc-based exprs to be thumbfunc.

If F is a Thumb function symbol, and G = F + const, and G is a
function symbol, then G is Thumb. Because what else could it be?

Differential Revision: https://reviews.llvm.org/D28878

llvm-svn: 292514

commit | commitdiff | tree

Rafael Espindola [Thu, 19 Jan 2017 19:51:02 +0000 (19:51 +0000)]

Also define 'end' if it is present in a .so.

I don't know of anything using it, but we should handle it like _end.

llvm-svn: 292513

commit | commitdiff | tree

Rafael Espindola [Thu, 19 Jan 2017 19:43:34 +0000 (19:43 +0000)]

Create _end symbol even if a .so defines it.

The freebsd sbrk implementation uses _end to find the initial value of
brk, so it has to be defined in the main binary.

This should fix the emacs build.

llvm-svn: 292512

commit | commitdiff | tree

Kostya Serebryany [Thu, 19 Jan 2017 19:38:12 +0000 (19:38 +0000)]

[libFuzzer] improve -minimize_crash: honor -artifact_prefix= and don't special case 2-byte inputs

llvm-svn: 292511

commit | commitdiff | tree

Xin Tong [Thu, 19 Jan 2017 19:31:40 +0000 (19:31 +0000)]

Improve what can be promoted in LICM.

Summary:
In case of non-alloca pointers, we check for whether it is a pointer
from malloc-like calls and it is not captured. In such case, we can
promote the pointer, as the caller will have no way to access this pointer
even if there is unwinding in middle of the loop.

Reviewers: hfinkel, sanjoy, reames, eli.friedman

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D28834

llvm-svn: 292510

commit | commitdiff | tree

Kostya Serebryany [Thu, 19 Jan 2017 19:07:26 +0000 (19:07 +0000)]

[libFuzzer] add two tests for experimenting with equivalence fuzzing

llvm-svn: 292509

commit | commitdiff | tree

Manman Ren [Thu, 19 Jan 2017 19:05:55 +0000 (19:05 +0000)]

Module: Improve diagnostic message when cxx modules are disabled and @import is used in Objective CXX.

rdar://problem/19399671

llvm-svn: 292508

commit | commitdiff | tree

Easwaran Raman [Thu, 19 Jan 2017 18:53:16 +0000 (18:53 +0000)]

Add an interface to scale the frequencies of a set of blocks.

The scaling is done with reference to the the new frequency of a reference block.

Differential Revision: https://reviews.llvm.org/D28535

llvm-svn: 292507

commit | commitdiff | tree

Davide Italiano [Thu, 19 Jan 2017 18:51:56 +0000 (18:51 +0000)]

[InstCombine] Simplify gep (gep p, a), (b-a)

Patch by Andrea Canciani.

Differential Revision: https://reviews.llvm.org/D27413

llvm-svn: 292506

commit | commitdiff | tree

Weiming Zhao [Thu, 19 Jan 2017 18:46:11 +0000 (18:46 +0000)]

[Builtin] [ARM] Update CMake to support the build of armv6m

Summary:
Setting -DCOMPILER_RT_TEST_TARGET_TRIPLE=armv6m-none-eabi will enable the build of builtin functions ARMv6m.
Currently, only those asms that support armv6m are added.

TODO:All asm sin ARM_EABI_Sources are ported for thumb1 so Thumb1_EABI_Sources will be deprecated.

Reviewers: rengolin, compnerd

Reviewed By: compnerd

Subscribers: aemerson, mgorny, llvm-commits

Differential Revision: https://reviews.llvm.org/D28463

llvm-svn: 292504

commit | commitdiff | tree

Simon Pilgrim [Thu, 19 Jan 2017 18:18:32 +0000 (18:18 +0000)]

[X86][SSE] Improve comments describing combineTruncatedArithmetic. NFCI.

llvm-svn: 292502

commit | commitdiff | tree

Kevin Enderby [Thu, 19 Jan 2017 18:07:22 +0000 (18:07 +0000)]

Remove this test from the r292500 commit till Chris and I figure out
why it is failing on a couple of build bots.

llvm-svn: 292501

commit | commitdiff | tree

Kevin Enderby [Thu, 19 Jan 2017 17:36:31 +0000 (17:36 +0000)]

Add support for the new LC_NOTE load command.

It describes a region of arbitrary data included in a Mach-O file.
Its initial use is to record extra data in MH_CORE files.

rdar://30001545
rdar://30001731

llvm-svn: 292500

commit | commitdiff | tree

Hafiz Abid Qadeer [Thu, 19 Jan 2017 17:32:50 +0000 (17:32 +0000)]

Provide a substitute to load command of gdb.

For bare-metal targets, lldb was missing a command like 'load' in gdb
which can be used to create executable image on the target. This was
discussed in
http://lists.llvm.org/pipermail/lldb-dev/2016-December/011752.html

This commits adds an option to "target module load" command to provide
that functionality. It does not set the PC to entry address which will
be done separately.

Reviewed in https://reviews.llvm.org/D28804

llvm-svn: 292499

commit | commitdiff | tree

Malcolm Parsons [Thu, 19 Jan 2017 17:19:22 +0000 (17:19 +0000)]

[Sema] Reword unused lambda capture warning

Summary:
The warning doesn't know why the variable was looked up but not
odr-used, so reword it to not claim that it was used in an unevaluated
context.

Reviewers: aaron.ballman

Subscribers: cfe-commits

Differential Revision: https://reviews.llvm.org/D28902

llvm-svn: 292498

commit | commitdiff | tree

Alex Lorenz [Thu, 19 Jan 2017 17:17:57 +0000 (17:17 +0000)]

[Sema] Fix PR28181 by avoiding calling BuildOverloadedBinOp in C mode

rdar://28532840

Differential Revision: https://reviews.llvm.org/D25213

llvm-svn: 292497

commit | commitdiff | tree

Sumanth Gundapaneni [Thu, 19 Jan 2017 16:54:04 +0000 (16:54 +0000)]

[Hexagon] Linux linker does not support .gnu-hash

Hexagon Linux dynamic loader does not use (in fact does not support)
.gnu-hash

Differential Revision: https://reviews.llvm.org/D28865

llvm-svn: 292496

commit | commitdiff | tree

Simon Pilgrim [Thu, 19 Jan 2017 16:25:02 +0000 (16:25 +0000)]

[X86][SSE] Attempt to pre-truncate arithmetic operations that have already been extended

As discussed on D28219 - it is profitable to combine trunc(binop (s/zext(x), s/zext(y)) to binop(trunc(s/zext(x)), trunc(s/zext(y))) assuming the trunc(ext()) will simplify further

llvm-svn: 292493

commit | commitdiff | tree

Sanjay Patel [Thu, 19 Jan 2017 16:12:10 +0000 (16:12 +0000)]

[InstCombine] icmp Pred (shl nsw X, C1), C0 --> icmp Pred X, C0 >> C1

Try harder to fold icmp with shl nsw as discussed here:
http://lists.llvm.org/pipermail/llvm-dev/2017-January/108749.html

This is similar to the 'shl nuw' transforms that were added with D25913.

This may eventually help solve:
https://llvm.org/bugs/show_bug.cgi?id=30773

Differential Revision: https://reviews.llvm.org/D28406

llvm-svn: 292492

commit | commitdiff | tree

Felix Berger [Thu, 19 Jan 2017 15:51:10 +0000 (15:51 +0000)]

[clang-tidy] Do not trigger move fix for non-copy assignment operators in performance-unnecessary-value-param check

Reviewers: alexfh, sbenza, malcolm.parsons

Subscribers: JDevlieghere, cfe-commits

Differential Revision: https://reviews.llvm.org/D28899

llvm-svn: 292491

commit | commitdiff | tree

Marshall Clow [Thu, 19 Jan 2017 15:30:36 +0000 (15:30 +0000)]

Mark two of the TS implementations as 'in progress'

llvm-svn: 292490

commit | commitdiff | tree

Pavel Labath [Thu, 19 Jan 2017 15:26:04 +0000 (15:26 +0000)]

Refactor logging in NativeProcessLinux

Use the LLDB_LOG macro instead of the more verbose if(log) ... syntax.

I have also consolidated the log channels (everything now goes to the posix
channel, instead of a mixture of posix and lldb), and cleaned up some of the
more convoluted log statements.

llvm-svn: 292489

commit | commitdiff | tree

Hafiz Abid Qadeer [Thu, 19 Jan 2017 15:11:01 +0000 (15:11 +0000)]

Avoid unused variable warning when assert is disabled.

llvm-svn: 292488

commit | commitdiff | tree

Simon Pilgrim [Thu, 19 Jan 2017 15:03:00 +0000 (15:03 +0000)]

[X86][SSE] Added tests for pre-truncating arithmetic operations that have already been extended

As discussed on D28219 - it is profitable to combine trunc(binop (s/zext(x), s/zext(y)) to binop(trunc(s/zext(x)), trunc(s/zext(y))) assuming the trunc(ext()) will simplify further

llvm-svn: 292487

commit | commitdiff | tree

Tobias Grosser [Thu, 19 Jan 2017 14:12:45 +0000 (14:12 +0000)]

BlockGenerator: Do not redundantly reload from PHI-allocas in non-affine stmts

Before this change we created an additional reload in the copy of the incoming
block of a PHI node to reload the incoming value, even though the necessary
value has already been made available by the normally generated scalar loads.
In this change, we drop the code that generates this redundant reload and
instead just reuse the scalar value already available.

Besides making the generated code slightly cleaner, this change also makes sure
that scalar loads go through the normal logic, which means they can be remapped
(e.g. to array slots) and corresponding code is generated to load from the
remapped location. Without this change, the original scalar load at the
beginning of the non-affine region would have been remapped, but the redundant
scalar load would continue to load from the old PHI slot location.

It might be possible to further simplify the code in addOperandToPHI,
but this would not only mean to pull out getNewValue, but to also change the
insertion point update logic. As this did not work when trying it the first
time, this change is likely not trivial. To not introduce bugs last minute, we
postpone further simplications to a subsequent commit.

We also document the current behavior a little bit better.

Reviewed By: Meinersbur

Differential Revision: https://reviews.llvm.org/D28892

llvm-svn: 292486

commit | commitdiff | tree

Mikael Holmen [Thu, 19 Jan 2017 13:55:55 +0000 (13:55 +0000)]

[DAG] Don't increase SDNodeOrder for dbg.value/declare.

Summary:
The SDNodeOrder is saved in the IROrder field in the SDNode, and this
field may affects scheduling. Thus, letting dbg.value/declare increase
the order numbers may in turn affect scheduling.

Because of this change we also need to update the code deciding when
dbg values should be output, in ScheduleDAGSDNodes.cpp/ProcessSDDbgValues.

Dbg values now have the same order as the SDNode they are connected to,
not the following orders.

Test cases provided by Florian Hahn.

Reviewers: bogner, aprantl, sunfish, atrick

Reviewed By: atrick

Subscribers: fhahn, probinson, andreadb, llvm-commits, MatzeB

Differential Revision: https://reviews.llvm.org/D25318

llvm-svn: 292485

commit | commitdiff | tree

Malcolm Parsons [Thu, 19 Jan 2017 13:38:19 +0000 (13:38 +0000)]

[docs] Tell Doxygen to expand LLVM_ALIGNAS to nothing

Summary:
Docs for clang::Decl and clang::TemplateSpecializationType have
not been generated since LLVM_ALIGNAS was added to them.

Tell Doxygen to expand LLVM_ALIGNAS to nothing as described at
https://www.stack.nl/~dimitri/doxygen/manual/preprocessing.html

Reviewers: aaron.ballman, klimek, alexfh

Subscribers: ioeric, cfe-commits

Differential Revision: https://reviews.llvm.org/D28850

llvm-svn: 292484

commit | commitdiff | tree

Malcolm Parsons [Thu, 19 Jan 2017 13:37:42 +0000 (13:37 +0000)]

commit | commitdiff | tree

Mikael Holmen [Thu, 19 Jan 2017 13:35:13 +0000 (13:35 +0000)]

Test commit access, remove trailing whitespace

llvm-svn: 292482

commit | commitdiff | tree

Kristof Beyls [Thu, 19 Jan 2017 13:32:14 +0000 (13:32 +0000)]

[GlobalISel] Pointers are legal operands for G_SELECT on AArch64

Differential Revision: https://reviews.llvm.org/D28805

llvm-svn: 292481

commit | commitdiff | tree

Tobias Grosser [Thu, 19 Jan 2017 13:25:52 +0000 (13:25 +0000)]

BlockGenerator: remove obfuscating const and const casts

Making certain values 'const' to just cast it away a little later mainly
obfuscates the code. Hence, we just drop the 'const' parts.

Suggested-by: Michael Kruse <llvm@meinersbur.de>
llvm-svn: 292480

commit | commitdiff | tree

Elena Demikhovsky [Thu, 19 Jan 2017 12:08:21 +0000 (12:08 +0000)]

Recommiting unsigned saturation with a bugfix.
A test case that crached is added to avx512-trunc.ll.
(PR31589)

llvm-svn: 292479

commit | commitdiff | tree

Daniel Sanders [Thu, 19 Jan 2017 11:15:55 +0000 (11:15 +0000)]

Re-commit: [globalisel] Tablegen-erate current Register Bank Information

Summary:
Adds a RegisterBank tablegen class that can be used to declare the register
banks and an associated tablegen pass to generate the necessary code.

Changes since first commit attempt:
* Added missing guards
* Added more missing guards
* Found and fixed a use-after-free bug involving Twine locals

Reviewers: t.p.northover, ab, rovka, qcolombet

Reviewed By: qcolombet

Subscribers: aditya_nandakumar, rengolin, kristof.beyls, vkalintiris, mgorny, dberris, llvm-commits, rovka

Differential Revision: https://reviews.llvm.org/D27338

llvm-svn: 292478

commit | commitdiff | tree

Malcolm Parsons [Thu, 19 Jan 2017 09:27:45 +0000 (09:27 +0000)]

commit | commitdiff | tree

Justin Bogner [Thu, 19 Jan 2017 07:51:17 +0000 (07:51 +0000)]

GlobalISel: Implement widening for shifts

llvm-svn: 292476

commit | commitdiff | tree

Craig Topper [Thu, 19 Jan 2017 07:37:45 +0000 (07:37 +0000)]

[AVX-512] Add test cases that show where we are using two subvector inserts to broadcast a 128-bit subvector into a 512-bit vector. We'd be better off using something like SHUFF32X4.

If the subvector comes from a load, we convert to SUBV_BROADCAST and use a broadcast instruction. But if there is no load we keep the inserts. I think we should create the SUBV_BROADCAST even without the load and let isel use the fallback patterns that are used if the load can't be folded. This will use the SHUFF32X4 or similar instruction for the 128-bit into 512-bit case and a single insert for 128 into 256 or 256 into 512.

This should be fixed so subvector broadcast intrinsics can be replaced with native IR since some of those currently lower directly to SHUFF32X4.

llvm-svn: 292475

commit | commitdiff | tree

Craig Topper [Thu, 19 Jan 2017 07:12:35 +0000 (07:12 +0000)]

[AVX-512] Support ADD/SUB/MUL of mask vectors

Summary:
Currently we expand and scalarize these operations, but I think we should be able to implement ADD/SUB with KXOR and MUL with KAND.

We already do this for scalar i1 operations so I just extended it to vectors of i1.

Reviewers: zvi, delena

Reviewed By: delena

Subscribers: guyblank, llvm-commits

Differential Revision: https://reviews.llvm.org/D28888

llvm-svn: 292474

commit | commitdiff | tree

Matt Arsenault [Thu, 19 Jan 2017 06:35:27 +0000 (06:35 +0000)]

AMDGPU: Disable some fneg combines unless nsz

For -(x + y) -> (-x) + (-y), if x == -y, this would
change the result from -0.0 to 0.0. Since the fma/fmad
combine is an extension of this problem it also
applies there.

fmul should be fine, and I don't think any of the unary
operators or conversions should be a problem either.

llvm-svn: 292473

commit | commitdiff | tree

Matt Arsenault [Thu, 19 Jan 2017 06:04:12 +0000 (06:04 +0000)]

AMDGPU: Remove modifiers from v_div_scale_*

They seem to produce nonsense results when used.

This should be applied to the release branch.

llvm-svn: 292472

commit | commitdiff | tree

Tobias Grosser [Thu, 19 Jan 2017 05:09:23 +0000 (05:09 +0000)]

Use range-based for loop [NFC]

llvm-svn: 292471

commit | commitdiff | tree

Tobias Grosser [Thu, 19 Jan 2017 04:54:45 +0000 (04:54 +0000)]

Improve test coverage in test/Isl/CodeGen/loop_partially_in_scop.ll [NFC]

We rename the test case with -metarenamer to make the variable names easier to
read and add additional check lines that verify the code we currently generate
for PHI nodes. This code is interesting as it contains a PHI node in a
non-affine sub-region, where some incoming blocks are within the non-affine
sub-region and others are outside of the non-affine subregion.

As can be seen in the check lines we currently load the PHI-node value twice.
This commit documents this behavior. In a subsequent patch we will try to
improve this.

llvm-svn: 292470

commit | commitdiff | tree

Craig Topper [Thu, 19 Jan 2017 03:49:29 +0000 (03:49 +0000)]

[X86] Merge LowerADD and LowerSUB into a single LowerADD_SUB since they are identical.

llvm-svn: 292469

commit | commitdiff | tree

Mike Aizatsky [Thu, 19 Jan 2017 03:49:18 +0000 (03:49 +0000)]

[sancov] applying blacklist to covered points too

Differential Revision: https://reviews.llvm.org/D28872

llvm-svn: 292468

commit | commitdiff | tree

Saleem Abdulrasool [Thu, 19 Jan 2017 02:58:46 +0000 (02:58 +0000)]

llvm-cxxfilt: filter out invalid manglings

c++filt does not attempt to demangle symbols which do not match its
expected format.  This means that the symbol must start with _Z or ___Z
(block invocation function extension).  Any other symbols are returned
as is.  Note that this is different from the behaviour of __cxa_demangle
which will demangle fragments.

llvm-svn: 292467

commit | commitdiff | tree

Craig Topper [Thu, 19 Jan 2017 02:34:29 +0000 (02:34 +0000)]

[AVX-512] Use VSHUF instructions instead of two inserts as fallback for subvector broadcasts that can't fold the load.

llvm-svn: 292466

commit | commitdiff | tree

Craig Topper [Thu, 19 Jan 2017 02:34:25 +0000 (02:34 +0000)]

[AVX-512] Add additional test cases for broadcast intrinsics that demonstates that we don't fold the loads to use a broadcast instruction.

llvm-svn: 292465

commit | commitdiff | tree

Michael Kuperstein [Thu, 19 Jan 2017 02:21:54 +0000 (02:21 +0000)]

[PM] Add LoopVectorize to the default module pipeline

LV no longer "requires" LCSSA and LoopSimplify, and instead forms
them internally as required. So, there's nothing preventing it from
being enabled.

llvm-svn: 292464

commit | commitdiff | tree

Peter Collingbourne [Thu, 19 Jan 2017 01:20:11 +0000 (01:20 +0000)]

LowerTypeTests: Implement exporting of type identifiers.

Type identifiers are exported by:
- Adding coarse-grained information about how to test the type
identifier to the summary.
- Creating symbols in the object file (aliases and absolute symbols)
containing fine-grained information about the type identifier.

Differential Revision: https://reviews.llvm.org/D28424

llvm-svn: 292462

commit | commitdiff | tree

Justin Bogner [Thu, 19 Jan 2017 01:05:48 +0000 (01:05 +0000)]

GlobalISel: Implement narrowing for G_LOAD

llvm-svn: 292461

commit | commitdiff | tree

Justin Bogner [Thu, 19 Jan 2017 01:04:46 +0000 (01:04 +0000)]

GlobalISel: Fix text wrapping in a comment. NFC

llvm-svn: 292460

commit | commitdiff | tree

Matthias Braun [Thu, 19 Jan 2017 01:04:08 +0000 (01:04 +0000)]

Use an actual valid register in test

llvm-svn: 292459

commit | commitdiff | tree

Dehao Chen [Thu, 19 Jan 2017 00:44:21 +0000 (00:44 +0000)]

Add -fdebug-info-for-profiling to emit more debug info for sample pgo profile collection

Summary:
SamplePGO uses profile with debug info to collect profile. Unlike the traditional debugging purpose, sample pgo needs more accurate debug info to represent the profile. We add -femit-accurate-debug-info for this purpose. It can be combined with all debugging modes (-g, -gmlt, etc). It makes sure that the following pieces of info is always emitted:

* start line of all subprograms
* linkage name of all subprograms
* standalone subprograms (functions that has neither inlined nor been inlined)

The impact on speccpu2006 binary size (size increase comparing with -g0 binary, also includes data for -g binary, which does not change with this patch):

               -gmlt(orig) -gmlt(patched) -g
433.milc       4.68%       5.40%          19.73%
444.namd       8.45%       8.93%          45.99%
447.dealII     97.43%      115.21%        374.89%
450.soplex     27.75%      31.88%         126.04%
453.povray     21.81%      26.16%         92.03%
470.lbm        0.60%       0.67%          1.96%
482.sphinx3    5.77%       6.47%          26.17%
400.perlbench  17.81%      19.43%         73.08%
401.bzip2      3.73%       3.92%          12.18%
403.gcc        31.75%      34.48%         122.75%
429.mcf        0.78%       0.88%          3.89%
445.gobmk      6.08%       7.92%          42.27%
456.hmmer      10.36%      11.25%         35.23%
458.sjeng      5.08%       5.42%          14.36%
462.libquantum 1.71%       1.96%          6.36%
464.h264ref    15.61%      16.56%         43.92%
471.omnetpp    11.93%      15.84%         60.09%
473.astar      3.11%       3.69%          14.18%
483.xalancbmk  56.29%      81.63%         353.22%
geomean        15.60%      18.30%         57.81%

Debug info size change for -gmlt binary with this patch:

433.milc       13.46%
444.namd       5.35%
447.dealII     18.21%
450.soplex     14.68%
453.povray     19.65%
470.lbm        6.03%
482.sphinx3    11.21%
400.perlbench  8.91%
401.bzip2      4.41%
403.gcc        8.56%
429.mcf        8.24%
445.gobmk      29.47%
456.hmmer      8.19%
458.sjeng      6.05%
462.libquantum 11.23%
464.h264ref    5.93%
471.omnetpp    31.89%
473.astar      16.20%
483.xalancbmk  44.62%
geomean        16.83%

Reviewers: davidxl, andreadb, rob.lougher, dblaikie, echristo

Reviewed By: dblaikie, echristo

Subscribers: hfinkel, rob.lougher, andreadb, gbedwell, cfe-commits, probinson, llvm-commits, mehdi_amini

Differential Revision: https://reviews.llvm.org/D25435

llvm-svn: 292458

commit | commitdiff | tree

Dehao Chen [Thu, 19 Jan 2017 00:44:11 +0000 (00:44 +0000)]

Add -debug-info-for-profiling to emit more debug info for sample pgo profile collection

Summary:
SamplePGO binaries built with -gmlt to collect profile. The current -gmlt debug info is limited, and we need some additional info:

* start line of all subprograms
* linkage name of all subprograms
* standalone subprograms (functions that has neither inlined nor been inlined)

This patch adds these information to the -gmlt binary. The impact on speccpu2006 binary size (size increase comparing with -g0 binary, also includes data for -g binary, which does not change with this patch):

               -gmlt(orig) -gmlt(patched) -g
433.milc       4.68%       5.40%          19.73%
444.namd       8.45%       8.93%          45.99%
447.dealII     97.43%      115.21%        374.89%
450.soplex     27.75%      31.88%         126.04%
453.povray     21.81%      26.16%         92.03%
470.lbm        0.60%       0.67%          1.96%
482.sphinx3    5.77%       6.47%          26.17%
400.perlbench  17.81%      19.43%         73.08%
401.bzip2      3.73%       3.92%          12.18%
403.gcc        31.75%      34.48%         122.75%
429.mcf        0.78%       0.88%          3.89%
445.gobmk      6.08%       7.92%          42.27%
456.hmmer      10.36%      11.25%         35.23%
458.sjeng      5.08%       5.42%          14.36%
462.libquantum 1.71%       1.96%          6.36%
464.h264ref    15.61%      16.56%         43.92%
471.omnetpp    11.93%      15.84%         60.09%
473.astar      3.11%       3.69%          14.18%
483.xalancbmk  56.29%      81.63%         353.22%
geomean        15.60%      18.30%         57.81%

Debug info size change for -gmlt binary with this patch:

433.milc       13.46%
444.namd       5.35%
447.dealII     18.21%
450.soplex     14.68%
453.povray     19.65%
470.lbm        6.03%
482.sphinx3    11.21%
400.perlbench  8.91%
401.bzip2      4.41%
403.gcc        8.56%
429.mcf        8.24%
445.gobmk      29.47%
456.hmmer      8.19%
458.sjeng      6.05%
462.libquantum 11.23%
464.h264ref    5.93%
471.omnetpp    31.89%
473.astar      16.20%
483.xalancbmk  44.62%
geomean        16.83%

Reviewers: davidxl, echristo, dblaikie

Reviewed By: echristo, dblaikie

Subscribers: aprantl, probinson, llvm-commits, mehdi_amini

Differential Revision: https://reviews.llvm.org/D25434

llvm-svn: 292457

commit | commitdiff | tree

Michael Kuperstein [Thu, 19 Jan 2017 00:42:28 +0000 (00:42 +0000)]

[LV] Run loop-simplify and LCSSA explicitly instead of "requiring" them

This changes the vectorizer to explicitly use the loopsimplify and lcssa utils,
instead of "requiring" the transformations as if they were analyses.

This is not NFC, since it changes the LCSSA behavior - we no longer run LCSSA
for all loops, but rather only for the loops we expect to modify.

Differential Revision: https://reviews.llvm.org/D28868

llvm-svn: 292456

commit | commitdiff | tree

Matthias Braun [Thu, 19 Jan 2017 00:32:13 +0000 (00:32 +0000)]

LiveIntervalAnalysis: Cleanup; NFC

- Fix doxygen comments: Do not repeat name, remove duplicated doxygen
comment (on declaration + implementation), etc.
- Use more range based for

llvm-svn: 292455

commit | commitdiff | tree

Jason Molenda [Thu, 19 Jan 2017 00:20:29 +0000 (00:20 +0000)]

Fix a problem with the new dyld interface code -- when a new process
starts up, we need to clear the target's image list and only add
the binaries into the target that are actually present in this
process run.

<rdar://problem/29857613>

llvm-svn: 292454

commit | commitdiff | tree

Artem Belevich [Thu, 19 Jan 2017 00:14:45 +0000 (00:14 +0000)]

[NVPTX] Fix lowering of fp16 ISD::FNEG.

There's no neg.f16 instruction, so negation has to
be done via subtraction from zero.

Differential Revision: https://reviews.llvm.org/D28876

llvm-svn: 292452

commit | commitdiff | tree

Peter Collingbourne [Thu, 19 Jan 2017 00:04:44 +0000 (00:04 +0000)]

Add llvm-dis dependency to check-clang.

llvm-svn: 292450

commit | commitdiff | tree

Eli Friedman [Wed, 18 Jan 2017 23:56:42 +0000 (23:56 +0000)]

[SCEV] Make getUDivExactExpr handle non-nuw multiplies correctly.

To avoid regressions, make ScalarEvolution::createSCEV a bit more
clever.

Also get rid of some useless code in ScalarEvolution::howFarToZero
which was hiding this bug.

No new testcase because it's impossible to actually expose this bug:
we don't have any in-tree users of getUDivExactExpr besides the two
functions I just mentioned, and they both dodged the problem. I'll
try to add some interesting users in a followup.

Differential Revision: https://reviews.llvm.org/D28587

llvm-svn: 292449

commit | commitdiff | tree

Peter Collingbourne [Wed, 18 Jan 2017 23:55:27 +0000 (23:55 +0000)]

Move vtable type metadata emission behind a cc1-level flag.

In ThinLTO mode, type metadata will require the module to be written as a
multi-module bitcode file, which is currently incompatible with the Darwin
linker. It is also useful to be able to enable or disable multi-module bitcode
for testing purposes. This introduces a cc1-level flag, -f{,no-}lto-unit,
which is used by the driver to enable multi-module bitcode on all but
Darwin+ThinLTO, and can also be used to enable/disable the feature manually.

Differential Revision: https://reviews.llvm.org/D28877

llvm-svn: 292448

commit | commitdiff | tree

Eli Friedman [Wed, 18 Jan 2017 23:26:37 +0000 (23:26 +0000)]

Preserve domtree and loop-simplify for runtime unrolling.

Mostly straightforward changes; we just didn't do the computation before.
One sort of interesting change in LoopUnroll.cpp: we weren't handling
dominance for children of the loop latch correctly, but
foldBlockIntoPredecessor hid the problem for complete unrolling.

Currently punting on loop peeling; made some minor changes to isolate
that problem to LoopUnrollPeel.cpp.

Adds a flag -unroll-verify-domtree; it verifies the domtree immediately
after we finish updating it. This is on by default for +Asserts builds.

Differential Revision: https://reviews.llvm.org/D28073

llvm-svn: 292447

commit | commitdiff | tree

Krzysztof Parzyszek [Wed, 18 Jan 2017 23:12:19 +0000 (23:12 +0000)]

Treat segment [B, E) as not overlapping block with boundaries [A, B)

llvm-svn: 292446

commit | commitdiff | tree

Krzysztof Parzyszek [Wed, 18 Jan 2017 23:11:40 +0000 (23:11 +0000)]

[Hexagon] Remove dead defs from the live set when expanding wstores

llvm-svn: 292445

commit | commitdiff | tree

Michael Kuperstein [Wed, 18 Jan 2017 23:05:58 +0000 (23:05 +0000)]

Revert r291670 because it introduces a crash.

r291670 doesn't crash on the original testcase from PR31589,
but it crashes on a slightly more complex one.

PR31589 has the new reproducer.

llvm-svn: 292444

commit | commitdiff | tree

Stephan T. Lavavej [Wed, 18 Jan 2017 22:19:14 +0000 (22:19 +0000)]

[libcxx] [test] Add msvc_stdlib_force_include.hpp.

No functional change; nothing includes this, instead our test harness
injects it via the /FI compiler option.

No code review; blessed in advance by EricWF.

llvm-svn: 292443

commit | commitdiff | tree

Mehdi Amini [Wed, 18 Jan 2017 21:37:11 +0000 (21:37 +0000)]

Improve the `-filter-print-funcs` option to skip the banner for CGSCC pass when nothing is to be printed

Before, it would print a sequence of:

  *** IR Dump After Function Integration/Inlining ******
  *** IR Dump After Function Integration/Inlining ******
  *** IR Dump After Function Integration/Inlining ******
  ...

for every single function in the module.

llvm-svn: 292442

commit | commitdiff | tree

Sanjay Patel [Wed, 18 Jan 2017 21:31:21 +0000 (21:31 +0000)]

[InstCombine] add tests for shl nsw with icmp eq/ne; NFCI

These should be fixed with D28406.

llvm-svn: 292441

commit | commitdiff | tree

Sanjay Patel [Wed, 18 Jan 2017 21:16:12 +0000 (21:16 +0000)]

[InstCombine] add an assert to make a shl+icmp transform assumption explicit; NFCI

llvm-svn: 292440

commit | commitdiff | tree

David Blaikie [Wed, 18 Jan 2017 21:15:18 +0000 (21:15 +0000)]

Remove now redundant code that ensured debug info for class definitions was emitted under certain circumstances

Introduced in r181561 - it may've been subsumed by work done to allow
emission of declarations for vtable types while still emitting some of
their member functions correctly for those declarations. Whatever the
reason, the tests pass without this code now.

llvm-svn: 292439

commit | commitdiff | tree

Haicheng Wu [Wed, 18 Jan 2017 21:12:10 +0000 (21:12 +0000)]

[CodeGenPrepare] Fix a typo in the comment. NFC.

encode => endcode.

Differential Revision: https://reviews.llvm.org/D28866

llvm-svn: 292438

commit | commitdiff | tree

Arpith Chacko Jacob [Wed, 18 Jan 2017 20:40:48 +0000 (20:40 +0000)]

[OpenMP] Support for the if-clause on the combined directive 'target parallel'.

The if-clause on the combined directive potentially applies to both the
'target' and the 'parallel' regions.  Codegen'ing the if-clause on the
combined directive requires additional support because the expression in
the clause must be captured by the 'target' capture statement but not
the 'parallel' capture statement.  Note that this situation arises for
other clauses such as num_threads.

The OMPIfClause class inherits OMPClauseWithPreInit to support capturing
of expressions in the clause.  A member CaptureRegion is added to
OMPClauseWithPreInit to indicate which captured statement (in this case
'target' but not 'parallel') captures these expressions.

To ensure correct codegen of captured expressions in the presence of
combined 'target' directives, OMPParallelScope was added to 'parallel'
codegen.

Reviewers: ABataev
Differential Revision: https://reviews.llvm.org/D28781

llvm-svn: 292437

commit | commitdiff | tree

Graydon Hoare [Wed, 18 Jan 2017 20:36:59 +0000 (20:36 +0000)]

[ASTReader] Add a DeserializationListener callback for IMPORTED_MODULES

Summary:
Add a callback from ASTReader to DeserializationListener when the former
reads an IMPORTED_MODULES block. This supports Swift in using PCH for
bridging headers.

Reviewers: doug.gregor, manmanren, bruno

Reviewed By: manmanren

Subscribers: cfe-commits

Differential Revision: https://reviews.llvm.org/D28779

llvm-svn: 292436

commit | commitdiff | tree

Graydon Hoare [Wed, 18 Jan 2017 20:34:44 +0000 (20:34 +0000)]

[Modules] Correct test comment from obsolete earlier version of code. NFC

Summary:
Code committed in rL290219 went through a few iterations; test wound up with
stale comment.

Reviewers: doug.gregor, manmanren

Reviewed By: manmanren

Subscribers: cfe-commits

Differential Revision: https://reviews.llvm.org/D28790

llvm-svn: 292435

commit | commitdiff | tree

Stephan T. Lavavej [Wed, 18 Jan 2017 20:10:25 +0000 (20:10 +0000)]

[libcxx] [test] Fix comment typos, strip trailing whitespace.

No functional change, no code review.

llvm-svn: 292434

commit | commitdiff | tree

Sanjay Patel [Wed, 18 Jan 2017 20:09:59 +0000 (20:09 +0000)]

[InstCombine] remove a redundant check; NFCI

I missed deleting this check when I refactored this chunk in:
https://reviews.llvm.org/rL292260

llvm-svn: 292433

commit | commitdiff | tree

Stephan T. Lavavej [Wed, 18 Jan 2017 20:09:56 +0000 (20:09 +0000)]

[libcxx] [test] Fix MSVC warnings C4127 and C6326 about constants.

MSVC has compiler warnings C4127 "conditional expression is constant" (enabled
by /W4) and C6326 "Potential comparison of a constant with another constant"
(enabled by /analyze). They're potentially useful, although they're slightly
annoying to library devs who know what they're doing. In the latest version of
the compiler, C4127 is suppressed when the compiler sees simple tests like
"if (name_of_thing)", so extracting comparison expressions into named
constants is a workaround. At the same time, using std::integral_constant
avoids C6326, which doesn't look at template arguments.

test/std/containers/sequences/vector.bool/emplace.pass.cpp
Replace 1 == 1 with true, which is the same as far as the library is concerned.

Fixes D28837.

llvm-svn: 292432

commit | commitdiff | tree

Peter Collingbourne [Wed, 18 Jan 2017 20:03:02 +0000 (20:03 +0000)]

ThinLTOBitcodeWriter: Clear comdats on filtered globals.

Differential Revision: https://reviews.llvm.org/D28839

llvm-svn: 292431

commit | commitdiff | tree

Peter Collingbourne [Wed, 18 Jan 2017 20:02:31 +0000 (20:02 +0000)]

Cloning: Copy comdats when cloning globals.

Differential Revision: https://reviews.llvm.org/D28838

llvm-svn: 292430

commit | commitdiff | tree

Arpith Chacko Jacob [Wed, 18 Jan 2017 19:35:00 +0000 (19:35 +0000)]

[OpenMP] Codegen for the 'target parallel' directive on the NVPTX device.

This patch adds codegen for the 'target parallel' directive on the NVPTX
device. We term offload OpenMP directives such as 'target parallel' and
'target teams distribute parallel for' as SPMD constructs. SPMD constructs,
in contrast to Generic ones like the plain 'target', can never contain
a serial region.

SPMD constructs can be handled more efficiently on the GPU and do not
require the Warp Loop of the Generic codegen scheme. This patch adds
SPMD codegen support for 'target parallel' on the NVPTX device and can
be reused for other SPMD constructs.

Reviewers: ABataev
Differential Revision: https://reviews.llvm.org/D28755

llvm-svn: 292428

commit | commitdiff | tree

Richard Smith [Wed, 18 Jan 2017 19:19:22 +0000 (19:19 +0000)]

PR9551: Implement DR1004 (wg21.link/cwg1004).

This rule permits the injected-class-name of a class template to be used as
both a template type argument and a template template argument, with no extra
syntax required to disambiguate.

llvm-svn: 292426

commit | commitdiff | tree

Michael Kuperstein [Wed, 18 Jan 2017 19:05:48 +0000 (19:05 +0000)]

Fix up a comment. NFC.

llvm-svn: 292425

commit | commitdiff | tree

Michael Kuperstein [Wed, 18 Jan 2017 19:02:52 +0000 (19:02 +0000)]

[LV] Allow reductions that have several uses outside the loop

We currently check whether a reduction has a single outside user. We don't
really need to require that - we just need to make sure a single value is
used externally. The number of external users of that value shouldn't actually
matter.

Differential Revision: https://reviews.llvm.org/D28830

llvm-svn: 292424

commit | commitdiff | tree

Justin Bogner [Wed, 18 Jan 2017 19:01:58 +0000 (19:01 +0000)]

cmake: Only sanitize use-after-scope if the host compiler supports it

In r292256, we started adding -fsanitize-use-after-scope when using
the address sanitizer, but that flag wasn't always available. This
fixes the config to only add the flag if the host compiler supports
it.

llvm-svn: 292423

Domain: System / Toolchain;

RSS Atom