With -mcpu=generic the loop alignment is currently 4.
authorWilco Dijkstra <wdijkstr@arm.com>
Thu, 4 May 2017 17:43:43 +0000 (17:43 +0000)
committerWilco Dijkstra <wilco@gcc.gnu.org>
Thu, 4 May 2017 17:43:43 +0000 (17:43 +0000)
commit6b13482bb6c33d2d37494ec33e2028fcbe4e0635
tree476b428a993a41239bf817f3a96793133a260e2b
parent0f2c4a8f1d2903f9c07b86bf8faeabe1a01ed34b
With -mcpu=generic the loop alignment is currently 4.

With -mcpu=generic the loop alignment is currently 4.  All but one of the
supported cores use 8 or higher.  Since using 8 provides performance gains
on several cores, it is best to use that by default.  As discussed in [1],
the jump alignment has no effect on performance, yet has a relatively high
codesize cost [2], so setting it to 4 is best.  This gives a 0.2% overall
codesize improvement as well as performance gains in several benchmarks.

    gcc/
* config/aarch64/aarch64.c (generic_tunings): Set jump alignment to 4.
Set loop alignment to 8.

[1] https://gcc.gnu.org/ml/gcc-patches/2017-04/msg00574.html
[2] https://gcc.gnu.org/ml/gcc-patches/2016-06/msg02075.html

From-SVN: r247608
gcc/ChangeLog
gcc/config/aarch64/aarch64.c