ARM: tlb: don't perform inner-shareable invalidation for local TLB ops
authorWill Deacon <will.deacon@arm.com>
Mon, 11 Feb 2013 13:47:48 +0000 (13:47 +0000)
committerWill Deacon <will.deacon@arm.com>
Mon, 12 Aug 2013 11:25:44 +0000 (12:25 +0100)
commitf0915781bd5edf78b1154e61efe962dc15872d09
treea8dc2c7b3c4f0f437171a80c6bde06b020418dc3
parent792a843a9f353d3e2474b6f5057b7eaecba41675
ARM: tlb: don't perform inner-shareable invalidation for local TLB ops

Inner-shareable TLB invalidation is typically more expensive than local
(non-shareable) invalidation, so performing the broadcasting for
local_flush_tlb_* operations is a waste of cycles and needlessly
clobbers entries in the TLBs of other CPUs.

This patch introduces __flush_tlb_* versions for many of the TLB
invalidation functions, which only respect inner-shareable variants of
the invalidation instructions when presented with the TLB_V7_UIS_FULL
flag. The local version is also inlined to prevent SMP_ON_UP kernels
from missing flushes, where the __flush variant would be called with
the UP flags.

This gains us around 0.5% in hackbench scores for a dual-core A15, but I
would expect this to improve as more cores (and clusters) are added to
the equation.

Reviewed-by: Catalin Marinas <catalin.marinas@arm.com>
Reported-by: Albin Tonnerre <Albin.Tonnerre@arm.com>
Signed-off-by: Will Deacon <will.deacon@arm.com>
arch/arm/include/asm/tlbflush.h
arch/arm/kernel/smp_tlb.c
arch/arm/mm/context.c