Add LLL_MUTEX_READ_LOCK [BZ #28537]
authorH.J. Lu <hjl.tools@gmail.com>
Wed, 3 Nov 2021 01:33:07 +0000 (18:33 -0700)
committerH.J. Lu <hjl.tools@gmail.com>
Fri, 12 Nov 2021 18:32:09 +0000 (10:32 -0800)
commitd672a98a1af106bd68deb15576710cd61363f7a6
treecb697ddc3004aa47fefea927dea0c7e2f4d5ae79
parent49302b8fdf9103b6fc0a398678668a22fa19574c
Add LLL_MUTEX_READ_LOCK [BZ #28537]

CAS instruction is expensive.  From the x86 CPU's point of view, getting
a cache line for writing is more expensive than reading.  See Appendix
A.2 Spinlock in:

https://www.intel.com/content/dam/www/public/us/en/documents/white-papers/xeon-lock-scaling-analysis-paper.pdf

The full compare and swap will grab the cache line exclusive and cause
excessive cache line bouncing.

Add LLL_MUTEX_READ_LOCK to do an atomic load and skip CAS in spinlock
loop if compare may fail to reduce cache line bouncing on contended locks.

Reviewed-by: Szabolcs Nagy <szabolcs.nagy@arm.com>
nptl/pthread_mutex_lock.c