locking/rwsem: Prevent potential lock starvation
author Waiman Long <longman@redhat.com>
Sat, 21 Nov 2020 04:14:13 +0000 (23:14 -0500)
committer Peter Zijlstra <peterz@infradead.org>
Wed, 9 Dec 2020 16:08:48 +0000 (17:08 +0100)
The lock handoff bit was added in commit 4f23dbc1e657 ("locking/rwsem:
Implement lock handoff to prevent lock starvation") to avoid lock
starvation. However, allowing readers to do optimistic spinning
introduces an unlikely scenario in which lock starvation can still
happen.

The lock handoff bit may only be set when a waiter is being woken up.
In the case of reader unlock, a wakeup happens only when the reader count
reaches 0. If there is a continuous stream of incoming readers acquiring
the read lock via optimistic spinning, the reader count may never reach
0, and so the handoff bit will never be asserted.
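
The scenario above can be illustrated with a toy model. This is only a
sketch, not kernel code: first_wakeup_step() and its parameters are
hypothetical names, and the model assumes exactly one reader arrives
and one holder unlocks per step.

```c
#include <stdbool.h>

/*
 * Toy model (not kernel code): "holders" readers own the lock. On every
 * step one new reader arrives and then one current holder unlocks.
 * Returns the step at which the reader count first reaches 0 (the only
 * point where a reader unlock would issue a wakeup), or -1 if that never
 * happens within "steps".
 */
static int first_wakeup_step(int holders, int steps, bool readers_spin)
{
	int rcnt = holders;

	for (int step = 0; step < steps; step++) {
		if (readers_spin)
			rcnt++;	/* newcomer acquires the lock by spinning */
		/* otherwise the newcomer sleeps in the wait queue */

		rcnt--;		/* one holder leaves its critical section */
		if (rcnt == 0)
			return step;
	}
	return -1;
}
```

With two initial holders, the model never reaches a wakeup point while
newcomers are allowed to spin, whereas it drains to 0 after two unlocks
once newcomers are queued instead.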

One way to prevent this scenario from happening is to disallow optimistic
spinning if the rwsem is currently owned by readers. If the previous
or current owner is a writer, optimistic spinning will be allowed.

If the previous owner is a reader but the reader count has reached 0
before, a wakeup should have been issued. So the handoff mechanism
will kick in to prevent lock starvation. As a result, it should
be OK to do optimistic spinning in this case.
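
The rule described above can be sketched as a standalone predicate. The
bit definitions below are assumptions modeled on kernel/locking/rwsem.c
and are given here only for illustration; reader_should_queue() is a
hypothetical helper, not a function in the kernel. The sketch also
assumes that "count" already includes the incoming reader's own bias,
which is why the comparison is against 1 rather than 0.

```c
#include <stdbool.h>

/* Assumed bit layout, modeled on kernel/locking/rwsem.c; illustrative only. */
#define RWSEM_WRITER_LOCKED	(1UL << 0)	/* count: a writer holds the lock */
#define RWSEM_READER_SHIFT	8		/* count: reader count starts here */
#define RWSEM_READER_BIAS	(1UL << RWSEM_READER_SHIFT)
#define RWSEM_READER_OWNED	(1UL << 0)	/* owner: last owner was a reader */

/*
 * Hypothetical helper: returns true when an incoming reader should skip
 * optimistic spinning and go straight to the wait queue.
 * "count" is assumed to already include the caller's own reader bias.
 */
static bool reader_should_queue(long count, long owner)
{
	long rcnt = count >> RWSEM_READER_SHIFT;

	return (owner & RWSEM_READER_OWNED) && rcnt > 1 &&
	       !(count & RWSEM_WRITER_LOCKED);
}
```

Under these assumptions, a reader-owned lock with at least one other
active reader sends the newcomer to the queue, while a reader count of 1
(only the caller's own bias) or a writer as the last owner leaves
optimistic spinning enabled.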

This patch may have some impact on reader performance as it reduces
reader optimistic spinning, especially if the lock critical sections
are short and the number of contending readers is small.

Signed-off-by: Waiman Long <longman@redhat.com>
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Reviewed-by: Davidlohr Bueso <dbueso@suse.de>
Link: https://lkml.kernel.org/r/20201121041416.12285-3-longman@redhat.com
kernel/locking/rwsem.c

index 5768b90223c06c5288694105d9fc791ad1850540..c055f4b28b23c6529f9a21048b72ab0890fa0888 100644
@@ -1010,16 +1010,27 @@ rwsem_spin_on_owner(struct rw_semaphore *sem, unsigned long nonspinnable)
 static struct rw_semaphore __sched *
 rwsem_down_read_slowpath(struct rw_semaphore *sem, long count, int state)
 {
-       long adjustment = -RWSEM_READER_BIAS;
+       long owner, adjustment = -RWSEM_READER_BIAS;
+       long rcnt = (count >> RWSEM_READER_SHIFT);
        struct rwsem_waiter waiter;
        DEFINE_WAKE_Q(wake_q);
        bool wake = false;
 
+       /*
+        * To prevent a constant stream of readers from starving a sleeping
+        * waiter, don't attempt optimistic spinning if the lock is currently
+        * owned by readers.
+        */
+       owner = atomic_long_read(&sem->owner);
+       if ((owner & RWSEM_READER_OWNED) && (rcnt > 1) &&
+          !(count & RWSEM_WRITER_LOCKED))
+               goto queue;
+
        /*
         * Save the current read-owner of rwsem, if available, and the
         * reader nonspinnable bit.
         */
-       waiter.last_rowner = atomic_long_read(&sem->owner);
+       waiter.last_rowner = owner;
        if (!(waiter.last_rowner & RWSEM_READER_OWNED))
                waiter.last_rowner &= RWSEM_RD_NONSPINNABLE;