md/raid5: fetch_block must fetch all the blocks handle_stripe_dirtying wants.
authorNeilBrown <neilb@suse.de>
Wed, 3 Dec 2014 05:07:58 +0000 (16:07 +1100)
committerJiri Slaby <jslaby@suse.cz>
Thu, 29 Jan 2015 14:45:14 +0000 (15:45 +0100)
commit361efd1a32c21485081012543c7a62d935e6bff2
tree375ea6affaaed710c46c6db5501b1c3b29f7bd86
parentc367abe482eaef94fb094726c6b06b9435c68198
md/raid5: fetch_block must fetch all the blocks handle_stripe_dirtying wants.

commit 108cef3aa41669610e1836fe638812dd067d72de upstream.

It is critical that fetch_block() and handle_stripe_dirtying()
are consistent in their analysis of what needs to be loaded.
Otherwise raid5 can wait forever for a block that won't be loaded.

Currently when writing to a RAID5 that is resyncing, to a location
beyond the resync offset, handle_stripe_dirtying chooses a
reconstruct-write cycle, but fetch_block() assumes a
read-modify-write, and a lockup can happen.

So treat that case just like RAID6, just as we do in
handle_stripe_dirtying.  RAID6 always does reconstruct-write.

This bug was introduced when the behaviour of handle_stripe_dirtying
was changed in 3.7, so the patch is suitable for any kernel since,
though it will need careful merging for some versions.

Cc: stable@vger.kernel.org (v3.7+)
Fixes: a7854487cd7128a30a7f4f5259de9f67d5efb95f
Reported-by: Henry Cai <henryplusplus@gmail.com>
Signed-off-by: NeilBrown <neilb@suse.de>
Signed-off-by: Jiri Slaby <jslaby@suse.cz>
drivers/md/raid5.c