io_uring: save atomic dec for inline executed reqs
authorPavel Begunkov <asml.silence@gmail.com>
Tue, 19 Jan 2021 13:32:47 +0000 (13:32 +0000)
committerJens Axboe <axboe@kernel.dk>
Mon, 1 Feb 2021 17:02:42 +0000 (10:02 -0700)
commite342c807f556dbcee1370ab78af1d8faf497d771
treeab6f899c1118e520edc267eadc9e47df521913a1
parent9affd664f0e0512d8997dbdddb1448a4faf9bc82
io_uring: save atomic dec for inline executed reqs

When a request is completed with comp_state, its completion reference
put is deferred to io_submit_flush_completions(), but the submission
is put not far from there, so do it together to save one atomic dec per
request. That targets requests that complete inline, e.g. buffered rw,
send/recv.

Proper benchmarking haven't been conducted but for nops(batch=32) it was
around 7901 vs 8117 KIOPS (~2.7%), or ~4% per perf profiling.

Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
Signed-off-by: Jens Axboe <axboe@kernel.dk>
fs/io_uring.c