nvme-fabrics: fix state check in nvmf_ctlr_matches_baseopts()
author Uday Shankar <ushankar@purestorage.com>
Thu, 20 Jan 2022 20:17:37 +0000 (12:17 -0800)
committer Christoph Hellwig <hch@lst.de>
Thu, 3 Feb 2022 06:30:57 +0000 (07:30 +0100)
commit 6a51abdeb259a56d95f13cc67e3a0838bcda0377
tree 6033be756fc3df62fc23ccbfe0f62f53963010c2
parent b6bb1722f34bbdbabed27acdceaf585d300c5fd2

A controller deletion or reset that is immediately followed by, or
concurrent with, a reconnect hard-fails the connect attempt, resulting
in a complete loss of connectivity to the controller.

In the connect request, fabrics looks for an existing controller with
the same address components and aborts the connect if a controller
already exists and the duplicate connect option isn't set. The match
routine filters out controllers that are dead or dying, so they don't
interfere with the new connect request.

When NVME_CTRL_DELETING_NOIO was added, the state filters in the
nvmf_ctlr_matches_baseopts() routine were not updated. Thus, a
controller in this new state is seen as a live controller, and the
connect request fails.

Correct by adding the NVME_CTRL_DELETING_NOIO state to the match checks.
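
The effect of the missing state can be modelled in a small userspace
sketch. This is not the kernel code: the types, field names and function
names below (struct ctrl, struct ctrl_opts, matches_baseopts_old(),
matches_baseopts_fixed(), connect_rejected()) are hypothetical
simplifications, and only the subsystem and host NQN strings are
compared. It only illustrates how a controller in the DELETING_NOIO
state blocks a new connect until that state is filtered out.

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical, simplified stand-ins for the kernel's controller states. */
enum ctrl_state {
	CTRL_LIVE,
	CTRL_RESETTING,
	CTRL_DELETING,
	CTRL_DELETING_NOIO,
	CTRL_DEAD,
};

/* Only the address components used by this sketch. */
struct ctrl_opts {
	const char *subsysnqn;
	const char *hostnqn;
};

struct ctrl {
	enum ctrl_state state;
	struct ctrl_opts opts;
};

/* Model of the check before the fix: DELETING_NOIO is not filtered out. */
bool matches_baseopts_old(const struct ctrl *c, const struct ctrl_opts *o)
{
	if (c->state == CTRL_DELETING ||
	    c->state == CTRL_DEAD ||
	    strcmp(o->subsysnqn, c->opts.subsysnqn) ||
	    strcmp(o->hostnqn, c->opts.hostnqn))
		return false;
	return true;
}

/* Model of the check after the fix: every dead or dying state is skipped. */
bool matches_baseopts_fixed(const struct ctrl *c, const struct ctrl_opts *o)
{
	if (c->state == CTRL_DELETING ||
	    c->state == CTRL_DELETING_NOIO ||
	    c->state == CTRL_DEAD ||
	    strcmp(o->subsysnqn, c->opts.subsysnqn) ||
	    strcmp(o->hostnqn, c->opts.hostnqn))
		return false;
	return true;
}

/*
 * Model of the connect-time scan: the connect is rejected when an existing
 * controller matches and the duplicate connect option is not set.
 */
bool connect_rejected(const struct ctrl *ctrls, size_t n,
		      const struct ctrl_opts *o, bool duplicate_connect,
		      bool (*match)(const struct ctrl *,
				    const struct ctrl_opts *))
{
	size_t i;

	if (duplicate_connect)
		return false;
	for (i = 0; i < n; i++)
		if (match(&ctrls[i], o))
			return true;
	return false;
}
```

With a single existing controller in CTRL_DELETING_NOIO and identical
options, connect_rejected() reports a rejection when given
matches_baseopts_old() and lets the connect proceed when given
matches_baseopts_fixed().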

Fixes: ecca390e8056 ("nvme: fix deadlock in disconnect during scan_work and/or ana_work")
Cc: <stable@vger.kernel.org> # v5.7+
Signed-off-by: Uday Shankar <ushankar@purestorage.com>
Reviewed-by: James Smart <jsmart2021@gmail.com>
Reviewed-by: Sagi Grimberg <sagi@grimberg.me>
Signed-off-by: Christoph Hellwig <hch@lst.de>
drivers/nvme/host/fabrics.h