drm/amdgpu: tighten gpu_recover in mailbox_flr to avoid duplicate recover in sriov
author wentalou <Wentao.Lou@amd.com>
Wed, 30 Jan 2019 03:13:01 +0000 (11:13 +0800)
committer Alex Deucher <alexander.deucher@amd.com>
Wed, 13 Feb 2019 22:50:13 +0000 (17:50 -0500)
In SR-IOV, calling gpu_recover inside xgpu_ai_mailbox_flr_work causes a
duplicate recovery when TDR is enabled, because TDR already triggers
gpu_recover from amdgpu_job_timedout. Only recover from the FLR work when
TDR is not in use; this avoids vk-cts failures caused by the unexpected
extra recovery.

Signed-off-by: Wentao Lou <Wentao.Lou@amd.com>
Acked-by: Andrey Grodzovsky <andrey.grodzovsky@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
drivers/gpu/drm/amd/amdgpu/mxgpu_ai.c

index b11a1c1..73851eb 100644
@@ -266,7 +266,8 @@ flr_done:
        }
 
        /* Trigger recovery for world switch failure if no TDR */
-       if (amdgpu_device_should_recover_gpu(adev))
+       if (amdgpu_device_should_recover_gpu(adev)
+               && amdgpu_lockup_timeout == MAX_SCHEDULE_TIMEOUT)
                amdgpu_device_gpu_recover(adev, NULL);
 }
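
The tightened condition can be modeled outside the kernel as a small predicate. This is a minimal userspace sketch, not driver code: flr_work_should_recover and SKETCH_MAX_SCHEDULE_TIMEOUT are hypothetical names introduced here, the boolean argument stands in for the result of amdgpu_device_should_recover_gpu(adev), and the long argument stands in for amdgpu_lockup_timeout. It assumes, following the comment in the hunk, that a lockup timeout equal to MAX_SCHEDULE_TIMEOUT is the "no TDR" case.

```c
#include <assert.h>
#include <limits.h>
#include <stdbool.h>

/* Stand-in for the kernel's MAX_SCHEDULE_TIMEOUT, which is LONG_MAX. */
#define SKETCH_MAX_SCHEDULE_TIMEOUT LONG_MAX

/*
 * Hypothetical model of the patched check in the FLR work handler.
 * should_recover_gpu: stand-in for amdgpu_device_should_recover_gpu(adev).
 * lockup_timeout:     stand-in for amdgpu_lockup_timeout.
 * Returns true only when recovery is wanted AND no job timeout is armed,
 * so the FLR path does not repeat a recovery that TDR would also start.
 */
static bool flr_work_should_recover(bool should_recover_gpu, long lockup_timeout)
{
	return should_recover_gpu &&
	       lockup_timeout == SKETCH_MAX_SCHEDULE_TIMEOUT;
}

/* Example cases; the finite timeout value is arbitrary. */
static void flr_work_examples(void)
{
	/* No TDR: the FLR work performs the recovery itself. */
	assert(flr_work_should_recover(true, SKETCH_MAX_SCHEDULE_TIMEOUT));
	/* TDR armed with a finite timeout: leave recovery to the timeout path. */
	assert(!flr_work_should_recover(true, 10000));
	/* Recovery not wanted at all: never recover. */
	assert(!flr_work_should_recover(false, SKETCH_MAX_SCHEDULE_TIMEOUT));
}
```

Before the patch, the second case would also have recovered, which is the duplicate recovery the commit message describes.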