drm/amdgpu: revert "implement tdr advanced mode"
authorChristian König <christian.koenig@amd.com>
Wed, 26 Oct 2022 10:26:37 +0000 (12:26 +0200)
committerAlex Deucher <alexander.deucher@amd.com>
Tue, 15 Nov 2022 20:25:22 +0000 (15:25 -0500)
commit06a2d7cc3f0476be4682ef90eb09a28fa3daed37
tree17c7f45421a398fbe7137fa627127321cf457ea8
parentd27252b5706e51188aed7647126e44dcf9e940c1
drm/amdgpu: revert "implement tdr advanced mode"

This reverts commit e6c6338f393b74ac0b303d567bb918b44ae7ad75.

This feature basically re-submits one job after another to
figure out which one was the one causing a hang.

This is obviously incompatible with gang-submit which requires
that multiple jobs run at the same time. It's also absolutely
not helpful to crash the hardware multiple times if a clean
recovery is desired.

For testing and debugging environments we should rather disable
recovery alltogether to be able to inspect the state with a hw
debugger.

Additional to that the sw implementation is clearly buggy and causes
reference count issues for the hardware fence.

Signed-off-by: Christian König <christian.koenig@amd.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c
drivers/gpu/drm/scheduler/sched_main.c
include/drm/gpu_scheduler.h