drm/amdgpu: Clarify error when hitting bad page threshold
author Kent Russell <kent.russell@amd.com>
Tue, 19 Oct 2021 13:53:17 +0000 (09:53 -0400)
committer Alex Deucher <alexander.deucher@amd.com>
Wed, 20 Oct 2021 15:43:57 +0000 (11:43 -0400)
Change the error message when the bad_page_threshold is reached,
explicitly stating that the GPU will not be initialized.

Cc: Luben Tuikov <luben.tuikov@amd.com>
Cc: Mukul Joshi <Mukul.Joshi@amd.com>
Signed-off-by: Kent Russell <kent.russell@amd.com>
Reviewed-by: Luben Tuikov <luben.tuikov@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
drivers/gpu/drm/amd/amdgpu/amdgpu_ras_eeprom.c

index 9873251..f4c05ff 100644
@@ -1101,7 +1101,7 @@ int amdgpu_ras_eeprom_init(struct amdgpu_ras_eeprom_control *control,
                        *exceed_err_limit = true;
                        dev_err(adev->dev,
                                "RAS records:%d exceed threshold:%d, "
-                               "maybe retire this GPU?",
+                               "GPU will not be initialized. Replace this GPU or increase the threshold",
                                control->ras_num_recs, ras->bad_page_cnt_threshold);
                }
        } else {
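
For readers without the driver source at hand, the behavior this message reports can be sketched in standalone C. This is only an illustration under stated assumptions: `struct ras_state` and `ras_check_threshold` are hypothetical names, not amdgpu APIs, and the `>=` comparison is a guess at when the limit counts as reached. Only the message text is taken from the hunk above.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical stand-in for the driver state; not the real amdgpu structs. */
struct ras_state {
	unsigned int num_recs;           /* bad-page records found in the EEPROM table */
	unsigned int bad_page_threshold; /* configured bad-page limit */
};

/*
 * Returns true when the record count has reached the threshold and writes
 * the error text from this patch into buf. Returns false and leaves buf
 * empty otherwise. Assumption: the limit counts as reached when
 * num_recs >= bad_page_threshold.
 */
static bool ras_check_threshold(const struct ras_state *s, char *buf, size_t len)
{
	if (len > 0)
		buf[0] = '\0';

	if (s->num_recs < s->bad_page_threshold)
		return false;

	/* Adjacent string literals are concatenated, as in the dev_err() call. */
	snprintf(buf, len,
		 "RAS records:%u exceed threshold:%u, "
		 "GPU will not be initialized. Replace this GPU or increase the threshold",
		 s->num_recs, s->bad_page_threshold);
	return true;
}
```

A caller would treat a true return the way the driver treats `*exceed_err_limit = true`: log the text and skip initializing the device.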