drm/amdgpu: support check xgmi/walf error mask bit for aldebaran
authorStanley.Yang <Stanley.Yang@amd.com>
Thu, 12 Jan 2023 06:27:26 +0000 (14:27 +0800)
committerAlex Deucher <alexander.deucher@amd.com>
Tue, 17 Jan 2023 21:11:52 +0000 (16:11 -0500)
commit828fc79dcfa16340340382d888c03c7ea6a8c33e
tree24d14919625e0cd467f51ef7da1ae903f8ebf874
parent1427a720273976a81d13d9d9fa60d53ce881cbd7
drm/amdgpu: support check xgmi/walf error mask bit for aldebaran

The pcs error count should be determined by PCS ERROR status and
PCS ERROR MASK registers, only PCS ERROR status register can not
refect error counts accurately.

Changed from V1:
remove clean noncorrectable mask registers
optimize query pcs error status

Changed from V2:
remove check mask_value bits
correct set value corresponding bit

Signed-off-by: Stanley.Yang <Stanley.Yang@amd.com>
Reviewed-by: Lijo Lazar <lijo.lazar@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c