drm/amdgpu: Fix SDMA RAS error reporting on Aldebaran
authorMukul Joshi <mukul.joshi@amd.com>
Wed, 24 Mar 2021 15:51:35 +0000 (11:51 -0400)
committerAlex Deucher <alexander.deucher@amd.com>
Wed, 21 Apr 2021 01:45:17 +0000 (21:45 -0400)
commitceb47e0d84d9d327ef3cd7091fb54f612394f189
tree7e5909849ce474ad92436abce27dd644b20e0fa7
parent1f0d8e3781f40c6cae7cb68a4cccfc54dd4ad3a1
drm/amdgpu: Fix SDMA RAS error reporting on Aldebaran

Fix the following issues with SDMA RAS error reporting:
1. Read the EDC_COUNTER2 register also to fetch error counts
   for all sub-blocks in SDMA.
2. SDMA RAS on Aldebaran suports single-bit uncorrectable errors
   only. So, report error count in UE count instead of CE count.

Signed-off-by: Mukul Joshi <mukul.joshi@amd.com>
Reviewed-By: John Clements <John.Clements@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
drivers/gpu/drm/amd/amdgpu/sdma_v4_4.c