drm/amdgpu: add deferred error check for UMC v12 address query
author Tao Zhou <tao.zhou1@amd.com>
Thu, 29 Feb 2024 03:41:25 +0000 (11:41 +0800)
committer Alex Deucher <alexander.deucher@amd.com>
Fri, 1 Mar 2024 01:35:14 +0000 (20:35 -0500)
Both RAS UE and deferred errors need page retirement.

Signed-off-by: Tao Zhou <tao.zhou1@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
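
The condition this patch widens can be sketched as a standalone predicate. This is only an illustration, not the driver's code: the bit positions and the helper names (`STATUS_*`, `is_uncorrectable`, `is_deferred`, `needs_address_query`) are assumptions made for the example, and the real `umc_v12_0_is_uncorrectable_error()` and `umc_v12_0_is_deferred_error()` helpers may test additional conditions.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical status bit positions, chosen for illustration only. */
#define STATUS_VAL      (1ULL << 63) /* status register holds a valid error */
#define STATUS_UC       (1ULL << 61) /* uncorrected error */
#define STATUS_PCC      (1ULL << 57) /* processor context corrupt */
#define STATUS_TCC      (1ULL << 55) /* task context corrupt */
#define STATUS_DEFERRED (1ULL << 44) /* deferred error */

/* Valid status with at least one of the uncorrectable flags set. */
static bool is_uncorrectable(uint64_t status)
{
	return (status & STATUS_VAL) &&
	       (status & (STATUS_UC | STATUS_PCC | STATUS_TCC));
}

/* Valid status with the deferred flag set. */
static bool is_deferred(uint64_t status)
{
	return (status & STATUS_VAL) && (status & STATUS_DEFERRED);
}

/*
 * Mirrors the shape of the patched condition: the error address is
 * computed for uncorrectable errors and for deferred errors, since
 * both kinds lead to page retirement.
 */
static bool needs_address_query(uint64_t status)
{
	return is_uncorrectable(status) || is_deferred(status);
}
```

With these assumed definitions, a status of `STATUS_VAL | STATUS_DEFERRED` now passes `needs_address_query()`, whereas a check on `is_uncorrectable()` alone would have skipped it.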
drivers/gpu/drm/amd/amdgpu/umc_v12_0.c

index 14ef7a24be7b567424c0cfecfdc2bf822923abc6..77af4e25ff465c8e760fa533e8271f5c83965e01 100644
@@ -348,7 +348,8 @@ static int umc_v12_0_query_error_address(struct amdgpu_device *adev,
        }
 
        /* calculate error address if ue error is detected */
-       if (umc_v12_0_is_uncorrectable_error(adev, mc_umc_status)) {
+       if (umc_v12_0_is_uncorrectable_error(adev, mc_umc_status) ||
+           umc_v12_0_is_deferred_error(adev, mc_umc_status)) {
                mc_umc_addrt0 =
                        SOC15_REG_OFFSET(UMC, 0, regMCA_UMC_UMC0_MCUMC_ADDRT0);