drm/nouveau/fifo: kill channel on a selection of PBDMA errors
authorBen Skeggs <bskeggs@redhat.com>
Wed, 1 Jun 2022 10:47:33 +0000 (20:47 +1000)
committerBen Skeggs <bskeggs@redhat.com>
Wed, 9 Nov 2022 00:44:49 +0000 (10:44 +1000)
A bunch of these can be handled in such a way that the channel can
continue, however, any of these are a pretty decent sign something
has gone horribly wrong, and the safest option is to disable the
channel.

This is a bit of a hack, we will want to handle these individually
and dump relevant debug info for each at some point.

Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>
drivers/gpu/drm/nouveau/nvkm/engine/fifo/gf100.c

index 4c3338c4d47a8c44ba6bfd328927bd29e761a9ea..ff28b5a4c36f964673c38203a1f75e388269515c 100644 (file)
@@ -30,7 +30,6 @@
 #include "gf100.h"
 #include "changf100.h"
 
-#include <core/client.h>
 #include <core/gpuobj.h>
 #include <subdev/bar.h>
 #include <subdev/fault.h>
@@ -138,8 +137,9 @@ gf100_runq_intr(struct nvkm_runq *runq, struct nvkm_runl *null)
                nvkm_error(subdev, "PBDMA%d: %08x [%s] ch %d [%010llx %s] "
                                   "subc %d mthd %04x data %08x\n",
                           runq->id, show, msg, chid, chan ? chan->inst->addr : 0,
-                          chan ? chan->object.client->name : "unknown",
-                          subc, mthd, data);
+                          chan ? chan->name : "unknown", subc, mthd, data);
+               if ((stat & 0xc67fe000) && chan)
+                       nvkm_chan_error(chan, true);
                nvkm_chan_put(&chan, flags);
        }