accel/habanalabs: abort device reset for consecutive heartbeat failures
authorTomer Tayar <ttayar@habana.ai>
Sun, 24 Dec 2023 22:28:36 +0000 (00:28 +0200)
committerOded Gabbay <ogabbay@kernel.org>
Mon, 26 Feb 2024 07:30:40 +0000 (09:30 +0200)
commit246d8b6cfb80a31e3cc287e3c1db6a5515b7c20a
treea6b459211b2a6a5f0002f4c0069c3d1dd52a4600
parentd0df8a35a76d310fe0a06eaf1ac1ade98f4d05e4
accel/habanalabs: abort device reset for consecutive heartbeat failures

The mechanism of aborting device reset for consecutive fatal errors is
currently only for fatal errors that are reported by FW.
A non-responsive FW and consecutive heartbeat failures is also
considered fatal, so add them as well to this mechanism to avoid
recurring device reset in such a case.

Signed-off-by: Tomer Tayar <ttayar@habana.ai>
Reviewed-by: Oded Gabbay <ogabbay@kernel.org>
Signed-off-by: Oded Gabbay <ogabbay@kernel.org>
drivers/accel/habanalabs/common/device.c