watchdog/hpwdt: Only claim UNKNOWN NMI if from iLO
author Jerry Hoemann <jerry.hoemann@hpe.com>
Wed, 13 Dec 2023 21:53:38 +0000 (14:53 -0700)
committer Wim Van Sebroeck <wim@linux-watchdog.org>
Sun, 17 Dec 2023 15:14:21 +0000 (16:14 +0100)
Avoid unnecessary crashes by claiming only NMIs that are due to
ERROR signalling or generated by the hpwdt hardware device.

The code does this, but only for iLO5.

The intent was to preserve the legacy (Gen9 and earlier) semantics of
using hpwdt for error containment, as hardware/firmware would signal
fatal IO errors as an NMI with the expectation of hpwdt crashing
the system.  However, these IO errors should be received by hpwdt
as an NMI_IO_CHECK.  So the test is overly permissive and should
not be limited to only iLO5.

We need to enable this protection for future iLOs not matching the
current PCI IDs.

Fixes: 62290a5c194b ("watchdog: hpwdt: Claim NMIs generated by iLO5")
Signed-off-by: Jerry Hoemann <jerry.hoemann@hpe.com>
Reviewed-by: Guenter Roeck <linux@roeck-us.net>
Link: https://lore.kernel.org/r/20231213215340.495734-2-jerry.hoemann@hpe.com
Signed-off-by: Guenter Roeck <linux@roeck-us.net>
Signed-off-by: Wim Van Sebroeck <wim@linux-watchdog.org>
drivers/watchdog/hpwdt.c

index f79f932bca14898fc70ca2dd58fc3f5b4320ec78..79ed1626d8ea11a17ba42653318e2b682d27fa37 100644
@@ -178,7 +178,7 @@ static int hpwdt_pretimeout(unsigned int ulReason, struct pt_regs *regs)
                "3. OA Forward Progress Log\n"
                "4. iLO Event Log";
 
-       if (ilo5 && ulReason == NMI_UNKNOWN && !mynmi)
+       if (ulReason == NMI_UNKNOWN && !mynmi)
                return NMI_DONE;
 
        if (ilo5 && !pretimeout && !mynmi)
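
Illustration only, not part of the patch: a minimal userspace sketch of how the two early-return checks in the hunk above behave before and after the change. It models only those two conditions. The enum values, the struct, and the function names are hypothetical stand-ins, not kernel definitions, and "claims" here simply means the handler does not return NMI_DONE from these checks.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-ins for the kernel's NMI reason codes. */
enum nmi_reason { REASON_UNKNOWN, REASON_SERR, REASON_IO_CHECK };

/* Hypothetical bundle of the flags the two checks consult. */
struct hpwdt_state {
	bool ilo5;       /* device matched the iLO5 PCI ID */
	bool pretimeout; /* pretimeout NMI is enabled */
	bool mynmi;      /* the hpwdt hardware reports it raised the NMI */
};

/* Old behaviour: the UNKNOWN-reason filter applies only when ilo5 is set. */
static bool claims_nmi_before(enum nmi_reason reason, struct hpwdt_state s)
{
	if (s.ilo5 && reason == REASON_UNKNOWN && !s.mynmi)
		return false;
	if (s.ilo5 && !s.pretimeout && !s.mynmi)
		return false;
	return true;
}

/* New behaviour: the UNKNOWN-reason filter applies to every iLO. */
static bool claims_nmi_after(enum nmi_reason reason, struct hpwdt_state s)
{
	if (reason == REASON_UNKNOWN && !s.mynmi)
		return false;
	if (s.ilo5 && !s.pretimeout && !s.mynmi)
		return false;
	return true;
}

/* Walks through the case the commit message is concerned with. */
void demo(void)
{
	/* A non-iLO5 device that did not generate the NMI itself. */
	struct hpwdt_state other_ilo = { .ilo5 = false, .pretimeout = true, .mynmi = false };

	/* Before: a stray UNKNOWN NMI was claimed. After: it is left alone. */
	assert(claims_nmi_before(REASON_UNKNOWN, other_ilo));
	assert(!claims_nmi_after(REASON_UNKNOWN, other_ilo));

	/* IO-check NMIs are still claimed after the change. */
	assert(claims_nmi_after(REASON_IO_CHECK, other_ilo));
}
```

In this model the only input whose outcome changes is a non-iLO5 device receiving an UNKNOWN NMI that it did not raise. NMIs raised by the device itself (mynmi set) and IO-check NMIs are treated the same before and after.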