sched/fair: Consider capacity inversion in util_fits_cpu()
authorQais Yousef <qais.yousef@arm.com>
Thu, 4 Aug 2022 14:36:09 +0000 (15:36 +0100)
committerPeter Zijlstra <peterz@infradead.org>
Thu, 27 Oct 2022 09:01:20 +0000 (11:01 +0200)
We do consider thermal pressure in util_fits_cpu() for uclamp_min only.
With the exception of the biggest cores which by definition are the max
performance point of the system and all tasks by definition should fit.

Even under thermal pressure, the capacity of the biggest CPU is the
highest in the system and should still fit every task. Except when it
reaches capacity inversion point, then this is no longer true.

We can handle this by using the inverted capacity as capacity_orig in
util_fits_cpu(). Which not only addresses the problem above, but also
ensure uclamp_max now considers the inverted capacity. Force fitting
a task when a CPU is in this adverse state will contribute to making the
thermal throttling last longer.

Signed-off-by: Qais Yousef <qais.yousef@arm.com>
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Link: https://lore.kernel.org/r/20220804143609.515789-10-qais.yousef@arm.com
kernel/sched/fair.c

index 4c4ea474fdc217d9ba7814e2436bfe77109d56c1..919d016c5d77d7580bf4b24863aa055923824e5c 100644 (file)
@@ -4465,12 +4465,16 @@ static inline int util_fits_cpu(unsigned long util,
         * For uclamp_max, we can tolerate a drop in performance level as the
         * goal is to cap the task. So it's okay if it's getting less.
         *
-        * In case of capacity inversion, which is not handled yet, we should
-        * honour the inverted capacity for both uclamp_min and uclamp_max all
-        * the time.
+        * In case of capacity inversion we should honour the inverted capacity
+        * for both uclamp_min and uclamp_max all the time.
         */
-       capacity_orig = capacity_orig_of(cpu);
-       capacity_orig_thermal = capacity_orig - arch_scale_thermal_pressure(cpu);
+       capacity_orig = cpu_in_capacity_inversion(cpu);
+       if (capacity_orig) {
+               capacity_orig_thermal = capacity_orig;
+       } else {
+               capacity_orig = capacity_orig_of(cpu);
+               capacity_orig_thermal = capacity_orig - arch_scale_thermal_pressure(cpu);
+       }
 
        /*
         * We want to force a task to fit a cpu as implied by uclamp_max.