Find which pod is OOMKilling and propose a fix
Когда использовать: You're paged for memory pressure and don't have a dashboard up.
Предварительные требования
- Server/skill installed and authenticated — See repo README
Поток
-
Spot the OOMsFind pods in any namespace with restartCount > 3 in the last hour. Show pod, namespace, last termination reason.✓ Скопировано→ Targeted list, likely with OOMKilled in the reason
-
Propose the fixGet the limits/requests for the worst offender. Tell me what to raise and write an apply_manifest dry-run patch.✓ Скопировано→ Resource diff + a dry-run apply
Итог: Targeted patch ready to apply, RBAC-checked.
Подводные камни
- MKP only exposes what the bound SA can do — MKP only exposes what the bound SA can do — to enable apply_manifest you must bind an SA with patch permission.