Skip to content

[BUG] Missing requests/limits for AKS core components on worker nodes #3496

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
slawekww opened this issue Feb 28, 2023 · 9 comments
Closed
Labels
action-required addon/scaling Handling req/limit settings for AKS managed addon pods bug Needs Attention 👋 Issues needs attention/assignee/owner

Comments

@slawekww
Copy link

Describe the bug
Run https://github.com/Azure/kube-advisor on AKS 1.24.6
to verify CPU/Memory requests/limits set for k8s workloads.

The following components provided and maintain by AKS on worker nodes have no requests/limits set:

  • calico-kube-controllers
  • calico-node
  • calico-typha
  • cloud-node-manager
  • csi-azuredisk-node
  • csi-azurefile-node
  • kube-proxy
  • tigera-operator

To Reproduce
Steps to reproduce the behavior:

  1. https://github.com/Azure/kube-advisor#running-in-a-kubernetes-cluster-with-rbac-enabled
  2. See output on AKS cluster

Expected behavior
All AKS managed workloads on worker nodes should have limits/request set and have guidelines to update them by AKS administrators.

Screenshots
kube-advisory.log

Environment (please complete the following information):

  • CLI Version: 2.44.1
  • Kubernetes version: AKS 1.24.6
  • CLI Extension version: n/a
  • Browser: n/a

Additional context
Cost of worker node is under customers and it should be clear and precise way to control AKS managed workloads.

@slawekww slawekww added the bug label Feb 28, 2023
@marcindulak
Copy link

Discussed previously in #2125 (comment)

@slawekww
Copy link
Author

slawekww commented Mar 8, 2023

@marcindulak Thanks!
Discussion is about OOMKill for AKS core components.

Once they would have Guaranteed QoS class https://kubernetes.io/docs/tasks/configure-pod-container/quality-service-pod/#create-a-pod-that-gets-assigned-a-qos-class-of-guaranteed
it would not be issue with OOMKill as they would requests=limits the expected resources.

AKS administrators should have possibility to update requests/limits for those AKS core components.
In case of avoiding OOMKill, use Guaranteed QoS class.
In case of cost optimization, set values which are acceptable by customers - we need to know minimum values for requests CPU/memory for AKS core components.

@ghost ghost added the action-required label Apr 2, 2023
@RooMaiku RooMaiku added the addon/scaling Handling req/limit settings for AKS managed addon pods label Apr 6, 2023
@ghost ghost removed the action-required label Apr 6, 2023
@ghost ghost added the action-required label May 2, 2023
@ghost
Copy link

ghost commented May 7, 2023

Action required from @Azure/aks-pm

@ghost ghost added the Needs Attention 👋 Issues needs attention/assignee/owner label May 7, 2023
@ghost
Copy link

ghost commented May 22, 2023

Issue needing attention of @Azure/aks-leads

5 similar comments
@ghost
Copy link

ghost commented Jun 6, 2023

Issue needing attention of @Azure/aks-leads

@ghost
Copy link

ghost commented Jun 21, 2023

Issue needing attention of @Azure/aks-leads

@ghost
Copy link

ghost commented Jul 6, 2023

Issue needing attention of @Azure/aks-leads

@ghost
Copy link

ghost commented Jul 22, 2023

Issue needing attention of @Azure/aks-leads

@ghost
Copy link

ghost commented Aug 6, 2023

Issue needing attention of @Azure/aks-leads

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
action-required addon/scaling Handling req/limit settings for AKS managed addon pods bug Needs Attention 👋 Issues needs attention/assignee/owner
Projects
None yet
Development

No branches or pull requests

3 participants