Skip to content

Instantly share code, notes, and snippets.

@juniorz
Last active June 21, 2023 09:56
Show Gist options
  • Star 1 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save juniorz/d3dc429ec8c4a0e52966b54940b0a3b3 to your computer and use it in GitHub Desktop.
Save juniorz/d3dc429ec8c4a0e52966b54940b0a3b3 to your computer and use it in GitHub Desktop.
Things on my radar but not yet (OUTDATED)

Tecnology and Society

Kubernetes autoscaling

Kubernetes management

Linux process scheduling

cgroups

X in cgroups/containers

Network?

distributed

performance tools

tools

monitoring

envoy (in practice)

Cloud Native

Concurrecy control

Multi-zone

K8s supports running a single cluster in multiple failure zones (zones in GCP, availability zones in AWS). A single k8s cluster is limited to a single region (and cloud provider). Multi-cloud providers and multi-region requires multiple clusters.

However, Istio enables locality load balancing by default.

Constraints of having an EBS-backed PV in a multi-zone cluster:

Docs:

Kubernnetes Security

QoS and oversubscription

CPU Limit (and throttling)

Prometheus

This is a v. good intro to the 4 types of metrics:

How other metric collection systems integrate with prometheus metrics?

There is also this free course: https://training.robustperception.io/p/introduction-to-prometheus

  • alertmanager
  • thanos
  • cortex

InfluxDB

Ingress-nginx

ingress-nginx 0.26.0+ takes up to 300s (5 minutes) to terminate while waiting for termination of incoming connections. See release notes for: https://github.com/kubernetes/ingress-nginx/releases/tag/nginx-0.26.0

  1. How long does it take for a pod scheduled for deletion to be removed from the list of backends across all ingress controller instances? Is it configurable?
    1. How long does it take to propagate a removal of pod to its Endpoints?
    2. How long does it take to propagate an Endpoint change to the "Lua handler" (https://kubernetes.github.io/ingress-nginx/how-it-works/#avoiding-reloads-on-endpoints-changes) ?
    3. Can these things be measured? Is there any metric for this?

The "Cloud Native security" landscape

https://github.com/GoogleContainerTools/kaniko

Microsoft Kubernetes landscape

https://azure.microsoft.com/en-us/topic/what-is-kubernetes/

https://daemonza.github.io/2017/02/20/using-helm-to-deploy-to-kubernetes/ https://medium.com/@gajus/the-missing-ci-cd-kubernetes-component-helm-package-manager-1fe002aac680 https://cloudblogs.microsoft.com/opensource/2019/05/06/announcing-keda-kubernetes-event-driven-autoscaling-containers/

The CNCF landscape

https://github.com/helm/helm/releases/tag/v3.0.0-rc.3

Spinakker and Kayenta

Flagger

http://port.us.org/ vs https://github.com/goharbor/harbor/blob/master/README.md

https://brigade.sh/

https://gravitational.com/teleport/docs/kubernetes_ssh/ and https://gravitational.com/teleport/docs/architecture/teleport_architecture_overview/

https://github.com/aquasecurity/kube-hunter

https://github.com/GoogleContainerTools/skaffold vs https://www.deployhub.com/ vs https://tilt.dev/ vs https://squash.solo.io/ vs https://www.telepresence.io/ vs https://okteto.com/ vs https://draft.sh/

https://github.com/vmware-tanzu/octant

Data visualization

(Watch|Read)list

Delivery

web server

Problems

Kubernetes monitoring architecture (in depth)

Extending Kubernetes

Multi-region

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment