potato-cluster

Local Kubernetes potato cluster and telemetry generator for Grafana Cloud demos.

Multi-node Kubernetes cluster

Launch a multi-node k8s cluster.

Node 1: control-plane
Node 2: worker
Node 3: worker 2

kind create cluster --config kubernetes/potato-cluster-config.yaml

# check cadvisor metrics (optional)
kubectl proxy
curl http://localhost:8001/api/v1/nodes/potato-worker/proxy/metrics/cadvisor

# install metrics server
kubectl apply -f https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml
kubectl patch -n kube-system deployment metrics-server --type=json \
  -p '[{"op":"add","path":"/spec/template/spec/containers/0/args/-","value":"--kubelet-insecure-tls"}]'

# check the running nodes
k9s -c nodes

Kubernetes monitoring

Grab Grafana Kubernetes configuration, toggle desired features, copy paste and adjust kubernetes/helm/grafana-k8s-monitoring/values.yaml accordingly.

# alloy operator https://github.com/grafana/k8s-monitoring-helm/blob/main/charts/k8s-monitoring/README.md
kubectl apply -f https://github.com/grafana/alloy-operator/releases/latest/download/collectors.grafana.com_alloy.yaml

helm repo add grafana https://grafana.github.io/helm-charts
helm repo update

echo """
GRAFANA_METRICS_URL="REPLACE_ME"
GRAFANA_METRICS_USER="REPLACE_ME"
GRAFANA_CLUSTER_METRICS_URL="REPLACE_ME"
GRAFANA_LOGS_URL="REPLACE_ME"
GRAFANA_LOGS_USER="REPLACE_ME"
GRAFANA_OTLP_URL="REPLACE_ME"
GRAFANA_OTLP_USER="REPLACE_ME"
GRAFANA_PROFILES_URL="REPLACE_ME"
GRAFANA_PROFILES_USER="REPLACE_ME"
GRAFANA_ACCESS_TOKEN="REPLACE_ME"
""" > kubernetes/helm/grafana-k8s-monitoring/.env
export $(cat kubernetes/helm/grafana-k8s-monitoring/.env | xargs)

helm upgrade --install --atomic --timeout 300s -n monitoring --create-namespace grafana-k8s-monitoring grafana/k8s-monitoring \
  -f <(envsubst < kubernetes/helm/grafana-k8s-monitoring/values.yaml)

# issue since v2.1 might need to rerun the install to deploy missing alloy-* resources after an uninstall because of dangling operator finalizer
# https://github.com/grafana/k8s-monitoring-helm/issues/1615

# check the running pods in the monitoring namespace
k9s -n monitoring -c pods

Resource stress test

Sample stress-mem and stress-cpu pods will hog resources to trigger generic Kubernetes alerting rules, available out of the box.

kubectl create namespace noisy-neighborhood
kubectl replace --force -f kubernetes/resources/noisy-neighborhood

# monitor resource usage
k9s -n noisy-neighborhood -c pods

NOTE: If containers last terminated reason is Error instead of OOMKilled and there are "failed to create inotify fd" warnings in the node logs then you probably need to bump inotify resources on the host

Otel demo

helm repo add open-telemetry https://open-telemetry.github.io/opentelemetry-helm-charts

# helm upgrade --install --create-namespace -n otel-demo-local -f kubernetes/helm/otel-demo/values-local.yaml otel-demo-local open-telemetry/opentelemetry-demo
helm upgrade --install --create-namespace -n otel-demo -f kubernetes/helm/otel-demo/values.yaml otel-demo open-telemetry/opentelemetry-demo

# check the running deployments
# k9s -n otel-demo-local -c deploy
k9s -n otel-demo -c deploy

# port forward the frontend-proxy
kubectl -n otel-demo port-forward svc/frontend-proxy 8080:8080

# open http://localhost:8080
# open http://localhost:8080/grafana

Auto-sync from git

Automatically pull changes to kubernetes/helm/otel-demo/values.yaml and run a helm upgrade on a remote machine.

On the remote machine, set up a cron job that runs scripts/sync-otel-demo.sh:

# add a cron job (runs every 2 minutes)
(crontab -l 2>/dev/null; echo "*/2 * * * * /path/to/potato-cluster/scripts/sync-otel-demo.sh >> /tmp/otel-demo-cron.log 2>&1") | crontab -

NOTE: Replace /path/to/potato-cluster with the actual repo path on the remote machine. Adjust the cron interval as needed.

OOM Kafka

scripts/oom-kafka.sh resets the Kafka memory limit to 512Mi (enough to trigger OOM), commits, and pushes. The sync script on the remote machine will pick up the change and apply it. This way, even if someone fixes the memory limit in git, it gets reverted back to OOM-inducing levels.

(crontab -l 2>/dev/null; echo "*/30 * * * * /path/to/potato-cluster/scripts/oom-kafka.sh >> /tmp/otel-demo-cron.log 2>&1") | crontab -

Python API with OTLP instrumentation

Send data to the Grafana Cloud OTLP endpoint

Dev (optional)

Prerequisites: python >=3.13.2, poetry

cd hello-api
poetry install
source .venv/bin/activate
uvicorn app:app --host 0.0.0.0 --port 8000
curl "http://localhost:8000/hello?name=world"
pip freeze > requirements.txt

Build

docker buildx build \
  --platform linux/amd64,linux/arm64 \
  --build-arg GIT_REF=$(git rev-parse HEAD) \
  -t ar2pi/hello-api \
  --push .

Deploy

kubectl create namespace hello-api
kubectl replace --force -f kubernetes/resources/hello-api

k9s -n hello-api -c pods

# port forward hello-api
kubectl port-forward -n hello-api svc/nginx-reverse-proxy 8000:8000

Deploy standalone alloy for profiles

Grafana Alloy Helm chart

echo """
GRAFANA_PROFILES_URL="REPLACE_ME"
GRAFANA_PROFILES_USER="REPLACE_ME"
GRAFANA_ACCESS_TOKEN="REPLACE_ME"
""" > kubernetes/helm/alloy-sdk-profiles/.env
export $(cat kubernetes/helm/alloy-sdk-profiles/.env | xargs)

helm upgrade --install --create-namespace -n monitoring alloy-sdk-profiles grafana/alloy \
  -f <(envsubst < kubernetes/helm/alloy-sdk-profiles/values.yaml)

Deploy standalone alloy for rabbitmq

Grafana Alloy Helm chart

echo """
GRAFANA_METRICS_URL="REPLACE_ME"
GRAFANA_METRICS_USER="REPLACE_ME"
GRAFANA_CLUSTER_METRICS_URL="REPLACE_ME"
GRAFANA_ACCESS_TOKEN="REPLACE_ME"
""" > kubernetes/helm/alloy-rabbitmq.env
export $(cat kubernetes/helm/alloy-rabbitmq/.env | xargs)

helm upgrade --install --create-namespace -n monitoring alloy-rabbitmq grafana/alloy \
  -f <(envsubst < kubernetes/helm/alloy-rabbitmq/values.yaml)

Run the rabbitmq pods

kubectl create namespace rabbitmq
kubectl replace --force -f kubernetes/resources/rabbitmq/

Install dashboards from https://ar2p2.grafana.net/connections/add-new-connection/rabbitmq

Generate some traffic through k6

Prerequisites: k6

while true; do k6 run k6/loadtest.js; done

# to send k6 metrics to grafana cloud via prom remote write (optional):
export $(cat kubernetes/helm/grafana-k8s-monitoring/.env | xargs)
while true; do
    K6_PROMETHEUS_RW_USERNAME="$GRAFANA_METRICS_USER" \
    K6_PROMETHEUS_RW_PASSWORD="$GRAFANA_ACCESS_TOKEN" \
    K6_PROMETHEUS_RW_SERVER_URL="$GRAFANA_METRICS_URL" \
    k6 run -o experimental-prometheus-rw k6/loadtest.js
done

# monitor resource usage
k9s -n hello-api -c pods

k6 Prometheus dashboard

NGINX exporter

Dasboards

@TODO: dashboards screenshots + json links

fpog otel
fpog beyla
k6 results
nginx exporter
otel demo dashboard

Clean up

kind delete cluster --name potato

# or more fine-grained
kubectl delete --ignore-not-found -f kubernetes/resources/noisy-neighborhood
kubectl delete --ignore-not-found -f kubernetes/resources/hello-api
kubectl delete --ignore-not-found -f kubernetes/resources/rabbitmq
helm uninstall -n otel-demo --ignore-not-found otel-demo
helm uninstall -n monitoring --ignore-not-found grafana-k8s-monitoring
helm uninstall -n monitoring --ignore-not-found alloy-sdk-profiles
# issue since v2.1 might need to remove alloy-* subcharts manually
# https://github.com/grafana/k8s-monitoring-helm/issues/1615
helm uninstall -n monitoring --ignore-not-found grafana-k8s-monitoring-alloy-logs grafana-k8s-monitoring-alloy-metrics grafana-k8s-monitoring-alloy-profiles grafana-k8s-monitoring-alloy-receiver grafana-k8s-monitoring-alloy-singleton

Additional resources

@TODO:

install script, makefile all the things
export Grafana dashboards
- single pane of glass (otel)
- single pane of glass (beyla)
- warning and errors
- otel-demo
- nginx exporter
- k6 metrics

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

potato-cluster

Multi-node Kubernetes cluster

Kubernetes monitoring

Resource stress test

Otel demo

Auto-sync from git

OOM Kafka

Python API with OTLP instrumentation

Dev (optional)

Build

Deploy

Deploy standalone alloy for profiles

Deploy standalone alloy for rabbitmq

Generate some traffic through k6

NGINX exporter

Dasboards

Clean up

Additional resources

@TODO:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 217 Commits
grafana		grafana
hello-api		hello-api
k6		k6
kubernetes		kubernetes
scripts		scripts
skills		skills
.gitignore		.gitignore
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

potato-cluster

Multi-node Kubernetes cluster

Kubernetes monitoring

Resource stress test

Otel demo

Auto-sync from git

OOM Kafka

Python API with OTLP instrumentation

Dev (optional)

Build

Deploy

Deploy standalone alloy for profiles

Deploy standalone alloy for rabbitmq

Generate some traffic through k6

NGINX exporter

Dasboards

Clean up

Additional resources

@TODO:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages