DEV Community

# observability

Gaining deep insights into system behavior through metrics, logs, and traces.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
CPU Humbled Me — A Kubernetes Throttling Story Hidden Between Prometheus Scrapes

CPU Humbled Me — A Kubernetes Throttling Story Hidden Between Prometheus Scrapes

Comments
3 min read
I loaded 30 days of real LLM traces into a live demo. Here is what they reveal

I loaded 30 days of real LLM traces into a live demo. Here is what they reveal

Comments
2 min read
Free event — OTel Night Amsterdam v2026.05 | May 20 | ING + Albert Heijn on OpenTelemetry at enterprise scale

Free event — OTel Night Amsterdam v2026.05 | May 20 | ING + Albert Heijn on OpenTelemetry at enterprise scale

Comments
1 min read
Logging & Observability Best Practices from Bronto

Logging & Observability Best Practices from Bronto

2
Comments
6 min read
Why Heuristic Detectors Beat LLMs at Finding Agent Failures

Why Heuristic Detectors Beat LLMs at Finding Agent Failures

Comments
5 min read
Build Your Own Telemetry UI Using Lovable & Bronto

Build Your Own Telemetry UI Using Lovable & Bronto

2
Comments
2 min read
Grafana Loki: Cost-Effective Log Aggregation at Scale

Grafana Loki: Cost-Effective Log Aggregation at Scale

Comments
2 min read
How to make Time-Shifed Compare Metrics in Grafana Across Datasources

How to make Time-Shifed Compare Metrics in Grafana Across Datasources

1
Comments
2 min read
How to Monitor OpenAI API Costs and Token Usage with OpenTelemetry

How to Monitor OpenAI API Costs and Token Usage with OpenTelemetry

5
Comments
10 min read
From TCP Retransmits to MCP-Driven Cluster Investigations: An eBPF GPU Agent Retrospective

From TCP Retransmits to MCP-Driven Cluster Investigations: An eBPF GPU Agent Retrospective

1
Comments
8 min read
You WON'T Get Realtime LLM Cost From Your Public Cloud

You WON'T Get Realtime LLM Cost From Your Public Cloud

Comments
5 min read
Coding agents produce causal DAGs, not logs

Coding agents produce causal DAGs, not logs

1
Comments
5 min read
Observability and evidence in AI coding workflows: two log streams, two masters

Observability and evidence in AI coding workflows: two log streams, two masters

Comments
5 min read
I Built a Dashboard in 30 Seconds with AI

I Built a Dashboard in 30 Seconds with AI

5
Comments
5 min read
Why We Stopped Using Log Aggregation for Everything

Why We Stopped Using Log Aggregation for Everything

Comments
1 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.