Skip to content
#

SRE

Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.

Here are 28 public repositories matching this topic...

A lightweight, cross-platform chaos engineering framework built in Rust for testing service resilience through controlled failure injection. Supports network latency, packet loss, CPU/memory pressure, and more on Windows, macOS, and Linux.

  • Updated Dec 18, 2025
  • Rust
tumult

Rust-native chaos engineering platform with native OpenTelemetry, embedded DuckDB analytics, and Apache Arrow data pipelines. 10 plugins (Kubernetes, Docker, Pumba, PostgreSQL, Redis, Kafka, SSH). 7 regulatory compliance frameworks (DORA, NIS2, PCI-DSS). Single binary, zero unsafe.

  • Updated Apr 4, 2026
  • Rust
Followers
149 followers
Website
github.com/topics/sre
Wikipedia
Wikipedia