Skip to content
#

SRE

Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.

Here are 14 public repositories matching this topic...

A lightweight, cross-platform chaos engineering framework built in Rust for testing service resilience through controlled failure injection. Supports network latency, packet loss, CPU/memory pressure, and more on Windows, macOS, and Linux.

  • Updated Oct 4, 2025
  • Rust
Followers
142 followers
Website
github.com/topics/sre
Wikipedia
Wikipedia