SRE
Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.
Here are 60 public repositories matching this topic...
Ladies in DevOps is a safe place for self-identifying women either in the DevOps/SRE/Cloud space already or who are interested in learning more.
-
Updated
Oct 21, 2024 - JavaScript
A role-playing game for incident management training
-
Updated
Dec 6, 2020 - JavaScript
Live Kubernetes visualizer
-
Updated
May 30, 2017 - JavaScript
🌟 Modern responsive CV built with Vue.js 3 + TailwindCSS showcasing Senior SRE expertise. SEO optimized, PWA-ready, WCAG compliant. 8+ years production systems experience, Kubernetes & multi-cloud specialist.
-
Updated
Jun 13, 2025 - JavaScript
Simple Sloth SLO generator on the browser using Sloth as a Go library and WASM.
-
Updated
Nov 20, 2025 - JavaScript
-
Updated
Dec 15, 2025 - JavaScript
Created with CodeSandbox (Tuesday 11/17 thru Sunday 11/22 after work hours - approx. 16 hours of effort including site design and mentally absorbing Google's lecture content)
-
Updated
Nov 29, 2020 - JavaScript
⚡ Enhance your AI coding with Superpowers—an essential skills library for testing, debugging, and collaboration efficiency.
-
Updated
Feb 10, 2026 - JavaScript
IntelliReq - A Requirement Elicitation tool for Solo Developers
-
Updated
Jan 14, 2026 - JavaScript
My personal blog to keep technical notes.
-
Updated
Jan 12, 2026 - JavaScript
Scalable User Management Application
-
Updated
Jan 3, 2025 - JavaScript
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
-
Updated
Feb 25, 2025 - JavaScript
🚨 IncidentFlow - Production-Ready Incident Management Modern microservices-based platform for DevOps teams. Built with React, Node.js, MongoDB. Features Docker containerization, Nginx reverse proxy, real-time updates, dark mode UI, and comprehensive security. ✨ Docker • Nginx • SSL • Real-time • Dark Mode • Microservices 🚀 make nginx-start
-
Updated
Aug 8, 2025 - JavaScript
-
Updated
Jan 28, 2022 - JavaScript
This project implements a Self-Healing Infrastructure designed to automatically detect, respond, and recover from system failures without human intervention. By combining cloud-native tooling, event-driven automation, and observability-driven intelligence, it ensures high availability, scalability, and operational resilience.
-
Updated
Aug 13, 2025 - JavaScript
My Tech Notebook
-
Updated
May 3, 2025 - JavaScript
- Followers
- 145 followers
- Website
- github.com/topics/sre
- Wikipedia
- Wikipedia