libraries for dewey
Updated Mar 6, 2019
Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.
Kubernetes & Linux infrastructure health check (OS + K8s + Services), CSV/DOCX report, multi-channel alerts for DevOps/LLMOps engineers.
Open source implementation of the syslog protocol for Unix and Unix-like systems
Schema-first, type-safe structured logging and observability for TypeScript.
Develop low-cost software supply chains and IT infrastructure with our products, toolkits, solutions, and training, built for cybersecurity, automation, and AI.
A Production-Grade, High-Availability Portfolio. Architected with Kubernetes, Terraform, and GitOps. Features automated DevSecOps pipelines, FinOps-optimized serverless backend, and full-stack observability.
Our base GitHub Runner image.
A CLI tool designed for CI/CD processes, enabling automatic service versioning and changelog generation.
Production-ready Kubernetes v1.33 cluster setup scripts for Ubuntu 24.04 LTS. Automated deployment with kubeadm, containerd, and Flannel CNI.
🚀 A curated list of awesome Site Reliability Engineering resources, tools, and best practices. Open Source focused!
Cloud Native Infrastructure Diagnostics and Self-Healing Platform
Ansible play that collects relevant information and attaches it to an existing Aerospike Support Case.
Automated production health monitor - SRE portfolio project. Python · PowerShell · SQL · Grafana · GitHub Actions
An Autonomous AI SRE Agent for Kubernetes, built with Java Spring Boot & LangChain4j. Implements OODA loop for self-healing.
Fault-tolerant Kubernetes job orchestration control plane with persistent lifecycle tracking and reconciliation-driven execution recovery.
Best practices and strategies for reducing operational toil in engineering teams
Solving DevOps and SRE use-case problems with the Go programming language - Udemy course "Programação Go para DevOps e SREs"