Skip to content
View J-Kibaki's full-sized avatar

Highlights

  • Pro

Block or report J-Kibaki

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
J-Kibaki/README.md

πŸ‘‹ Hi, I'm Joseph Kibaki

NOC Analyst | Observability Engineer | SRE Enthusiast
πŸ“ Nairobi County, Kenya

Welcome to my GitHub! I'm a Network Operations Center (NOC) Analyst with 5+ years in financial services, specializing in observability, monitoring, and reliability engineering. I transform complex system data into actionable insights using modern monitoring stacks, automation, and SRE principles to ensure high-availability infrastructure.


🎯 Current Focus

  • System Observability: Building comprehensive monitoring solutions with ELK, Zabbix, and Grafana
  • Site Reliability Engineering: Implementing SLOs, error budgets, and automation for infrastructure resilience
  • Security Operations: Integrating security monitoring into observability pipelines
  • Infrastructure as Code: Automating deployment and monitoring with Python and Bash scripting

πŸ› οΈ Technical Stack

Observability & Monitoring
ELK Stack Grafana Zabbix
Prometheus Streamlit Splunk

Data Analytics & ML
Pandas NumPy SciPy

Infrastructure & Automation
Python Bash Linux
Docker AWS

Network & Security
Cisco Wireshark pfSense


πŸš€ Featured Projects

β–Ά Python-based dead code detector for identifying unused and orphaned files in repositories
β–Ά Multi-language support including Python, JavaScript, Java, C++, Go, and 20+ file types
β–Ά Smart categorization of unreferenced, orphaned, and suspicious files with heuristic analysis
β–Ά CI/CD integration with JSON output and configurable rules for automated code maintenance
β–Ά Production-ready tool with comprehensive documentation and example configurations

β–Ά Production-grade Streamlit application for comprehensive uptime monitoring and SLA management
β–Ά Advanced analytics engine with adaptive spike detection using Z-score and MAD algorithms
β–Ά Automated SLA reporting with daily uptime calculations pushed back to Zabbix via trapper items
β–Ά Cross-host correlation analysis for identifying infrastructure-wide performance patterns
β–Ά Smart caching system with configurable TTL and auto history/trends switching for optimal performance

β–Ά Python-based infrastructure provisioning and configuration management
β–Ά Automated remediation scripts for common NOC incidents (reduced MTTR by 35%)
β–Ά Health check orchestration with self-healing capabilities
β–Ά Compliance monitoring with automated security posture assessments

β–Ά Real-time network telemetry collection from Cisco and SD-WAN infrastructure
β–Ά Predictive analytics for capacity planning and anomaly detection
β–Ά Integration with threat intelligence feeds for security-aware monitoring
β–Ά Custom Grafana panels for executive-level reporting

β–Ά Incident response automation with Slack/Teams integration
β–Ά Post-incident analysis templates and blameless culture documentation
β–Ά Chaos engineering experiments for system resilience testing
β–Ά Toil identification and elimination tracking


πŸ“Š SRE Metrics & Achievements

  • 99.8% average uptime across monitored services (tracked via custom Zabbix analytics)
  • 12 minutes Mean Time to Detection (MTTD) using adaptive spike detection algorithms
  • 22 minutes Mean Time to Resolution (MTTR) with automated correlation analysis
  • 65% reduction in manual operational tasks through Python automation
  • Zero false positive alerts through intelligent filtering and MAD-based anomaly detection
  • Production-grade SLA monitoring with automated daily uptime reporting to stakeholders

πŸ“œ Professional Certifications

Certification Badge Status
ISC2 Certified in Cybersecurity (CC) ISC2 CC βœ… Active
CompTIA Linux+ Linux+ βœ… Active
CISCO Cybersecurity Essentials Cisco βœ… Active
AWS Solutions Architect AWS 🎯 In Progress

πŸ“ˆ GitHub Analytics

Joseph's GitHub Stats

Top Languages

GitHub Streak


🌟 Current Learning Path & Key Projects

  • Advanced Analytics: Implementing Z-score and MAD algorithms for infrastructure anomaly detection
  • SLA Engineering: Building comprehensive uptime calculation engines with automated reporting
  • API Integration: Developing robust API clients with OAuth2, caching, and rate limiting
  • Statistical Analysis: Cross-host correlation matrices for infrastructure pattern identification
  • Performance Optimization: Auto-switching between Zabbix history/trends for optimal query performance
  • Production Systems: Deploying enterprise-grade monitoring dashboards with 14-day log retention
  • Code Quality Automation: Building dead code detection tools for automated repository maintenance

🀝 Let's Connect & Collaborate!

I'm passionate about advanced observability engineering, statistical monitoring algorithms, and production-grade SLA systems. Whether you're interested in anomaly detection, automated uptime reporting, or building enterprise monitoring dashboards, I'd love to share insights and collaborate!

πŸ”— LinkedIn: josephkibaki
πŸ“§ Email: kibaki.joseph1@gmail.com
🐦 Twitter: @J_Kibaki
πŸ’¬ Open to: Mentoring, Knowledge Sharing, SRE Discussions


πŸ’­ Philosophy

"Observability is not about collecting dataβ€”it's about understanding your systems well enough to ask the right questions when things go wrong."

⚑ Building reliable systems, one metric at a time ⚑


Profile Views GitHub followers

Popular repositories Loading

  1. J-Kibaki J-Kibaki Public

    2

  2. SOC-Automation-Scripts SOC-Automation-Scripts Public

    A collection of real-world Python scripts to automate Security Operations Center (SOC) workflows β€” including log parsing, alert triage, and uptime reporting via the Zabbix API. These tools are buil…

    Python 1 1

  3. sys-alert-tuner sys-alert-tuner Public

    Forked from kratocz/sys-alert-tuner

    AI-powered system for optimizing monitoring alert thresholds using Reinforcement Learning.

    Python 1

  4. Uber_android_app_clone Uber_android_app_clone Public

    Forked from SimCoderYoutube/UberClone

    Uber App Clone

    Java

  5. Tinder_android_app_clone Tinder_android_app_clone Public

    Forked from SimCoderYoutube/TinderClone

    Tinder App Clone

    Java

  6. introduction-to-github introduction-to-github Public template

    Forked from github-education-experiences/introduction-to-github

    Get started using GitHub in less than an hour.