SB

Shreeda Bhat

Site Reliability Engineer & Platform Engineer · Kubernetes · AWS · GCP

4+ years AWS · GCP Kubernetes at scale 5M+ concurrent users
shreeda@devops:~
shreeda@devops:~$ cat about.txt
SRE & Platform Engineer · 4+ years → 99.99% uptime, 5M+ users online at once → Moved 200+ services AWS→GCP, live, no window → Infra cost: $80K→$5K/month via BGP+bare metal → Kubernetes everywhere: EKS, GKE, k3s
shreeda@devops:~$ skills
Containers: Kubernetes, Docker, Helm, ArgoCD, KEDA Cloud: AWS (EKS, Karpenter), GCP (GKE, multi-region) Backend: Python, Kafka, Redis, PostgreSQL IaC: Terraform, Ansible, GitHub Actions
shreeda@devops:~$

./experience

OnArrival current
DevOps Lead  ·  Nov 2025 – Present
  • Moved 10 microservices to EKS with Karpenter for spot capacity. Set up blue-green with Argo Rollouts. Nothing broke.
  • Built out Datadog from scratch: APM, distributed traces, custom metrics, the works. Before this we were flying blind.
  • Defined SLOs and error budgets. Ran K6 load tests at 200 req/s to actually validate them, not just write them down.
  • Pushed GitOps with ArgoCD hard. CI/CD failures dropped 45%.
  • Moved Kafka off managed and onto self-hosted Strimzi (TLS+SCRAM). Messaging costs dropped 30%.
EKS Karpenter ArgoCD Datadog Strimzi K6 IRSA
Mobile Premier League (MPL) 3 years
Site Reliability Engineer I → II  ·  Nov 2022 – Oct 2025
  • Kept 200+ microservices at 99.99% uptime with 5M+ users online simultaneously. There were some close calls.
  • Moved 200+ microservices from AWS to multi-region GCP live, no maintenance window. 20% cheaper, 30% faster for India and Brazil.
  • Wrote a Python controller that fed active session counts to KEDA so the autoscaler stopped evicting pods mid-game. Cloud spend dropped 25-40%.
  • Ran an Elasticsearch cluster (10 master + 30 data nodes) eating 100TB of logs a day. Keeping that healthy was a full-time job.
  • Set up ArgoCD app-of-apps with ApplicationSets, OPA/Gatekeeper, and RBAC across 3 regions. Deployments went from painful to boring, which was the goal.
  • Built the gameserver provisioning platform from scratch: 32+ Terraform modules, Ansible, Packer. Spun a server in 15 min instead of a few hours.
GKE KEDA ArgoCD Terraform Elasticsearch Python OPA/Gatekeeper
Dukaan 11 mo
DevOps Engineer  ·  Jan 2022 – Nov 2022
  • Built a BGP Anycast edge across 9 data centres. TTFB under 50ms for 1M+ storefronts. No GeoDNS tricks, just routing.
  • Moved the whole platform off AWS to bare metal while it was live. Used BGP dual-announcement with BIRD so traffic shifted gradually. Bill went from $80K to $5K a month.
BGP BIRD k3s MetalLB AWS

./skills

Containers & Orchestration
Kubernetes Docker Helm ArgoCD KEDA Argo Rollouts Strimzi
Cloud
AWS (EKS · Karpenter · IAM · ALB · VPC · S3) GCP (GKE · multi-region)
Infrastructure as Code & CI/CD
Terraform Ansible Packer GitHub Actions Jenkins
Observability
Datadog APM Prometheus Grafana Elasticsearch Loki OpenTelemetry
Backend & Systems
Python Bash Kafka / Strimzi Redis PostgreSQL REST API design Distributed Systems K6
Security & Networking
IRSA OPA / Gatekeeper RBAC TLS BGP / BIRD Route53

./projects

./testimonials

★★★★★
"Superb DevOps engineer. Has great knowledge of AWS and all other cloud computing platforms. He deployed a number of open-source tools for my business that too very quickly."
CL
Upwork Client
Deploy Strapi CMS on AWS EC2 · Aug 2023
via Upwork · 5.0
★★★★★
"Very glad that I hired Shreeda Bhat for my deployment project. He is a humble, helpful and skilled guy. I will surely rehire him in case if I have any more requirement. And I am very confident to refer him to my friends."
CL
Upwork Client
Deploy Django App on AWS EC2 · Jul 2022
via Upwork · 5.0
★★★★★ (4.6)
"Shreeda is a VERY reliable freelancer who is skilled at many facets of Python Django and DevOps. He will be a great addition to anyone's team!"
CL
Upwork Client
Django Python Developer · Jun–Aug 2021
via Upwork · 4.6
★★★★★
"Shreeda is great to work with. Highly recommended!"
CL
Upwork Client
QA Testing · Dec 2020
via Upwork · 5.0
"Working with Shreeda Bhat is a rare opportunity to come across — a self-driven, environmentally-conscious professional with great teamwork spirit."
EE
Emmanuel Erinfolami
QA Engineer · same team · Feb 2020
via LinkedIn
★★★★★
"I was very impressed with the quality of work that was delivered. I highly recommend Shreeda for any project."
SH
Sheetal
Other · Mar 26, 2024
via Truelancer · 5.0
★★★★★
"He's very dedicated and has great drive and ownership. Strongly recommend and look forward to working with him!"
TT
Thistakesagestotype
Web Development · Nov 25, 2023
via Truelancer · 5.0

./on_the_web

./systems

Session-Aware Eviction Controller
Python KEDA Kubernetes Custom Metrics API

Python controller that watches active session counts and feeds them to KEDA as custom metrics. Stops pods from getting killed while users are mid-game. Cloud spend dropped 25-40%. Before this, the autoscaler was just guessing.

→ Read writeup
Gameserver Provisioning Platform
Terraform Ansible Packer GCP Python

On-demand game server provisioning across GCP regions using 32+ Terraform modules, Ansible, and Packer. Used to take a few hours per server, now takes 15 minutes. Not glamorous work, but the team stopped waiting on infra.

→ Read writeup
Bare-Metal BGP Edge Network
BGP BIRD k3s MetalLB Linux

Anycast edge across 9 data centres using BGP (BIRD). Same /24 announced from every PoP. Users hit their nearest server automatically. Migrated off AWS while traffic was running. $80K/month down to $5K. TTFB under 50ms everywhere.

→ Read writeup

./blog

→ View all 5 posts

./contact

Looking for roles in EU · UK · UAE · APAC · Canada
↗ Book a 30-min call

Response time: usually within 24 hours