$ whoami =>
Building resilient infrastructure at scale. Automating everything between code and production. 5 years obsessing over uptime, observability, and pipelines that never fail.
Certifications
16 certifications across AWS · Azure · GCP · Others
Amazon Web Services
Microsoft Azure
Google Cloud Platform
Others
Toolbelt
Cluster management, Helm, custom operators, HPA/VPA autoscaling.
92% proficiency
IaC at scale, multi-env state, modules, Atlantis GitOps.
89% proficiency
EKS, Lambda, RDS, S3, CloudFront, IAM, VPC architecture.
95% proficiency
GKE, Cloud Run, Pub/Sub, BigQuery, Artifact Registry.
80% proficiency
ArgoCD, Flux, GitHub Actions, progressive delivery + Flagger.
87% proficiency
Prometheus, Grafana, Loki, OpenTelemetry, PagerDuty SLOs.
85% proficiency
ML-driven alert classification reducing false positives 80% and auto-correlating incidents across microservices.
REST API over Kubernetes events for real-time cluster observability and webhook-driven alerting for ops teams.
End-to-end SRE framework for Kafka with auto-remediation, consumer lag alerts, and SLO dashboards at 10M msg/sec.
● Last updated: · Auto-refreshing every 8s
Service Health
API Latency — Last 60s
● LIVE16.1 / 24 cores
59.2 / 80 GB
Active Alerts
Live System Logs
● STREAMINGOpen to DevOps / SRE roles, freelance infra work, and open-source collaboration. Response within 24 hours.
Open to full-time DevOps/SRE roles, contract infra work, and open-source contributions.
$ whoami =>
Building resilient infrastructure at scale. Automating everything between code and production. 5 years obsessing over uptime, observability, and pipelines that never fail.
Certifications
16 certifications across AWS · Azure · GCP · Others
Amazon Web Services
Microsoft Azure
Google Cloud Platform
Others
Toolbelt
Cluster management, Helm, custom operators, HPA/VPA autoscaling.
92% proficiency
IaC at scale, multi-env state, modules, Atlantis GitOps.
89% proficiency
EKS, Lambda, RDS, S3, CloudFront, IAM, VPC architecture.
95% proficiency
GKE, Cloud Run, Pub/Sub, BigQuery, Artifact Registry.
80% proficiency
ArgoCD, Flux, GitHub Actions, progressive delivery + Flagger.
87% proficiency
Prometheus, Grafana, Loki, OpenTelemetry, PagerDuty SLOs.
85% proficiency
ML-driven alert classification reducing false positives 80% and auto-correlating incidents across microservices.
REST API over Kubernetes events for real-time cluster observability and webhook-driven alerting for ops teams.
End-to-end SRE framework for Kafka with auto-remediation, consumer lag alerts, and SLO dashboards at 10M msg/sec.
● Last updated: · Auto-refreshing every 8s
Service Health
API Latency — Last 60s
● LIVE16.1 / 24 cores
59.2 / 80 GB
Active Alerts
Live System Logs
● STREAMINGOpen to DevOps / SRE roles, freelance infra work, and open-source collaboration. Response within 24 hours.
Open to full-time DevOps/SRE roles, contract infra work, and open-source contributions.