๐จโ๐ป Principal Engineer
        April 2025 โ Present
        As a Principal Engineer, I am responsible for driving technical excellence, architectural decisions, and mentoring engineering teams. In this role, I will focus on:
        
          - Leading technical strategy and architecture for large-scale systems
- Mentoring senior engineers and technical leads
- Driving innovation through emerging technologies and best practices
- Ensuring system scalability, reliability, and performance
 
      
      
        ๐ฉโ๐ป SRE III โ BookMyShow
        BigTree Entertainment Pvt. Ltd. | Apr 2021 โ Present
        As a senior SRE, I've been instrumental in designing scalable infrastructure, ensuring high availability for large-scale events, and embedding reliability across the CI/CD lifecycle.
        
          ๐ ๏ธ CI/CD Architecture & Release Automation
          
            - Standardized CI/CD across teams using GitLab, Bitbucket, and Bamboo.
- Integrated SonarQube for quality gates; cut production issues by 30%.
- Enabled reusable deployment templates with safe rollback support.
 
        
          โ๏ธ Cloud Migration & Infra Modernization
          
            - Migrated core workloads from VMware & GCP to AWS with EKS, EC2, RDS.
- Replaced JFrog with Amazon ECR for better cost and container management.
- Automated infra provisioning via CloudFormation & Ansible.
 
        
          ๐ Disaster Recovery Implementation
          
            - Built a multi-region DR architecture with RDS cross-region replication, S3 backups, and Route 53 failover.
- Authored DR runbooks and executed regular failover drills.
- Reduced RTO from 4h to <30 mins across critical services.
 
        
          ๐ Scalability for High-Traffic Events
          
            - Handled peak loads (5x+ traffic) during the Cricket World Cup and concerts.
- Tuned EKS with HPA, disruption budgets, and circuit-breakers.
- Monitored with Grafana, Prometheus, synthetic testing, and APM tools like New Relic and ELK Stack APM
 
        
          ๐งฉ Istio-Based Service Mesh Deployment
          
            - Introduced advanced traffic routing, retries, mirroring, and observability for microservices.
- Improved service resilience and debugging via sidecar telemetry and distributed tracing with Jaeger.
 
        
          ๐ง  Reliability Culture & Team Enablement
          
            - Led incident response for P0s, with detailed postmortems and RCA reviews.
- Trained new SREs on Kubernetes, observability tools, and CI/CD platforms.
- Documented internal architecture and DR knowledge base.
 
       
      
      
        ๐ง DevOps Deployment Engineer โ Zycus
        Oct 2017 โ Nov 2019
        Zycus is a global leader in Source-to-Pay procurement software, empowering enterprises with automation-driven solutions.
        
          - ๐ GitLab Access Automation: Automated GitLab access control using Python scripts for streamlined onboarding.
- ๐งช CI Pipeline Hardening: Integrated GitLab CLI with Jenkins, SonarQube, and Nexus to enforce quality gates.
- ๐ Release Management: Coordinated deployment planning across multiple non-prod and prod environments.
- โ๏ธ Hybrid Cloud Management: Provisioned and maintained infra on AWS, Navisite, and VMware platforms.
- ๐ฆ AWS Services Integration: Deployed VPC, EC2, ALB, Auto Scaling, and S3 in scalable infra setups.
- ๐ณ Docker-Based Dev Envs: Enabled isolated dev/testing using Docker Compose and shared base images.
- ๐ Developer Enablement: Guided devs in creating Dockerfiles and containerizing local apps.
- ๐ง Ansible Configuration: Managed infra configuration and app deployments via Ansible roles/playbooks.
- ๐งญ Consul for Service Discovery: Leveraged Consul to manage dynamic service configurations.
- ๐ Web Server Config: Served applications via Apache, Nginx, and HAProxy for high availability.
- ๐ก๏ธ CI/CD Quality Assurance: Ensured stable deployments through robust infra testing and rollout strategies.
 
      
      
        ๐ฉบ DevOps Engineer โ Doctor Insta (via OpsTree Solutions)
        June 2016 โ September 2017
        Doctor Insta is a telehealth platform offering digital primary care and remote doctor consultations across India.
        
          - ๐ง Infra Automation with Ansible: Created reusable roles/playbooks for consistent infra provisioning.
- โ๏ธ AWS Infrastructure Management: Managed EC2, RDS, VPC, S3, and Route 53 across environments.
- ๐ฆ Region Migration: Led successful production migration across AWS regions with minimal downtime.
- ๐ Git Hosting via Gitolite: Deployed and maintained secure, internal Git repositories.
- ๐ CI/CD with Jenkins: Automated deployments and app builds using job-based pipelines.
- ๐พ DB Backup Automation: Scheduled daily backups with secure uploads to AWS S3.
- ๐ Monitoring with Zabbix: Implemented end-to-end infra and app monitoring for alerts and health checks.
- ๐ Python App Deployment: Deployed apps in isolated virtual environments to ensure dependency consistency.