Result-driven SRE/Lead DevOps known for high productivity and efficient task completion. Specialize in continuous integration and deployment (CI/CD), infrastructure as code (IaC), and cloud services management. Excel in teamwork, problem-solving, and adaptability, using these soft skills to navigate complex project environments and drive successful outcomes.
Overview
1
1
Certificate
5
5
years of post-secondary education
11
11
years of professional experience
Work History
Site Reliability Engineer
Mastercard
Vancouver, British Columbia
04.2025 - Current
Constructed a robust FinOps framework encompassing governance, cost ownership, and KPI metrics.
Defined tagging standards for streamlined financial oversight.
Performed monthly cost reviews across multiple AWS accounts to maintain budget compliance.
Configuring and managing code on Terraform for various teams and applications.
Implemented automated CI/CD pipeline with GitHub Actions to facilitate infrastructure deployment via Atlantis.
Managing various helm charts for each microservice and distinct environment configuration for deployment.
Established SLO and SLA parameters within Datadog, linking alerts to PagerDuty.
Designing prompts and agents on GitHub Copilot to construct RCAs and support incident management.
Assisted team in establishing incident and change management processes on Remedy.
Engage in on-call triage for production incidents and compose root cause assessments for same.
Presented team to CAB for production changes, ensuring alignment with organizational goals.
Participated in annual audits for all products and runbooks, enhancing compliance measures.
Assisted in framing SOD (Segregation of Duties) documents to establish clear operational guidelines.
Participated in on-call rotations to provide 24/7 support for critical systems.
Site Reliability Engineer
Lululemon
Vancouver, British Columbia
05.2024 - 06.2025
Managed infrastructure on AWS utilizing Terraform and Helm charts.
Employed Terraform variables across distinct workspaces for each environment.
Orchestrated Kubernetes-native workloads (EKS) and managed
30+ microservices across 4 environments.
Integrated vault with GitLab CI/CD pipeline for enhanced security and deployment efficiency.
Created custom pipelines for each microservice across four distinct environments.
Streamlined deployment processes by implementing tailored CI/CD solutions in GitLab.
Configured and maintained Apache Airflow on EKS for enhanced performance.
Transitioned executor type from Celery to Kubernetes for improved task management.
Upgraded Apache Airflow from 2.4.3 to 2.9.1 to optimize Kubernetes functionality.
Maintaining Apache Airflow and troubleshooting day to day issues such as - DAG failure, DAG import failure etc.
Managing multiple RDS both SQL and Postgres.
Worked on AWS cost optimization and brought down monthly costs
significantly.
Creating Dashboards, Alerts and Infra Monitoring using terraform on Datadog for multiple teams.
• Achieved S3 object replication via Lambda functions, optimizing data transfer processes. Enabled secure vendor collaboration by implementing assume role for S3 data sharing.
Facilitated post-mortem and RCA culture to minimize repeat incidents through actionable insights.
Also engaged in DR team and facilitated DR deployment using Terraform.
Lead DevOps Engineer
ICOM AI
Surrey, British Columbia
05.2022 - 02.2024
Engineered and sustained AWS cloud infrastructure, optimizing costs while ensuring availability and scalability for over 250 car dealerships.
Directed DevOps team throughout project lifecycles, ensuring seamless delivery and implementing automation strategies to enhance workflow.
Directed integration of best practices in CI/CD infrastructure as code alongside configuration management. Championed initiatives for optimized CI/CD workflows, improving overall deployment effectiveness.
Spearheaded implementation of GitLab as innovative VCS and CI/CD solution in self-hosted environment.
Established zero downtime CI/CD pipeline utilizing GitLab CI/CD
Deployed Terraform to establish AWS infrastructure from scratch for multiple projects.
Created and managed EKS clusters to deploy python based microservices via GitLab CICD.
Led the implementation of security measures, including SSL certificates, SOC2 compliance, and Single Sign-On.
Established monitoring, alerting, and observability frameworks utilizing Datadog and AWS CloudWatch.
Software Engineer II (DevOps & Cloud)
Careerbuilder
Noida, UP
12.2019 - 12.2021
Managed and optimized 32 AWS accounts across multiple teams, driving cloud cost reduction and operational efficiency through Cost Optimization best practices and resource optimization initiatives.
Performed regular cloud cost analysis and utilization assessments to improve budget adherence and optimize AWS resource consumption.
Formulated deployment strategies for cloud-based applications.
Supported team with importing existing infrastructure into Terraform.
Assessed system performance, recognized issues, and deployed solutions to streamline operations.
Established, administered, and supervised cloud-based services including AWS EC2, S3, EBS, ELB, RDS utilizing Terraform and Ansible.
Established DevOps best practices encompassing infrastructure as code and configuration management.
Conducted root cause analysis on incidents occurring in production environment to avert future occurrences.
Leveraged Kubernetes and containerization technologies for efficient deployment of applications across various clusters.
Sr. Software Engineer (DevOps & Cloud)
3Pillar Global
Noida, UP
10.2018 - 10.2019
Administered multi-faceted infrastructure across AWS platforms.
Engineered and designed infrastructure to ensure high availability and scalability.
Establish Terraform for efficient AWS resource configuration with custom scripts
Implemented CICD pipeline and achieved zero downtime
deployments.
Connected CloudWatch to Loggly, improving log monitoring and detailed analysis.
Utilized Mosso Cloud to adjust promotional amounts for a client.
Assisted in achieving 24/7 monitoring using CloudWatch.
Configured IAM roles and policies in alignment with team and group structures.
DevOps Specialist
Tec primo Solutions
Noida, UP
05.2018 - 10.2018
Deployed and configured web servers, databases, application
servers, and other related systems.
Automated the process of deploying new software releases with
continuous integration tools such as Jenkins.
Configured logging and monitoring services such as ELK Stack
Developed scripts for configuration management and system automation tasks.
Utilized containerization technologies like Docker Swarm and Kubernetes to deploy applications across multiple clusters.
Analyst
HCL Tech
Noida, UP
09.2015 - 04.2018
Administered, monitored, and resolved backup issues across multiple servers utilizing Symantec NetBackup, IBM tape libraries, and HPDP.
Conducted data backups and facilitated disaster recovery procedures.
Ensured the successful execution of all scheduled backup jobs with minimal
disruption to business operations.
Configured tape libraries and robotic loaders for automated
tape handling processes.
Installed and configured new backup hardware components
when required.
Assisted in the development of policies and procedures related to
backup operations and disaster recovery plans.
Monitored storage utilization trends to ensure effective capacity
planning was in place.
Implementing disaster recovery plans and performing regular data
backups to ensure data integrity and availability in case of
emergencies.
Site Reliability Engineer at Adobe Systems India Pvt. Ltd. (Payroll Teamlease Digital)Site Reliability Engineer at Adobe Systems India Pvt. Ltd. (Payroll Teamlease Digital)