Site Reliability Engineer/Devops with over 15 years of experience in designing, implementing, and managing distributed systems and infrastructure. Skilled in optimizing AI and machine learning infrastructure, automating processes, and enhancing the reliability and performance of cloud-based systems. Extensive expertise in Kubernetes, CI/CD pipelines, cloud platforms, and monitoring tools. Proven ability to bridge the gap between development and operations to drive reliability, scalability, and efficiency.
Site Reliability Engineer
Domino Data Lab, San Francisco, CA 2021 – Present
CKA: Certified Kubernetes Administrator.
CKAD Certified Kubernetes Application Developer
HashiCorp Certified: Terraform Associate.
Microsoft Azure FundamentalsAZ-900
AWS Certified Cloud Practitioner
MYSQL DBA Administrator
Linkedin Certifications on Kubernetes, Big Data.