Summary
Overview
Work History
Education
Skills
Tools
Timeline
Generic

Supriya Sheshu

San Jose

Summary

Devops leader with over a decade of experience in managing infrastructure for billion dollar revenue products in a dynamic fast paced environment. Spearheaded the development and operation of large-scale, business-critical applications, significantly enhancing product performance and reliability. Adept at fostering innovation and proven track record of migrating to microservices and cloud platforms. Demonstrated ability to collaborate with cross-functional teams and manage engineers across different geographical locations. Results-driven with a pragmatic approach and has a track record of process optimization and successful project delivery.

Overview

15
15
years of professional experience

Work History

Production Engineering

Yahoo
01.2013 - Current

Senior Engineering Manager Dec 2023 - Current

Principal Production Engineer Dec 2020 - Nov 2023

Senior Production Engineer April 2018 - Nov 2020

Production Engineer Jan 2013 - March 2018

  • Production engineering lead for search and native advertising marketplace that generates billion dollar annual revenue serving 6 billions ads calls and generating 5 million customer reports daily
  • Successfully lead teams across the US, Taiwan and India focusing on application's advertiser interfaces, reporting and analytics
  • Collaborated closely with Development and Product teams
  • Key member of the project since its inception in 2013 to productionize, optimize, scale and manage the application to eventual shutdown
  • Built and scaled infrastructure from 4 hosts to 10000+
  • Technical lead to migrate monolithic applications architecture to microservices using docker and kubernetes and On-premises applications to Amazon public cloud
  • Designed and implemented a robust business continuity and disaster recovery plan for the application resulting in a 20% reduction in customer-reported incidents and on-call duty
  • Managed and monitored distributed batch processing and real time data streams, processing ~50TB daily and storing ~30PB of data
  • Collaborated with stakeholders to develop strategies for piloting, pushing and rolling back changes and built the CI-CD pipelines
  • By streamlining production quality controls achieved a 99% success rate in product launches
  • Participated in design and architecture review meetings to ensure applications can meet security, scalability, performance and quality standards in accordance with agreed upon SLA
  • Designed and implemented in-house chaos engineering framework, boosting system resilience
  • Setup escalation process and runbook, developed comprehensive training program for tier 1-3 support folks to enable 24x7 operational support
  • Ran incident post mortem management meetings
  • Improved product operability and cost efficiency
  • Implemented an automated self healing framework to trigger operational actions based on predetermined KPIs, reducing manual intervention
  • Built dashboards to monitor the application performance and behavior
  • Helped analyze bottlenecks and troubleshoot issues easily
  • Demoed proof of concepts, presented product operability metrics and migration success stories at brown bag sessions
  • Directed all facets of people management from hiring and mentoring to goals setting and performance reviews.

Software Engineer

EMC Corporation
01.2009 - 01.2011
  • Designed and developed application to diagnose availability, performance and business continuity of various network storage devices in the On-premises data center.

Education

MS in Software Engineering -

San Jose State University
01.2012

BE in Computer Science -

R.V. College of Engineering
01.2009

Skills

  • Cloud computing
  • Big data processing
  • Site reliability engineering
  • Network optimization and troubleshooting
  • Technical project management
  • Communication
  • Team management

Tools

  • CI-CD - Jenkins, Screwdriver
  • Configuration management - Chef, Ansible
  • Container orchestration - kubernetes, docker, Amazon EKS
  • Infrastructure as code - AWS Cloud Formation
  • Observability Tools - Nagios, Splunk,Grafana, Prometheus, Amazon CloudWatch
  • Programming - python, bash
  • Big data processing - Hadoop, Apache Storm, Apache Druid, Oozie, Hive, AWS EMR, S3, Glacier, Athena

Timeline

Production Engineering

Yahoo
01.2013 - Current

Software Engineer

EMC Corporation
01.2009 - 01.2011

MS in Software Engineering -

San Jose State University

BE in Computer Science -

R.V. College of Engineering
Supriya Sheshu