
SAI KIRAN REDDY KOPPULA

GTA, Ontario

Summary

Seasoned Site Reliability Engineer with 6+ years of experience planning and deploying application tooling to meet organizational goals. Skilled at writing quality code that improves software automation, accuracy, agility, and security in line with current industry standards. Established history of consulting with Dev, QA, security, and IT operations staff to improve software development effectiveness. Resourceful when overseeing group projects and providing detailed analysis to craft practical processes.

Overview

6 years of professional experience
1 Certification

Work History

Site Reliability Engineer

HP Inc.
Kitchener
03.2022 - Current
  • Developed and implemented monitoring solutions to improve system reliability.
  • Performed root cause analysis of production incidents and provided recommendations for improvement.
  • Collaborated with development teams to ensure proper release engineering practices are followed.
  • Provided technical guidance on the design, implementation, and maintenance of cloud infrastructure.
  • Implemented automation tools to increase efficiency in deployment processes.
  • Monitored system performance using metrics such as latency, throughput, and availability.
  • Ensured high availability and scalability of applications across multiple environments.
  • Researched and evaluated new technologies to enhance platform reliability and stability.
  • Created automated scripts for software deployments and configuration management tasks.
  • Troubleshot complex issues related to application architecture and system configurations.
  • Configured Kubernetes clusters for container orchestration in a multi-cloud environment.
  • Maintained security policies for the organization's cloud services according to industry standards.
  • Optimized existing infrastructure components for cost savings while meeting compliance requirements.
  • Documented best practices and procedures for incident response activities.
  • Conducted regular reviews of alerts generated by monitoring tools to identify potential issues.
  • Performed capacity planning activities based on current usage trends and future projections.
  • Participated in post-mortem reviews following major outages or incidents.
  • Assisted with troubleshooting network connectivity issues between servers located in different regions.
  • Provided training sessions on SRE principles and best practices to team members.
  • Assisted with developing service level objectives for critical services.
  • Provided continuous process improvement and preventive and corrective actions to facilitate operational efficiency.
  • Developed strategies for disaster recovery plans that ensured minimal downtime during an outage.
  • Developed automation scripts and tools to streamline the deployment process.
  • Collaborated with development teams to ensure application deployments were successful.
  • Provided technical guidance on best practices related to DevOps engineering roles.
  • Monitored user activity to detect potential security threats or anomalies.
  • Automated server provisioning using configuration management tools such as Ansible and Chef.
  • Managed the roll-out of software updates across multiple servers in the environment.
  • Configured cloud services utilizing Amazon Web Services.
  • Established logging solutions such as the ELK Stack for monitoring system performance metrics.

Site Reliability Engineer

Sun Life Assurance Company
Toronto, Ontario
08.2021 - 03.2022
  • Developed and implemented monitoring solutions to improve system reliability.
  • Performed root cause analysis of production incidents and provided recommendations for improvement.
  • Performed troubleshooting on a variety of issues impacting system health or performance.
  • Created detailed documentation of processes, procedures, and standards utilized in the environment.
  • Managed the roll-out of software updates across multiple servers in the environment.
  • Deployed containerization solutions such as Docker to improve application portability.
  • Integrated third-party APIs into existing systems for improved functionality.
  • Reviewed application architecture designs with developers to ensure scalability requirements were met.
  • Assisted with capacity planning activities by analyzing current usage trends and making recommendations based on findings.
  • Deployed applications using automated tools such as Chef, Ansible or Puppet.
  • Deployed applications onto production servers using configuration management tools like Ansible.
  • Configured and maintained server stacks using Ansible Playbooks.

Associate Consultant

iLenSys Technologies Pvt. Ltd.
Hyderabad, Telangana
07.2016 - 08.2019
  • Monitored system performance and implemented necessary changes to maintain optimal utilization of resources.
  • Installed, configured, maintained and upgraded Linux operating systems and services.
  • Provided technical support for Linux-based systems including troubleshooting of server applications and hardware issues.
  • Evaluated security protocols to ensure compliance with internal policies and external regulations.
  • Developed scripts in Bash and Python to automate system administration tasks.
  • Configured network components such as routers, switches, firewalls, and VPNs according to organizational requirements.
  • Deployed updates, patches and hotfixes on a regular basis to keep the system secure from vulnerabilities.
  • Maintained documentation of system configurations, software installations, and user accounts for future reference.
  • Resolved conflicts between processes running simultaneously on the same server by optimizing resource allocations.
  • Configured and updated Linux servers with latest releases and patches.
  • Maintained minimum organizational performance threshold for Linux server-based operations.
  • Provided technical support during deployments to troubleshoot any issues that arose.
  • Maintained a library of artifacts for each release cycle, including installation packages, configuration files, and test results.
  • Identified opportunities for improving existing processes by implementing new automation techniques.
  • Resolved conflicts between different versions of libraries used by various applications.
  • Worked closely with systems analysts, engineers and programmers to understand limitations, develop capabilities and resolve software problems.
  • Reviewed project requirements to identify customer expectations and resources needed to meet goals.

Education

PG Diploma

Confederation College of Applied Arts and Technology
Thunder Bay, ON
05.2021

Bachelor of Technology

Jawaharlal Nehru Technological University
Hyderabad
04.2016

Skills

  • Version Control Tools: GitHub, Bitbucket
  • Languages: Shell, Python, Groovy, YAML, JSON
  • Databases: MongoDB, MySQL, Amazon Aurora, DynamoDB
  • Tools: Jenkins, Azure Pipelines, Ansible, Nexus, SonarQube, Maven, ELK Stack, JMeter, Apigee, Terraform, AWS CloudFormation, AWS Lambda, AWS CDK
  • Monitoring Tools: Splunk, Splunk Observability, New Relic, Grafana, OpenSearch, Datadog
  • Message Queues: RabbitMQ, BullMQ, SQS, Apache Kafka
  • Cloud Services: AWS, Microsoft Azure
  • Orchestration: Docker, Kubernetes, AWS ECS, AWS EKS

Certification

  • Professional Scrum Master, Scrum.org
