Summary
Overview
Work History
Education
Skills
Accomplishments
Certification
ORGANIZATIONS
Languages
Timeline
Generic

Midhun Ms

Barrie

Summary

Experienced DevOps Engineer with expertise in automating infrastructure, managing CI/CD pipelines, and enhancing system reliability. Proven track record of implementing scalable solutions and optimizing deployment processes. Demonstrated proficiency in cloud services and scripting, collaborating effectively with cross-functional teams to drive operational excellence.

Overview

8
8
years of professional experience
1
1
Certification

Work History

DevOps Engineer Specialist

NTT DATA
06.2023 - Current
  • Application Troubleshooting & Workflow Optimization: Resolved application-related issues in Linux environments and optimized workflows for100+ applications
  • Automated tasks using Ansible while aligning with SRE principles to improve system reliability
  • Server & Middleware Administration: Performed WebLogic and Tomcat upgrades, server restarts, and OpenShift POD recycles
  • Ensured system performance and uptime through proactive middleware management
  • Automation & CI/CD Pipelines: Built CI/CD pipelines using tools like Jenkins and CloudBees to streamline deployments
  • Developed automation scripts with Ansible, Python and BASH improving operational efficiency and deployment processes
  • AI/ML Implementation: Worked on AI-based solutions using LLM (Genie AI) models and ML pipelines to efficiently manage application bottlenecks and enhance operational performance
  • Defect Management & Root Cause Analysis: Investigated and resolved defects in pre-production and production environments using ServiceNow and JIRA
  • Conducted root cause analysis and documented findings for continuous improvements
  • Version Control & Collaboration: Oversaw GIT administration and managed tools like Artifactory and Logstash
  • Ensured version control, build quality, and seamless collaboration with IT and business teams
  • Monitoring & Logging: Implemented logging and monitoring solutions using Dynatrace, Splunk, AWS CloudWatch and Prometheus
  • Built custom metrics for EC2 and S3 instances to ensure system health and faster issue resolution
  • System & Process Improvement: Continuously optimized systems and processes to enhance performance and reliability, incorporating feedback and leveraging tools like Dynatrace for analysis
  • Improved database efficiency and resolved errors across MongoDB, PostgreSQL and Oracle DB based environments
  • Documentation & Knowledge Sharing: Created comprehensive documentation for processes, workflows, and incident management
  • Provided training and support for L1/L2 support teams, ensuring smooth knowledge transfer and effective issue resolution
  • Designed and implemented infrastructure solutions using IaC tools like Terraform and CloudFormation, ensuring efficient resource management and scalability
  • Leveraged AWS services such as EC2, RDS, ELB, Route53, S3, and SES to provision resources and maintain system reliability
  • Achievements/Tasks

Senior System Engineer

HCLTech
10.2016 - 11.2021
  • Code Deployment and Transition: Managed seamless code transitions from development to production, ensuring efficient delivery with minimal downtime
  • Conducted deployment activities, tested environments with code changes, and prepared QA/test environments for production releases
  • Application Support: Provided L2/L3 support for bank, mobile, and credit card applications, addressing critical issues and improving system reliability
  • Vendor Collaboration: Participated in new code drops and vendor discussions for bank-related applications, working with vendors and application teams to automate routing files, enhancing process efficiency
  • Middleware Management: Installed, configured, deployed, and administered technologies such as IBM WebSphere Application Server, IBM WebSphere MQ, IIB, RQOM, IBM HTTP Server, Apache, and Tomcat Webservers
  • Performance Monitoring: Monitored production systems using tools like Splunk to identify root causes of issues, diagnosed and recommended effective solutions
  • Monitored system performance metrics, and resolved critical incidents (P1/P2), reducing downtime and improving application availability
  • Incident Management: Collaborated with L1/L2 teams for incident resolution, participated in bridge calls for critical issues, and represented team in technical discussions with customers
  • Backup and Recovery: Led backup/restore operations, system recovery activities, and provided24x7 support, opening PMRs with IBM for production and development issues
  • Managed production patching processes, ensuring smooth deployment and conducting post-deployment health checks to verify system stability and functionality
  • Executed routine database management tasks, including schema installation, configuration
  • Collaborated in daily Scrum meetings and participated in bridge calls for project status updates, while efficiently addressing ad-hoc client requests to meet dynamic business needs
  • Achievements/Tasks

Education

Artificial Intelligence- Architecture, Design and Implementation (Honours) -

Georgian College of Applied Arts And Technology
Barrie, ON
04-2023

Bachelor of Engineering - Electronics and Communication Engineering

Nehru Institute of Engineering and Technology
Coimbatore, TN
05.2015

Skills

  • IBM Websphere Application Server
  • Oracle Weblogic
  • Python
  • Jenkins
  • Cloudbees
  • GitHub Collaboration
  • Splunk
  • Dynatrace
  • AWS Cloudwatch
  • Linux
  • OpenShift
  • Kubernetes
  • Machine learning and DL Algorithms
  • Logstash
  • IBM MQ
  • Apache and Tomcat Webservers
  • Application Support
  • SSL security certs and Keystore
  • F5 load balancer and Routing
  • AWS, EC2 and S3
  • Urban Code Deploy
  • MongoDB
  • Oracle DB
  • Postgre-SQL
  • JIRA
  • Confluence
  • ServiceNow
  • Networking concepts - TCP/IP,DNS,DHCP
  • BASH and Python Scripting
  • Ansible Playbooks, YAML files
  • Prometheus
  • Monitoring and logging
  • Infrastructure automation
  • Performance optimization
  • Maintenance and troubleshooting
  • Developer collaboration
  • Incident management
  • Meeting participation
  • Continuous integration/Continuous deployment
  • Agile

Accomplishments

  • Resolved product issue through consumer testing.
  • Achieved 99% issue resolution by completing Incidents with accuracy and efficiency.
  • Collaborated with team of 10 Developers in the development of Genie AI a monitoring tool for application issues.
  • Supervised team of 5 staff members.
  • Achieved 90% operational and performance efficiency through effectively helping with application monitoring using monitoring tools like AWS cloudwatch ,Dynatrace etc..

Certification

  • AWS Academy Graduate - AWS Academy Cloud Foundations (03/2023 - Present)
  • AWS Academy Graduate - AWS Academy Machine Learning Foundations (03/2023 - Present)
  • Cloud Essentials -IBM (06/2021 - Present)
  • Site Reliability: Tools & Automation (09/2023 - Present)
  • Best Practices for the SRE: Automation (10/2023 - Present)

ORGANIZATIONS

  • NTT DATA (06/2023 - Present)
  • HCLTech (10/2016 -11/2021)

Languages

English
Full Professional

Timeline

DevOps Engineer Specialist

NTT DATA
06.2023 - Current

Senior System Engineer

HCLTech
10.2016 - 11.2021

Bachelor of Engineering - Electronics and Communication Engineering

Nehru Institute of Engineering and Technology
  • AWS Academy Graduate - AWS Academy Cloud Foundations (03/2023 - Present)
  • AWS Academy Graduate - AWS Academy Machine Learning Foundations (03/2023 - Present)
  • Cloud Essentials -IBM (06/2021 - Present)
  • Site Reliability: Tools & Automation (09/2023 - Present)
  • Best Practices for the SRE: Automation (10/2023 - Present)

Artificial Intelligence- Architecture, Design and Implementation (Honours) -

Georgian College of Applied Arts And Technology
Midhun Ms