Summary
Overview
Work History
Education
Skills
Websites
Certification
Accomplishments
Projects
Timeline
Generic
Saketh Oppula

Saketh Oppula

Montreal,QC

Summary

Experienced Professional in Data Science, Data Engineering and Data Governance with 5 + Years of Expertise in Predictive Analytics, NLP, and Computer Vision. Skilled in Data Visualization and Product Management. Proven Ability to Drive Business Insights and Impact

Overview

5
5
years of professional experience
1
1
Certification

Work History

Senior Data Scientist

General Electric - GE
04.2021 - 09.2022
  • Led and drove the strategy of the Computer Vision domain as a domain lead and scrum master using agile methodology, resulting in a 40% increase in project delivery efficiency
  • Created, designed, and developed projects on OCR, image wear detection, object detection, and deep learning, achieving an average accuracy up to 94% in computer vision models
  • Applied predictive and prescriptive analytics across various business domains, including Finance, Supply-chain, Operations, and EHS, leading to a cost reduction of 70% in the supply chain and a significant improvement in EHS compliance.

Data Engineer

General Electric - GE
06.2019 - 04.2021
  • Improved data governance in the Data Lake ecosystem with the help of PyLinea
  • Managed and utilized Python, Tableau, and Graph DB to solve business-critical use cases in Power Data Lake
  • As a Project Manager, designed, architected, and developed PyLinea, a comprehensive Data Lineage Tool
  • Engaged with business leaders and stakeholders, including Chief Data Officers, throughout the process
  • Responsible for the entire Data Lineage and Data Governance Ecosystem, innovating and designing to achieve a well-developed PyLinea tool, utilized across teams
  • Led a team in developing a data lineage tool, resulting in a 90% cost reduction and an 80% decrease in data redundancy
  • Used technologies: Python, Neo4j, Vue.js, Flask, and Tableau

Data Engineering Specialist

General Electric - GE
06.2017 - 06.2019
  • Developed Talend ETL pipelines and resolved challenges in building a self-serve visualization layer using Tableau, resulting in a significant improvement in dashboard production time
  • Created maintained and KPI based Tableau data sources and dashboards for CDO and CIO of the organization
  • Volunteered and contributed to multiple Python-based utility tools that contributed to the AWS migration of the ecosystem leading to 80% improvement in data storage efficiency and storage cost.

Intern

GE Digital
  • As an intern in GE Digital, designed and developed in the domain of Data Governance, Data Lake simplification and cleanup
  • Influenced the design of a user cleanup process focused on Data base and Data Visualization platforms using PL/SQL, Python, and RPA.

Education

Master of Science - Computer Science

Concordia University
Montreal, QC
01-2024

Bachelor of Science - Computer Science

Amrita University
06-2017

Skills

  • Python
  • SQL
  • Java
  • C
  • PostgreSQL
  • JavaScript
  • Tableau
  • PowerBI
  • Neo4j
  • Talend
  • Spark
  • Vuejs
  • MongoDB
  • SSIS
  • SQL Server
  • Hadoop
  • MS Office
  • Greenplum
  • Snowflake
  • SAS
  • Jupyter Notebook/ Lab
  • Erwin Data Modeler
  • Linux
  • CentOS
  • Windows
  • ScikitLearn
  • PyTorch
  • Keras
  • Pandas
  • TensorFlow
  • PySpark
  • Agile Methodology/ Scrum
  • Product Management
  • Lean Methodology
  • Operational Excellence
  • Solution Architecture

Certification

  • Completed Analytics Engineering Program Certification offered by GE Research by successfully delivering business critical predictive analytics solution.
  • Certified in Business Essential Leadership Skills – BELS offered by GE Corporate.

Accomplishments

  • Avengers - DnA HUB Award - Received this award in recognition of my diligent efforts towards my contribution in AWS Migration at GE Power.
  • Spotlight Award - Awarded with Spotlight Award for three consecutive years for my contributions in Data Governance and Data Science at GE, for successfully driving, architecting, and developing a data lineage tool/ Framework at GE and for successfully driving and developing the phase 1 of Safety Oculus - An AI based Industrial EHS Monitoring Framework.
  • Impact Award - Received this award in recognition for delivering critical outcomes in the Self-Serve Analytics at GE.

Projects

PyLinea - Data Lineage Tool (Product Manager, Scrum Master, Architect, Core Developer):

Led the conception, development, and implementation of PyLinea, a groundbreaking Data Lineage Tool. As Product Manager, Scrum Master, and Core Developer, I devised the entire architectural framework, utilizing abstract syntax tree notations and a robust graph database for end-to-end lineage within the Data Lake ecosystem.

Key Responsibilities:

  • Led the conceptualization, design, and development of PyLinea, defining the strategic vision and objectives of the tool. Conducted thorough assessments of business requirements and engaged with stakeholders, including Chief Data Officers, to align PyLinea with organizational goals.
  • Innovated and formulated the entire architectural framework of PyLinea, ensuring scalability, flexibility, and adaptability to diverse data environments.
  • Embraced an agile methodology as the Scrum Master, fostering a collaborative and iterative development process.
  • Organized and led scrum ceremonies, ensuring efficient communication within the development team, resulting in a 40% increase in project delivery efficiency.
  • Acted as the core developer, leading the actual implementation of PyLinea's features and functionalities. Utilized abstract syntax tree notations for objects to capture intricate data relationships, establishing a clear and concise representation of data flow within the ecosystem.
  • Leveraged a robust graph database to store and manage the complex interconnections, enabling seamless navigation of data lineage. Expanded PyLinea ecosystem by introducing complementary products, such as a log parser and a GDPR/sensitive data monitor.
  • The log parser facilitated efficient parsing and interpretation of data logs, contributing to improved data governance and compliance.
  • The GDPR/sensitive data monitor addressed privacy concerns by identifying and monitoring sensitive data, ensuring adherence to regulatory requirements.

The success of PyLinea extends beyond lineage, creating an integrated ecosystem that significantly enhances data governance and compliance.

Employee Health Service using AI / Safety Oculus (Product Manager, Scrum Master, Architect, Core Developer):

Served as the driving force behind the innovative Employee Health Service using AI, known as Safety Oculus. As the Product Manager, Scrum Master, Architect, and Core Developer, I orchestrated the entire lifecycle of the project, from conceptualization to implementation.

Key Responsibilities:

  • Conceptualized, designed, and developed Safety Oculus—an AI system ensuring worker safety by verifying proper PPE usage. Aligned the product strategically with organizational goals, translating scientific insights into practical AI use cases.
  • Led architectural design for Safety Oculus, ensuring scalability and efficiency. Implemented Visual Computing techniques (YOLO, Faster R-CNN) for enhanced PPE detection and improved safety compliance.
  • Took on the role of Scrum Master, steering agile development to maximize collaboration. Acted as the primary point of contact, fostering communication between key stakeholders, including CIOs, Engineering Site Managers, and IT teams.
  • Engaged with Engineering Site Managers to bridge the gap between IT and Engineering teams, ensuring seamless integration and alignment with operational needs. Interacted directly with CIOs to understand their strategic goals and incorporated feedback to enhance the product's effectiveness.
  • Translated key scientific findings into AI-driven business use cases, solving complex problems through the implementation of Safety Oculus. Achieved a dynamic Video AI analytic alert system, ensuring real-time safety monitoring and intervention.
  • Recognized with the Deliver with Focus Award by GE Power CIO for outstanding contributions to the successful implementation of Safety Oculus.

Serial Number Detection - Deep Learning based object detection process – YOLOv5, Mask-RCNN to detect serial numbers embossed on industrial parts. Driving the development of object detection pipeline and streamlining it into production in AWS and IOT ecosystem in collaboration with GE Research

Tax Type Classification - ML and RPA based utility which classifies the type of transaction based on details of the transaction from the ledger. Designed and developed the multiclass classification model using Categorical Boosting.

Timeline

Senior Data Scientist

General Electric - GE
04.2021 - 09.2022

Data Engineer

General Electric - GE
06.2019 - 04.2021

Data Engineering Specialist

General Electric - GE
06.2017 - 06.2019

Intern

GE Digital

Master of Science - Computer Science

Concordia University

Bachelor of Science - Computer Science

Amrita University
  • Completed Analytics Engineering Program Certification offered by GE Research by successfully delivering business critical predictive analytics solution.
  • Certified in Business Essential Leadership Skills – BELS offered by GE Corporate.
Saketh Oppula