Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Tejaswini Nalubala

Toronto,Canada

Summary

Data engineer with 11 years of experience designing and implementing data applications using cloud services. Proficient in constructing data ingestion and ETL pipelines of scale. Extensive experience in schema design, data ingestion, transformation, data governance, aggregation, optimization strategies, and process automation in data lakes/warehouses. Strong background in programming, data structures, and algorithms with hands-on experience in Python and SQL

Overview

11
11
years of professional experience
1
1
Certification

Work History

Principal Data Engineer

Oracle
12.2021 - Current
  • Implemented advanced frameworks for data ingestion in PySpark, enhancing Oracle's internal platform, created data pipelines for finance projects, which streamlined metadata customization and improved processing speed by 30%, saving 15 hours weekly
  • As part of Cerner OI into Oracle Health, built data pipelines to migrate the future revenue and billing data from the existing systems, cleaned/wrangled it, and made it available on Exadata DB and Object Storage for the business DV reporting and ML workloads within 2 weeks, reducing the estimated time by a month
  • This ensured smooth operational integration and earned me the Platinum Recognition Award in the organization
  • Managed task and project planning, worked with multi-functional teams and mentored teams while building data ingestion and processing solutions utilizing Oracle OCI infrastructure i.e DataFlow (Serverless Spark), Object Storage, and ADW (Autonomous Data Warehouse)
  • Championed and executed the transition to Oracle Cloud Infrastructure (OCI) services, strategically selecting tools and services for specific application needs; reduced operational costs by 20% and enhanced deployment speed by 35%
  • Incorporated security and governance for the ETL pipelines using tags and policies in Object Storage, with Terraform IaC on Oracle OCI, reducing manual allocation of roles and increasing resource utilization 40% faster
  • Implemented CI/CD pipelines for data workflows, reducing deployment times by 40% and increasing data processing efficiency by 30%
  • Currently working on transitioning the existing financial revenue solution module, which is residing on ODI, to a distributed data driven architecture using OCI DataFlow and Object Storage to achieve a one-day close for Oracle Wall Street reporting.

Senior Data Engineer

Oracle
08.2016 - 12.2021
  • Spearheaded API creation using Python and PySpark for data ingestion from source systems into the Delta Lake to be used across teams in the organization, reducing data latency by 40% and enhancing system scalability, utilizing Oracle OCI
  • As part of Oracle order to revenue migration to cloud from on-premise, which boosted the performance of the revenue composition solution load plan by 70% with the new data model and the usage of Oracle cloud offerings, reducing the runtime from 15 hours to 2 hours
  • Designed the data model for the collection of data from Enterprise Financial Intelligence Revenue for all the cloud businesses of Oracle, enabling users to break revenue down to its composite parts and then creating reporting analytics for the business using Oracle Analytics and Oracle DV
  • Developed an end-to-end ETL Validation framework to ensure data quality at various stages in the ETL pipeline using pytest framework ensuring TDD, thereby increased testing efficiency by 30%
  • Collaborated with cross-functional teams to understand requirements and translate them into production-ready technical solutions.

Senior Systems Engineer

Infosys Limited
06.2013 - 07.2016
  • Designed data models and ETLs for an insurance and banking client to implement uniform business rules across solutions to generate reports on the sales revenue data of the company, thereby reducing bugs by 60%
  • Developed ETL using Informatica PowerCenter and Python
  • Deployed and patched the code using Jenkins, Git, and Vagrant machines
  • Worked on development, unit testing, and deployment processes and mentored newly onboarded team members in the technical aspects of the project, leading to a 30% improvement in performance due to reduction in bugs.

Education

Bachelor of Technology in Electronics and Communication Engineering -

SR Engineering college (now SR University)
Warangal, India
01.2012

Skills

  • Python
  • SQL
  • Spark
  • Delta Lake
  • Data Lakehouse
  • SparkSQL
  • OCI
  • OCI DataFlow
  • OCI Object Storage
  • Linux
  • ETL
  • Data Modelling
  • ODI
  • PySpark
  • Programming
  • Code Design
  • Leadership
  • Communication
  • ADW
  • CI/CD
  • Jenkins
  • Git
  • Data Ingestion
  • Data Transformation
  • Data Security
  • Data Governance
  • Programming
  • OCI functions
  • Problem solving
  • TDD
  • ETL development
  • Data Warehousing
  • Data Modeling
  • Data Pipeline Design
  • Performance Tuning
  • Business Intelligence

Certification

  • Oracle Cloud Infrastructure 2023 AI Foundations Associate certification, Oracle University, 2024, Demonstrated proficiency in Oracle Cloud services and AI concepts. Enhanced skills in designing scalable, cloud-based data architectures. Integrated AI and machine learning capabilities into data infrastructure.
  • AWS Certified Data Engineer – Associate, AWS, 2024, Demonstrated proficiency in designing and managing scalable data processing systems using AWS services.

Timeline

Principal Data Engineer

Oracle
12.2021 - Current

Senior Data Engineer

Oracle
08.2016 - 12.2021

Senior Systems Engineer

Infosys Limited
06.2013 - 07.2016

Bachelor of Technology in Electronics and Communication Engineering -

SR Engineering college (now SR University)
Tejaswini Nalubala