Data Engineer with 4+ years of experience building scalable, cloud-native data platforms that drive business intelligence and automation. Proven ability to lead teams in adopting modern tools like GitLab, FiveTran, and DBT to accelerate delivery and improve data quality. Skilled in creating end-to-end data pipelines, orchestrating ETL workflows, and enabling real-time analytics using cloud and open-source technologies.
Overview
4
4
years of professional experience
Work History
Data Engineer
CGS (Computer Generated Solutions)
07.2024 - 07.2025
Led migration from legacy systems to modern platforms, ensuring data integrity and minimal downtime.
Built end-to-end ETL pipelines using Informatica Cloud to automate and streamline data workflows.
Developed pipelines to extract data from MongoDB and load into Amazon S3, improving cloud storage and accessibility.
Actively migrating data from Amazon S3 to Snowflake using Fivetran, enabling more scalable and performant analytics infrastructure.
Orchestrated data pipelines and migration workflows using Apache Airflow to ensure reliability and automation.
Optimized ETL workflows, reducing data processing time by 40% and improving data accuracy by 15%, resulting in faster reporting cycles.
Implemented automated monitoring to ensure ETL reliability.
Designed data models with DBT and exposed them via Starburst for analytics consumption.
Co-developed Pitch Builder, automating personalized pitch deck creation for sales reps and reducing manual prep time by 80%
Led team-wide GitLab adoption, reducing deployment errors by 30% and increasing CI/CD pipeline efficiency by 50%
Established repository structure, access controls, and workflow governance.
Built and deployed CI/CD pipelines to automate builds, testing, and deployments.
Created and enforced GitLab best practices to promote consistency, code quality, and team productivity.
Created dashboards in Tableau to support operational and strategic decision-making.
Data Engineer
Cooke Inc.
05.2021 - 07.2024
Developed ELT pipelines to ingest and process data from various sources, reducing latency by 30%.
Enabled real-time data ingestion and activation via Salesforce CDP for personalized marketing.
Managed orchestrations using Azure Data Factory for dependable data flow.
Built automated quality checks and alerting for pipeline health monitoring.
Designed and implemented star and snowflake schema models for data warehousing.
Automated deployments using Azure DevOps CI/CD pipelines.
Developed Python scripts to automate data enablement and enrichment tasks.
Wrote complex stored procedures to fulfill business logic requirements.
Applied data governance standards to improve quality and consistency.
Used DBT, Python, and SQL to clean and transform data for analytics readiness.
Education
Post-Graduate Diploma - Information Technology Professional