Senior Data Engineer with 12+ years of experience in full-lifecycle development, including requirements gathering, data pipeline design, ingestion frameworks, data mart development, and KPI implementation. Proven track record across domains including insurance, automotive, stocks, media, and e-commerce. Over 8 years of experience leading and supporting agile teams working with Big Data technologies, including Spark and Hadoop (on-premises and cloud). Skilled in AWS (EMR, EC2, S3, Redshift) and GCP (BigQuery, Dataproc, Composer/Airflow, GCS, Cloud Functions). Experienced with Microsoft Azure (ADLS, ADF, Blob Storage, Databricks, Delta Lake) and CI/CD using Azure DevOps. Expertise in NoSQL databases (MongoDB) and data warehouse (DWH) applications. Developed a CSV ingestion framework using PySpark. Strong focus on performance tuning of long-running queries. Hands-on experience with Informatica and with batch and real-time ETL processes.
Databricks Certified Associate Developer for Apache Spark 3.0