
Data Engineer with 5+ years of experience designing, building, and supporting scalable ETL/ELT pipelines on Azure. Strong hands-on expertise in Azure Data Factory, Azure Databricks, and Microsoft Fabric, with solid programming skills in SQL, Python, and PySpark. Experienced in data integration, schema design, data quality validation, and governance to deliver reliable, analytics-ready datasets, including preparing curated data for Power BI reporting. Proven ability to optimize performance, support production systems, and collaborate effectively with data scientists, analysts, and business teams.
Programming and querying: Python, SQL, PySpark, SparkSQL
Data engineering and integration: Azure Data Factory, Azure Databricks, Microsoft Fabric (Lakehouse, Pipelines), ETL/ELT pipelines
Databases and storage: Microsoft SQL Server, PostgreSQL, Azure Cosmos DB, Azure Data Lake Storage, Azure Blob Storage
Analytics and reporting: Power BI, Microsoft Excel
Data architecture and governance: Data lakes, lakehouse architecture, medallion architecture (Bronze/Silver/Gold), schema design, data dictionaries, data quality and governance
Performance and operations: Pipeline monitoring, performance optimization, production support, SLA management
DevOps and version control: Git, Bitbucket, CI/CD pipelines