Results-driven IT professional with approximately 6 years of experience in the Insurance, Aviation, Banking, and pharmaceutical sectors. Over 3 years of recent expertise in Azure Data Factory, Azure Databricks, Azure SQL, Azure Data Lake Gen1/Gen2, and Delta Lake. Proficient in designing and developing robust data ingestion pipelines using Azure Data Factory, as well as Spark applications via PySpark within Databricks for comprehensive data extraction, transformation, and aggregation across multiple file formats to meet business requirements. Experienced in building Enterprise Data Warehouses and Data Marts, implementing dimensional modeling (Star/Snowflake schemas), and developing logical/physical data models in Azure Synapse (Dedicated SQL Pool DW). Skilled in end-to-end business and system analysis, ETL solution design, release/change management, requirements gathering, process flow design, production support, testing, and detailed deliverable documentation. Demonstrated proficiency throughout the complete software development lifecycle (SDLC), including database design/modeling, development, and deployment for various business intelligence and data warehouse/data mart (ODS) initiatives. Adept at performing business requirements gathering, development, implementation, and comprehensive technical documentation, including source-to-target mapping. Experienced in utilizing reusable/non-reusable transformations such as XML, Normalizer, Expression, Aggregator, Lookup, Union, Joiner, Filter, and Stored Procedure. Familiar with bulk, normal, and incremental loads, and adept at implementing data cleansing, profiling, and validating test plans to ensure successful data loading processes. Competent with scheduling tools like Control-M and Autosys, and managing diverse data sources including relational databases, flat files (fixed/delimited), Parquet, and Delta formats. Skilled at performance tuning for Databricks notebooks and ADF pipelines, overseeing code migration across environments, maintaining project planning through sprint work items, and providing thorough documentation for UAT, test cases, and data validation.
Title: Azure Data Engineer
PySpark, SQL, Azure ADF, Azure Databricks, Power BI, SSRS, JIRA, Microsoft Word, Excel, PowerPoint, Azure SQL, Oracle 11g/10g, SQL Server 2010, Windows, UNIX, Oracle Linux