Summary
Overview
Work History
Education
Skills
Certification
Languages
Timeline
Generic

Lovepreet Singh

Calgary

Summary

Experienced Data Engineer with a strong analytical mindset and expertise in SQL, PySpark, Databricks, and Power BI. Skilled in designing scalable data pipelines, optimizing ETL workflows, and managing large datasets across various databases. Proven ability to drive user growth, enhance retention, and implement data analytics best practices. Adept at uncovering actionable insights through in-depth analysis and delivering error-free reports using Power BI, DAX, and SQL Server. Passionate about leveraging big data technologies to enable data-driven decision-making for key stakeholders.

Overview

7
7
years of professional experience
1
1
Certification

Work History

Senior Data Engineer

LTIMindtree
01.2024 - Current
  • Ingesting structured and unstructured data from various databases into Azure Data Lake Storage (ADLS Gen2) using Azure Data Factory (ADF).
  • Mounting ADLS in Databricks and transforming large datasets using PySpark, optimizing performance for scalability and efficiency.
  • Processing massive datasets on a 95-node Spark cluster, implementing partitioning, caching, and optimized joins for improved query performance.
  • Leveraging SQL within Spark SQL and Delta Lake for complex transformations, aggregations, and analytical queries.
  • Optimizing Delta Lake performance by implementing Z-Ordering, partitioning, and data compaction for efficient querying.
  • Monitoring and troubleshooting Spark jobs, improving execution time by tuning shuffle partitions, broadcast joins, and cluster configurations.
  • Building Power BI reports and dashboards, transforming raw data into actionable insights by leveraging DAX, Power Query, and SQL.
  • Using Python Pandas and PySpark for data exploration, aggregations, and complex transformations before storing results in Delta Lake

Data Analyst

Jupiter Synergies Canada Inc.
07.2022 - Current
  • Developed ETL pipelines using SQL and PySpark in Databricks, automating data ingestion, transformation, and reporting processes.
  • Designed and optimized SQL queries for trend analysis, data validation, and performance tuning in SQL Server and Databricks.
  • Built Power BI dashboards by integrating SQL Server, Databricks, and Excel, providing real-time insights into employee attendance and HR metrics.
  • Implemented DAX formulas to calculate key HR metrics, such as overtime, absenteeism, and payroll adjustments, improving reporting accuracy.
  • Automated data workflows using Python and SQL, reducing manual reporting efforts and enhancing data processing efficiency.
  • Created interactive reports and visualizations in Power BI, leveraging DAX measures and SQL queries to improve decision-making.

Machine Learning Engineer

Omdena
03.2021 - 06.2022
  • Actively contributed to all project phases like Data Collection, Cleaning, Exploratory Data Analysis, and Model Building
  • Collected, processed, and labeled a custom data set, training and testing multiple state-of-the-art NLP models
  • Successfully led a team of 20 from various countries for the task of Data collection and Data cleaning
  • Through active collaboration and well-organized team management, the project went successful without hassle

Data Analyst

Freelance (Volunteer)
08.2019 - 03.2021
  • Conducted data analysis and transformation using SQL and Power BI, ensuring accurate insights for decision-making.
  • Developed interactive dashboards in Power BI, leveraging DAX functions for KPI tracking and trend analysis.
  • Optimized data workflows by automating data extraction, cleaning, and transformation using SQL and Python.
  • Created complex DAX measures to enable time-based comparisons and advanced data modeling for business insights.

Market Data Analyst

XSEED Education Pvt. Ltd.
07.2018 - 07.2019
  • Gathered, stored, and retrieved data using SQL Server, ensuring data accuracy and consistency for analysis.
  • Developed Power BI dashboards with DAX measures to analyze customer behavior and support marketing strategies.
  • Automated data workflows using Azure Data Factory (ADF), streamlining data extraction, transformation, and loading (ETL) processes.
  • Created and distributed reports using SSRS and Power BI, improving stakeholder decision-making with actionable insights.
  • Analyzed marketing trends and competitor activities by leveraging past data, and driving optimized sales and marketing strategies.

Education

Post-Graduation - Data Analysis

Northern College
Scarborough, ON
12.2021

Master of Business Administration (MBA) -

Lovely Professional University
05.2018

Bachelor of Technology -

Punjab Technical University
05.2016

Skills

  • Power BI
  • Microsoft SQL Server
  • Python
  • Databricks
  • PySpark

  • Azure Data Factory
  • Big data processing
  • Hadoop ecosystem
  • Big data technologies
  • Data visualization

Certification

  • SQL and Relational Database issued by Cognitive Class
  • Python for Data Science by IBM
  • Microsoft Power BI Desktop for Business Intelligence
  • Azure Data Factory By LinkedIn
  • Data Modeling in Power BI By Microsoft
  • Excel Power Tools For Data Analysis By Macquarie University
  • Advanced SQL Retrieval Queries in SQLiteStudio By Coursera

Languages

English
Full Professional
Hindi
Native or Bilingual
Punjabi
Native or Bilingual

Timeline

Senior Data Engineer

LTIMindtree
01.2024 - Current

Data Analyst

Jupiter Synergies Canada Inc.
07.2022 - Current

Machine Learning Engineer

Omdena
03.2021 - 06.2022

Data Analyst

Freelance (Volunteer)
08.2019 - 03.2021

Market Data Analyst

XSEED Education Pvt. Ltd.
07.2018 - 07.2019

Bachelor of Technology -

Punjab Technical University

Post-Graduation - Data Analysis

Northern College

Master of Business Administration (MBA) -

Lovely Professional University
Lovepreet Singh