Summary
Overview
Work History
Education
Skills
Certification
Research publications
Timeline
Generic

Praveen Dominic Dharmalinga Pandian

Scarborough,Canada

Summary

Dynamic Tech Lead and Sr. Data Engineer with a proven track record at Tech Mahindra Ltd., specializing in optimizing data processes and leading high-impact projects. Expert in PySpark and SparkSQL, with a knack for mentoring teams. Demonstrated success in reducing data processing times and enhancing data-driven decision-making, showcasing both technical mastery and leadership excellence.

Overview

12
12
years of professional experience
1
1
Certification

Work History

Tech Lead (Sr. Data Engineer)

Tech Mahindra Ltd.
Toronto, Canada
07.2023 - Current
  • Company Overview: Client: Kroger
  • Optimized Spark and Azure Data Factory jobs, reducing processing time for 10Bn+ data points to under a minute
  • Led a data integration project enabling cost analysis for 500K+ products
  • Developed dashboards using Tableau & Databricks for business insights
  • Built & optimized data pipelines using PySpark, SparkSQL, and Unity Catalog
  • Mentored a team of 10+ engineers, organized tech guilds, and led training sessions on distributed systems
  • Client: Kroger

Data Science Analyst Intern

Justo Global
10.2022 - 06.2023
  • Company Overview: Remote
  • Built ML-based classifiers (NSFW content, toxic speech, spam detection, grammar correction)
  • Designed and implemented ETL pipelines with Databricks, Azure SQL & Power BI
  • Configured Google Analytics and created dashboards for user engagement tracking
  • Remote

BI Consultant

Galgotias University
, India
07.2017 - 05.2021
  • Company Overview: India
  • Developed ELT pipelines using PySpark & SparkSQL, processing data from third-party LMS systems
  • Created multiple dashboards (student analytics, financial reports, faculty workload monitoring)
  • Led website development, requirement gathering, and feasibility analysis
  • India

Application Development Sr. Analyst

Accenture
, India
05.2013 - 08.2017
  • Company Overview: Client: Maersk Oil
  • Developed ETL workflows using Databricks & Python, integrating SAP ECC & Oracle
  • Migrated SAP BW ETL flows to BW4HANA, optimized SQL-based transformations
  • Automated background job logs, enhanced performance of process chain loads
  • Client: Maersk Oil

Education

M.Sc. - Big Data Analytics

Trent University
ON, Canada
09.2022

M.S. - Software Engineering

VIT University
India
05.2013

Skills

  • Programming & Scripting: Python (PySpark, Pandas, NumPy, Scikit-learn), SAP ABAP
  • Databases & Querying: SparkSQL, MySQL, MongoDB
  • Big Data & ETL: Apache Spark, Azure Data Factory, SAP BW, HDFS, Hive, Pig
  • Machine Learning & AI: Regression Models, KMeans, NLP, Azure ML, LLMs (OpenAI, Azure AI Search, Databricks DBRX)
  • Visualization & Analytics: Tableau, Power BI, Google Analytics
  • Cloud & DevOps Tools: Databricks, Docker, Git, Jira, VS Code, MS Excel

Certification

  • Tableau Desktop Specialist
  • Microsoft certified: Azure AI fundamentals
  • SAP Certified development Associate - ABAP with SAP NetWeaver 7.0
  • Speaker at Tech Guilds & Knowledge Transfer Sessions on Apache Spark & Databricks
  • ACE (Accenture celebrates excellence) and STAR of the month awards.

Research publications

  • Multilingual Sentiment Analysis using Deep Learning, IEEE Xplore, 2023
  • Stock Market Prediction using Deep Learning, Bentham Science, 2019
  • AI & Predictive Analytics in IoT-based Surgery, Elsevier, 2019

Timeline

Tech Lead (Sr. Data Engineer)

Tech Mahindra Ltd.
07.2023 - Current

Data Science Analyst Intern

Justo Global
10.2022 - 06.2023

BI Consultant

Galgotias University
07.2017 - 05.2021

Application Development Sr. Analyst

Accenture
05.2013 - 08.2017

M.Sc. - Big Data Analytics

Trent University

M.S. - Software Engineering

VIT University
Praveen Dominic Dharmalinga Pandian