
Vivek Veluswamy

Toronto, ON

Summary

Results-oriented professional with more than 5 years of experience handling data across multiple projects and roles. Proficient in PySpark for distributed data processing and ETL tasks, with a solid understanding of data warehouses, cloud architectures, CI/CD pipelines, and ETL processes. Designs and develops dashboards and scorecards to support business needs, ensuring metrics are aligned across the organization. Has worked with both star and snowflake schemas to build cubes from fact and dimension tables, process them, and deploy them to SQL Server Analysis Services databases. Good knowledge of Azure, including Storage and Data Lake, for extraction, transformation, and loading. Extensive experience in data engineering, data analysis, and data visualization using Python libraries and related tools. Produces specialized reporting packages, dashboards, and scorecards that enhance BI for the management leadership team.

Overview

6 years of professional experience

Work History

Data Engineer

Capgemini
11.2022 - Current
  • Migrated 20 years of data from on-premises systems to Snowflake cloud within Azure infrastructure
  • Designed and implemented scalable ETL processes using PySpark for processing large volumes of data, resulting in a 30% improvement in processing time
  • Structured the pipeline into three stages (Ingest, Curate, and Publish), all running within the Azure cloud environment
  • Scheduled jobs in Azure Data Factory to automate these three stages
  • Collaborated with business analysts to define key performance indicators (KPIs) and developed automated reports and dashboards to track and visualize these metrics within Azure ecosystem
  • Created and maintained reports and dashboards using Power BI within the Azure environment
  • Used Azure Data Factory’s built-in error handling mechanisms to manage exceptions and failures during data processing
  • Configured error handling policies, alerts, and notifications to detect and respond to errors promptly, improving data quality and reliability
  • Worked closely with stakeholders to understand their needs and presented data insights and solutions using Power BI within the Azure cloud.

Data Engineer

Wipro Ltd
04.2020 - 05.2021
  • Designed and implemented large-scale data migration from an on-premises Oracle database to Snowflake Cloud Data Warehouse using Azure
  • Developed and maintained PySpark scripts for end-to-end ETL workflows, including data extraction, transformation, and loading into data warehouses
  • Ingested data into Azure services such as Azure data warehouse, Azure data storage, Azure Data Lake, and Azure SQL, and processed the data in Azure Databricks
  • Using Power BI, created accounting reports including Year-To-Date, Month-To-Date, and Period Close reports, Transaction and Snapshot reports for General Ledger, and Budget reports
  • Used Snowflake's Snowpipe feature to integrate live data sources for timely analysis and decision support in the production environment
  • Familiar with Snowflake Streams for change data capture (CDC) and real-time data integration, supporting event-driven architectures and real-time analytics
  • Used Snowflake's Zero-Copy Cloning feature to create development and testing environments without duplicating data, optimizing resource utilization and accelerating project timelines
  • Created calculated columns and measures in Power BI and Excel using DAX, depending on requirements.

Data Analyst

Sutherland
02.2018 - 04.2020
  • Leveraged Azure cloud services for data storage and processing to generate performance and client reports
  • Developed complex SQL queries using joins and sub-queries to retrieve data for reporting purposes
  • Provided support for SQL queries to analyze and review test cases, reports, and other data-related tasks
  • Collected, cleansed, and analyzed structured and unstructured data from Azure Blob Storage and other Azure data sources to provide insights for business decision-making
  • Built fact tables and curated layers in Snowflake within Azure cloud infrastructure to meet client report requirements, ensuring scalability and performance
  • Utilized Power BI on Azure to design and deploy an innovative diagnostic performance dashboard, enabling real-time tracking of live reports and enhancing decision-making processes.

Education

POST GRADUATION IN DATA ANALYTICS FOR BUSINESS -

St. Clair College, Windsor, Ontario, Canada

BACHELOR OF ENGINEERING -

Anna University

Skills

  • Data Warehousing - Snowflake
  • Big Data Processing
  • Workflow Management - Apache Airflow
  • Databases - PostgreSQL, MySQL, Oracle, and SQL Server
  • ETL Development - Talend
  • Data Pipeline Design
  • Spark Framework
  • Data Visualization & Reporting - Tableau, Power BI
  • Azure Data Factory
  • Python - Pandas, PySpark, NumPy
  • Agile Methodologies (Scrum)
  • Data Quality Management
  • AWS/Azure Cloud Architectures
  • CI/CD Pipelines - GitHub, Jenkins

Languages

English - CELPIP CLB 7
