Summary
Overview
Work History
Education
Skills
Websites
Timeline
Generic

Parul Dhawan

Brampton

Summary

Data, AWS & GCP certified Customer Solutions architect Big Data Solutions, AWS x1 & Google Cloud Certified x2(PSA) Cloud expert with over 10 years of practical experience in leading Medium to Large scale engagements in Data and Cloud Architect, ETL, Big Data, AWS and Google Cloud Migrations and Management Technical expertise encompasses Big Data and Cloud Technologies – GCP, Hadoop, Yarn, MapReduce, Hive, HBase, Apache Kafka, Confluent Kafka- Azure, Apache Spark, Apache Storm, Sqoop, SQL & AWS & GCP. My core strengths lie in Data Engineering, Cloud Data Modernization, and Analytics Platforms, where I have worked with technologies like Cloudera, Data Bricks, AWS & Google cloud Platform. I have successfully overseen the implementation of expansive Data migration and transformations in different domains including Financial Services, Taxation, Telecommunication.

Overview

11
11
years of professional experience

Work History

Customer Solutions Architect

CGI Canada Inc
01.2020 - Current
  • I serve as a Solutions Architect, where I define a technical vision and collaborate with delivery teams to actualize it
  • My primary emphasis lies in supporting a prominent Canadian telecom company during its digital modernization initiative
  • My current role involves
  • Concentrated effort on migrating to Google Cloud Platform (GCP) and harnessing native cloud solutions like BigQuery, Big Table, Composer/Airflow, Data proc Spark Serverless, Cloud Run, Cloud Functions and DataFlow.
  • Design and craft solutions that precisely meet Client's requirements
  • This involves selecting suitable technologies, accelerators and designing data pipelines.
  • Work closely with diverse teams, including data scientists, engineers, and business analysts, to ensure the seamless delivery.
  • Provide technical advice and expertise to clients and internal teams, aiding their comprehension of the advantages and limitations of various technologies.
  • Collaborate with the customer success team to ensure client satisfaction with provided Data & AI solutions and explore possibilities for expanding engagement.

Senior Consultant

CGI Canada Inc
01.2020 - 11.2022
  • Implementing, managing, and administering the on-premises Hadoop infrastructure (HDFS, Yarn, Map reduce, Kafka, zookeeper, Hive, HBase, Storm, Spark, Ranger, Oozie, Ambari).
  • Working as Kafka SME to assist Application team onboard Kafka project, troubleshooting issues and support in production implementation.
  • Experience working with deploying Hadoop clusters, Maintaining, troubleshooting, Adding, and Removing nodes ensuring high availability.
  • Patching/Upgradation of existing Hadoop environments: HDP from 3.1.0 to 3.1.5 & HDF from 3.4.1 to 3.5.1
  • Migration from on premise Apache Kafka to Confluent Kafka in Azure
  • Maintaining the authentication through API-keys and secrets
  • Set up Replicators & connectors and manage users through confluent-ldap-sync
  • Monitoring Kafka Topics and replicators in C3
  • Experience working with ccloud shell.
  • Expertise with Kafka brokers management, up-gradation, Zookeeper coordination, Kafka topics and Disk management, Authentication, MIT Kerberos set up, and SSL.
  • Experience working with Azure CI-CD Pipelines for the streamlined and automated process of managing the Kafka Topics (creation, alteration of configs & deletion of Kafka Topics)
  • Integrated Active directory, identity management for authorization of users and groups.

Big Data Engineer - Hadoop/Kafka Consultant

Deloitte Consulting US-India
10.2017 - 11.2019
  • Project: A complete solution built for a "Billion Dollar Technology Giant” by ingesting existing legacy systems data from multiple domains into a single enterprise analytics platform
  • Tools & Technologies: Hortonworks, Hdfs, Apache Kafka, Apache Spark, Hive, Talend BD, TAC, ORC File Format, AWS, MySQL, ALM, Scala
  • Operated as Lead Developer/Software Developer aiding the development team in designing the data pipeline.
  • Proficient in managing project related activities involving planning, execution, testing, data lineage tracking, and management in all aspects of the data life cycle.
  • Built and maintained a large-scale Kafka platform (including components from the wider Kafka ecosystem) to support a range of big data streaming applications.
  • Worked in close collaboration with Architects and other cross-functional team members in creating design documents, outlining overview, diagrams, technical requirements, and solution that best addresses a business need.
  • Created Data pipeline using Talend ETL to extract data from source systems, transform, load into Hive tables and scheduled through TAC.
  • Managed F2F sessions involving stakeholders, technical and development team during development stages, hence incorporating client involvement and excellent project delivery.
  • Ensured applications are free of common coding vulnerabilities and completed unit and integration testing per standards and design specs.

Senior Software Engineer

Tavant Technologies
10.2016 - 09.2017
  • Project: Crafted a data pipeline for migrating legacy data for over 1 billion individuals & businesses for US-based global information services and credit rating provider
  • Tools & Technologies: Cloudera, Hdfs, Apache Hive, Parquet File Format, Amazon S3, Apache Spark, MySQL, Oozie Scheduler, Jira, Confluence and Scala
  • Worked on upgradation of Existing Systems of Experian to a Big data Based Ecosystem (AWS, spark-Scala, Hive) from IBM based Ecosystem (DB2, DataStage), built pipelines using Spark and Scala for archived data
  • Redesigned code to be modular which could cater to upcoming requirements and assist in financial viability and revenue generation
  • Worked with the Team to create and enhance the technical foundation for the project wherein Engineered Oozie workflows/Coordinators for automating the data load process

System Engineer

Infosys
11.2014 - 10.2016
  • Project: Implementation of a Project Involving the Redesign, Restructuring, and Transformation of the World’s Largest & Most Populous Democracy's Taxation System
  • Tools & Technologies: Hortonworks, Hdfs, Apache Storm, MySQL, NoSQL HBase, Java
  • Administered the real-time data stream of Payloads through Apache Kafka, developed storm containers-spouts and bolts for Data Validation and storing into NoSQL Databases (HBase)
  • Designed and built production data pipelines from ingestion to consumption within a hybrid big data architecture using Java and Python
  • Led with example by initiating innovative solutions to Big Data issues and challenges within the team

Software Engineer

Ericsson India Global Services
05.2013 - 09.2014
  • Tools & Technologies: Cloudera, Hadoop, Hdfs, Yarn, Zookeeper, Sqoop, Flume, Hive, MySQL, Java
  • Migrated data from legacy database to Hadoop using Sqoop, Designed Data model, defined schemas in Hive
  • Used Task Scheduler Oozie to schedule MapReduce jobs

Education

GCP Certified Solutions Architect – Professional -

07.2023

GCP Certified Solutions Architect – Associate -

10.2022

AWS Certified Solutions Architect -

10.2021

Certified Developer for Apache Spark by Databricks -

07.2017

Bachelor of Technology-Electronics & Communication Engineering -

Kurukshetra University
06.2012

Skills

  • Cloud Platforms – GCP & AWS
  • Hadoop stack - Cloudera
  • Cloud migration
  • Cloudera stack - Sqoop, Hive, Impala, Spark, Kafka, Oozie, Ranger
  • NoSQL Database - HBase
  • Workflow Automation
  • Scripting Language - Linux
  • GCP – Big Query, GCS, Cloud Run, Cloud functions, Dataflow, DataProc, Bigtable, Composer/Airflow
  • Confluent Kafka
  • CI-CD Pipelines
  • Team Management
  • Data Architecture
  • Client Engagement
  • ETL
  • Scheduling and Workflows - Oozie & TAC

Timeline

Customer Solutions Architect

CGI Canada Inc
01.2020 - Current

Senior Consultant

CGI Canada Inc
01.2020 - 11.2022

Big Data Engineer - Hadoop/Kafka Consultant

Deloitte Consulting US-India
10.2017 - 11.2019

Senior Software Engineer

Tavant Technologies
10.2016 - 09.2017

System Engineer

Infosys
11.2014 - 10.2016

Software Engineer

Ericsson India Global Services
05.2013 - 09.2014

GCP Certified Solutions Architect – Professional -

GCP Certified Solutions Architect – Associate -

AWS Certified Solutions Architect -

Certified Developer for Apache Spark by Databricks -

Bachelor of Technology-Electronics & Communication Engineering -

Kurukshetra University
Parul Dhawan