Summary
Overview
Work History
Education
Skills
Certification
Major Accomplishments - Citibank
Timeline
Generic
Raj Deb

Raj Deb

Whitby,Canada

Summary

Dynamic Senior Data Engineer with 16 years of extensive Canadian experience, currently working at Citibank Canada, leading a team to innovate and optimize big data platforms. Expert in Spark programming and data pipeline design, achieving a 30% improvement in analytics performance. Adept at managing client expectations and driving operational efficiency through strategic initiatives.

Overview

19
19
years of professional experience
1
1
Certification

Work History

Senior Data Engineer

Citibank Canada
Whitby, Canada
05.2017 - Current
  • Currently working as a Data engineer Lead managing the TTS vanguard Cloudera Hadoop Stack, managing a team of 6 engineers, doing full end to end support, maintenance, innovation of the big data platform across full SDLC from dev to SIT to UAT to Prod
  • Built Spark-based ETL pipelines to process terabytes of structured and unstructured data
  • Designed data ingestion pipelines with Kafka and Apache Spark, improving data availability and consistency
  • Developed Couchbase XDCR replication strategies for multi-region data availability
  • Optimized HBase and Hive queries, improving analytics performance by 30%
  • Developed microservices in Java and Python to interact with Couchbase and Hadoop
  • Integrated Neo4j graph queries into applications for relationship-based analytics
  • Worked on Couchbase SDK optimizations, reducing memory overhead in data-heavy applications

Senior Technical Manager :Dev and Support

CIBC
04.2006 - 05.2017
  • Company Overview: 4th largest Canadian bank
  • Hands on Experience with troubleshooting all Production issues, minor or major, during and after Business hours by collaborating with various technology teams within the bank: Networking, Infrastructure, Development and database teams, with the sole purpose of getting the issues resolved on time and within SLA so as to have minimum impact to the end user
  • Was involved hands on, on finding and executing the Root cause analysis on a major Production Issue with a Trading application called TLM (Trade Lifecycle Manager)
  • There were over 2000 Trades that failed and went into exception state
  • Took the initiative in involving various technical teams to join the conference bridge to troubleshoot and resolve the issue before end of day
  • Subsequently found out Root cause with the code and helped in the resolution of the issue
  • Implemented various cost saving and business efficiency initiatives, through hardware and software rearchitecture, best practices and vendor negotiations, resulting in 30% cost reduction in operations
  • Manage client expectations, anticipates operational and tactical risks and tracks them
  • Manage project closure initiatives, such as client satisfaction survey and closure analysis
  • Document and archive project activities, deliverables, tools and findings for future projects
  • 4th largest Canadian bank

Education

Bachelor of Engineering -

University of Toronto
01.2005

Skills

  • Java
  • Python
  • Scala
  • Spark Programming with Scala
  • Cloudera Big data Hadoop stack: CDP71
  • RDBMS: Oracle
  • NoSQL: MongoDB
  • Couchbase
  • No-SQL databases: Apache Hbase
  • Elastic Search
  • Data warehouse: Apache Hive
  • Snowflake
  • Python programming
  • ETL development
  • Data pipeline design
  • Data modeling
  • API development

Certification

  • AWS Certified Solutions Architect: SAA-C02, 01/01/21
  • AWS Certified Developer Associate, 02/01/21

Major Accomplishments - Citibank

  • Designed and implemented a high-performance Couchbase cluster to support low-latency data retrieval for real-time applications.
  • Architected Hadoop & Spark-based data lakes for large-scale ETL and machine learning workloads.
  • Developed a hybrid NoSQL-graph model integrating Couchbase and Neo4j for advanced relationship-based queries.
  • Optimized Couchbase indexing and query performance, reducing query response time by 40%.
  • Implemented data partitioning, sharding, and caching strategies for distributed systems.
  • Collaborated with engineering teams to migrate on-prem Hadoop clusters to AWS EMR, reducing operational costs.

Timeline

Senior Data Engineer

Citibank Canada
05.2017 - Current

Senior Technical Manager :Dev and Support

CIBC
04.2006 - 05.2017

Bachelor of Engineering -

University of Toronto
Raj Deb