Summary
Overview
Work History
Education
Skills
References
Projects
Timeline
Generic

Andy Seo

Toronto,ON

Summary

Team-oriented Data Engineer with expertise in designing, implementing, and optimizing data pipelines for efficient and scalable data processing. Possesses strong analytical skills, excellent problem-solving abilities, and deep understanding of database technologies and systems. Equally confident working independently and collaboratively as needed

Overview

2
2
years of professional experience

Work History

Data Engineer

RBC Investor Services
05.2022 - Current
  • Built ETL Pipelines for RBC IS data assets(Custody/Pensions/Derivatives/Securities/Transactions) both on-premise and cloud databases utilizing Airflow and Databricks. Owner of 20+ data assets.
  • Led end-to-end implementation of multiple high-impact projects from requirements gathering through deployment and post-launch support stages.
  • Collaborated with data scientists to develop machine learning models by providing necessary data infrastructure and API endpoint. (RBC chat bot IVA)
  • .Fine-tuned query performance and optimized database structures for faster, more accurate data retrieval and to meet SLA
  • Increased efficiency of data-driven business decision-making by creating RBC's enterprise data packages.
  • Implemented and employed data quality checks to ensure compliance with RBC data governance policies.
  • Utilized Elasticsearch dashboards and Airflow for monitoring and troubleshooting production issues.

Data Engineer

Jarvis Consulting Group
01.2022 - 05.2022
  • Automated scripts for node usage information to be stored on the Jarvis database to help the Linux Cluster Administrator team.

Education

Bachelors of Science - Statistics

University of Toronto
04.2021

Skills

  • Python/PySpark
  • SQL
  • Airflow
  • Kafka
  • Azure Databricks
  • AWS S3
  • Hadoop(HDFS)
  • Agile
  • Machine Learning
  • GIT
  • Docker
  • Java
  • Bash
  • ELK Stack
  • Great Expectations
  • Jenkins/Helios/OCP
  • IDP/Bento(RBC)
  • CI/CD
  • ETL development
  • Data Warehousing
  • Data Modeling
  • Data Pipeline Design
  • Big Data Processing
  • Scripting Languages
  • Performance Tuning
  • Data Governance
  • API Development
  • Data Quality Assurance
  • Relational databases

References

  • Ankur Tyagi, Director of Data Engineering(RBC IS), ankur.tyagi@rbc.com
  • Gurjeet Kaur, Associate Director Data Engineering(RBC IS), gurjeet.x.kaur@rbc.com

Projects

RBC Polaris - Enterprise Data Solutions 

- Centralized data repository for RBC IS that serves as an API endpoint for RBC applications

- Front-end UI for RBC internal clients(Data Analyst, Business Analysts, Data Scientists,..etc)


Timeline

Data Engineer

RBC Investor Services
05.2022 - Current

Data Engineer

Jarvis Consulting Group
01.2022 - 05.2022

Bachelors of Science - Statistics

University of Toronto
Andy Seo