Summary
Overview
Work History
Education
Skills
Languages
Timeline
Generic

Jaewon Yun

Montréal,QC

Summary

Data Engineer with 5+ years of combined experience in data engineering, machine learning, and computational physics. Specialized in building, deploying, and maintaining scalable data processing pipelines using AWS, Snowflake, and Talend. Proficient in SQL and Python with a strong background in data integration, ETL processes, cloud infrastructure management, and deep learning. Skilled in optimizing data tasks for performance and cost, managing data access controls, and implementing CI/CD pipelines. Experienced in developing machine learning models and applying computational physics methods to solve complex problems.

Overview

5
5
years of professional experience

Work History

Data Integration Specialist

International Air Transport Association (IATA)
08.2023 - Current
  • Optimized Data Pipelines: Build, deploy, and maintain scalable data processing pipelines using Talend, S3, and Snowflake, optimizing tasks for performance and cost.
  • Talend Management: Manage Talend Data Integration jobs, overseeing platform setup, TMC administration, and environment configuration.
  • ETL Pipeline Development: Develop and maintain over 350 ETL pipelines using Python, Talend, and Amazon S3, ensuring efficient data extraction, transformation, and loading.
  • Data Governance: Write scripts with Liquibase for data correction and updates in Snowflake, ensuring data integrity and accuracy.
  • Interdepartmental Collaboration: Assist various departments with data integration needs, supporting seamless data movement into Snowflake for diverse organizational functions.
  • Data Management: Work with aviation data to enhance and ensure the reliability of aviation safety data.

Data Engineer

Branchy Solution
01.2022 - 08.2023
  • Cloud-Based ETL Pipelines: Designed and executed robust ETL pipelines using AWS Glue Studio, Lambda, and S3, improving data volume handling and enhancing data infrastructure scalability.
  • Data Processing with Databricks: Utilized Databricks with PySpark for efficient data processing, streamlining data transformation and load processes.
  • Data Modeling: Constructed data modeling strategies within AWS Redshift and RDS, ensuring maintainability and scalability of data warehouse systems.
  • Security Management: Managed IAM roles and security groups to strengthen data protection and ensure consistent system reliability.

Machine Learning Intern

University of Toronto
09.2022 - 03.2023
  • Deep Learning Model: Developed a physics based deep learning model using Python to study how atoms change shape in a high-speed electron microscope, by analyzing the patterns made by the electrons.

Research Assistant - Quantum Optics

University of Toronto
05.2019 - 05.2020
  • Quality Analysis: Developed a novel method to quantify electron numbers per pulse in an Ultrafast Electron Diffraction (UED) setup, utilizing a Faraday Cup, CCD Camera, and simulation techniques. This ensured high-quality diffraction images and contributed significantly to quantum physics research.
  • Temporal Resolution Measurement: Engineered a transportable autocorrelator to precisely measure the pulse duration of ultrashort laser beams in a UED setup, achieving femtosecond-scale (10^-15 seconds) temporal resolution. This accurately determined the characteristic reaction times of experimental objects.
  • Data Processing Optimization: Enhanced the data processing workflow for UED experiments by designing and implementing sophisticated Python scripts for data cleaning and processing. This significantly improved the efficiency and accuracy of data analysis in quantum Optics studies.
  • Data Visualization: Produced advanced data visualizations using Python, effectively conveying complex scientific data through publication-ready graphics for leading scientific journals.

Education

Honours Bachelor of Science - Physics Specialist

University of Toronto
Toronto, ON
01.2020

Skills

  • Snowflake
  • Talend
  • ETL development
  • AWS Lambda Functions
  • AWS Glue ETL Development
  • AWS SageMaker
  • Data Warehousing
  • Data Modeling
  • Data Pipeline Design
  • Computational Physics
  • Quantum Computing
  • Physics-informed Deep Learning

Languages

English
Full Professional

Timeline

Data Integration Specialist

International Air Transport Association (IATA)
08.2023 - Current

Machine Learning Intern

University of Toronto
09.2022 - 03.2023

Data Engineer

Branchy Solution
01.2022 - 08.2023

Research Assistant - Quantum Optics

University of Toronto
05.2019 - 05.2020

Honours Bachelor of Science - Physics Specialist

University of Toronto
Jaewon Yun