
Zhengxiao Sun

Guelph, ON

Summary

Data Scientist with a strong foundation in mathematical modeling, data analysis, and database management, complemented by experience in sentiment analysis and AI model optimization. Proficient in R, SQL, and Python, with a track record of leveraging data to drive actionable insights and improve decision-making processes. Known for exceptional communication skills and the ability to bridge technical and non-technical stakeholders, fostering collaboration across diverse teams. Passionate about contributing to innovative data-driven solutions and advancing your organization’s strategic objectives.

Overview

4
years of professional experience

Work History

Data Scientist Intern

Baidu
05.2023 - 08.2023
  • Designed and optimized complex SQL queries to retrieve accurate data from community datasets, improving the accuracy of Baidu’s large language model by 15% by identifying and analyzing discrepancies between human and AI-generated queries.
  • Collaborated closely with cross-functional teams, including data scientists and engineers, to refine model performance, driving enhancements that better align AI outputs with real-world community management needs.
  • Authored key technical section of user manual for flagship product, effectively differentiating its competitive advantages and deployment options, contributing to successful client engagement and product adoption.

American Statistical Association (ASA) DataFest

On campus
03.2022 - 04.2022
  • Conducted comprehensive analysis of data from a video game that simulates teenage life choices, assessing over 10,000 unique player interactions to model behavior and extract health-related insights.
  • Utilized R and RStudio to develop advanced visualizations, including line charts and statistical analyses, based on a dataset of 5,000+ data points, identifying distinct patterns and trends in player behavior.
  • Presented findings to a group of field experts, delivering evidence-based conclusions on player behavior and recommending strategies to encourage healthier life choices among teenagers.

Deloitte Data Analytics Challenge

Remote
01.2020 - 02.2020
  • Used Python to extract North American box office revenue for Joker and visualized the data using statistical methods such as Q-Q (quantile-quantile) plots.
  • Designed an experiment to assess the impact of factors such as a film's length and ratings using statistical methods.
  • Proactively communicated with each team member to keep project progress consistent and reminded the team of key task deadlines.

Education

Master of Science - Data Science

University of Guelph
Guelph, ON
05.2026

Bachelor of Science - Computational Modeling and Data Analysis

Virginia Tech
Blacksburg, VA
12.2023

Skills

  • R
  • Python
  • SQL
  • English (Fluent)
  • Chinese (Native)
  • Problem-Solving and Effective Communication
  • Faithful

Awards

  • Dean's List: 2020
  • Dean's List: 2021
  • Dean's List: 2022
