Summary
Overview
Work History
Education
Skills
Languages
Certification
Timeline
Generic

Zixuan Cai

Waterloo,ON

Summary

Three-plus years hands-on experience in data mining, modeling, visualization, and statistical analysis using R, SAS, and Power BI. Proficient in tidyverse and MASS, with expertise in machine learning techniques like LDA, QDA, Naïve Bayes, and tree-based models using the caret library. Experienced in OLS regression (SLR, MLR) and classification (logistic, probit) models. Skilled in model validation with cross-validation (CV), ROC/AUC analysis, and various selection criteria such as AIC, BIC, etc. Strong SQL skills for data manipulation and analysis. Flexible and hardworking team player focused on boosting productivity and performance with conscientious and detail-oriented approaches.

Overview

2
2
years of professional experience
1
1
Certification

Work History

Biometrics Intern

Luzsana Biotechnology, Inc.
06.2022 - 08.2022
  • Developed R codes generating the optimal dose escalation/de-escalation boundaries, selecting the maximum tolerated dose (MTD), and obtaining the operating characteristics of the “i3+3” and Bayesian Optimal Interval (BOIN) dose escalation designs for single-agent oncology phase I trials.
  • Conducted simulations under 15 dose-toxicity scenarios to evaluate the impact of various design parameters on the operating characteristics of oncology phase I methods and demonstrated the results using R including R Shiny, RMD, and dplyr.
  • Drafted and finalized a PowerPoint slide deck contrasting the performances of “i3+3” and BOIN designs in terms of safety and efficacy to help the R&D group make informed decisions about phase I methods selection.

Intern

Jiangsu Hengrui Pharmaceutics, Inc.
08.2020 - 03.2021
  • Assisted researchers from Gastric Cancer RWS project with experimental data recording.
  • Analyzed lab data in Microsoft Excel using statistical methods, including t-tests and Chi-square tests.
  • Generated compelling data visualizations, including histograms and summary tables, to communicate hidden insights within the data using Microsoft Excel.
  • Summarized peer-reviewed scientific journals by creating concise outlines using Microsoft Word for research references.

Education

MMSc - Management Science

University of Waterloo
Waterloo, ON
05.2025

M.P.H. - Biostatistics

University of California, Los Angeles
Los Angeles, CA
06.2023

Bachelor of Science - Pharmaceutical Sciences

University of California, Irvine
Irvine, CA
06.2021

Skills

  • Programming: SAS 94, R/R Studio, SQL (Google BigQuery, dbplyr), Python, STATA, Git
  • Data preparation & Validation: Microsoft Excel (Pivot Table, Power Query, functions for analytics)
  • Data visualization: Tableau, Microsoft Power BI, R (ggplot2, Shiny App), SAS (Visual Analytics, proc chart, proc sgplot, proc univariate, etc)
  • Soft Skills: Communication, Organizational and Analytical Skills, Openness to Learning, Adaptability, Leadership, Teamwork, Time Management, Project Management, Interpersonal Skills, Public Speaking

Languages

English
Full Professional
Chinese (Mandarin)
Native or Bilingual

Certification

  • Data Analytics Specialization, Google LLC - Issued Feb 2024 - Credential ID CMS7RBXM6PBP
  • R Programming, Johns Hopkins University - Issued Oct 2022 - Credential ID BDRPXDPQ7QMK

Timeline

Biometrics Intern

Luzsana Biotechnology, Inc.
06.2022 - 08.2022

Intern

Jiangsu Hengrui Pharmaceutics, Inc.
08.2020 - 03.2021

MMSc - Management Science

University of Waterloo

M.P.H. - Biostatistics

University of California, Los Angeles

Bachelor of Science - Pharmaceutical Sciences

University of California, Irvine
Zixuan Cai