Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Shirley You

Research Data Scientist
Ottawa,ON

Summary

Data Scientist with 5 years of experience executing data-driven solutions to increase efficiency, accuracy, and utility of internal data processing. Experienced at creating data regression models, using predictive data modeling, and analyzing Machine Learning algorithms to deliver insights and implement action-oriented solutions to complex problems.

Overview

5
5
years of professional experience
4
4
Certification

Work History

Research Data Scientist

Unity Health Toronto
08.2021 - Current
  • Lead analyst for the Ulcerative colitis research project, developed advanced SQL queries and analytics in Rstudio by applying statistical algorithms and clinical data science methods to interpret key points from the data, aiming to improving the quality of care for patients by providing researchers detailed reports.
  • Taking leadership on the investigation of missing lab and comparison between pharmacy manual mapping and RxNorm result, which not only provided a brand new insight and a new way of working with medication tables but also supported future publications of RxNorm.
  • Taking the leadership of upgrading MPR cohort generation code from using flat files to SQL database, parameterized the key data elements to accelerate running time and improved the data accuracy. Applied advanced SQL query and adaptive statistical iterative reconstruction algorithm in R. Collaborated with DQA team, data science team and management level, we've made a huge success and achieved a milestone together.
  • Taking the leadership on Long COVID Data harmonization for two hospital sites, motivated to reaching out to external point of contacts and finding solutions to address raw data issues, conducting the data engineering structures, collaborating and communicating with team to create solutions for complex data problems. Harmonized over 100 non-derived variables for each sites and created the solution for the linking issue and over 80% patients were able to link to the database.
  • Appointed analyst at ICES for ONCO project, provided comprehensive analyses to internal and external stakeholders by using SAS programming. Initiated and provided useful insights to all the scientists, researchers and collaborators with ICES-GEMINI Linkage and the usage between ICES and GEMINI datasets.

Data Analyst

IMBA Medical
12.2020 - 08.2021
  • Analyzed and processed complex data sets using advanced querying, visualization and analytics tools to increase efficiency of customer data management by 80%
  • Applied methods for big data to reveal patterns, trends, and associations based upon user comments and feedback, resulting in 18% revenue growth
  • Supported COVID-19 tracking system and dashboards, generated patient's health progress reports and graphic visualization
  • Collaborated with deployment team, development team, sales team and client relationship team working on 3 different projects achieving 20% improvement in approval decision time
  • Researched and resolved issues regarding integrity of data flow into databases

Data Science Fellow

Sharpest Minds
08.2019 - 04.2020
  • Constructing data pipelines and applying data processing, cleansing and integrating techniques by using Pandas, NumPy, and Scikit-learn in Python programming, boosted efficiency of data manipulating process by 20%
  • Contributing to predictive analytical modelling standards, reporting, and data analysis methodologies, model management, increased model accuracy by 12%.

Research Assistant Epidemiologist

University of Ottawa
08.2018 - 04.2019
  • Worked closely with Epidemiology department and Statistics department to establish coding techniques and structure of datasets on a year-long, data-driven research project
  • Assist with designs, developments, and implementation of clinical studies and published scientific research papers.
  • Augmented experimental DNA datasets linking parental genetic markers and vitamin consumption to incidences of cleft palates in children by using synthetic data.
  • Built ETL functions to extract genomes data from different data sources, transform DNA datasets and employ high-performance scientific computing on cloud.
  • Utilize statistical models and machine learning to visualize and interpret data in order to perform meaningful analyses and draw scientific conclusions from data.
  • Key achievement: The first team that made significant breakthroughs on worldwide genome level regarding maternal environmental effects on newborns, which could potentially help to solve children's cleft palates illness.

Data Analyst

Dress for Success Ottawa
08.2018 - 11.2018
  • Assisted with CRM database and used Excel to build personas and searchable portfolios for clients
  • Used Google Analytics and Microsoft Excel to generate annual business reports, filter and sort data, create pivot tables, visualization and graphics
  • Built and verified entire data pipeline for 2018 Annual Report, saving company approximately 5% per year in developer debugging time.

Education

Master of Science - Mathematics and Statistics

University of Ottawa
Ottawa, ON
05.2019

Bachelor of Science - Mathematics

Tianjin University of Technology And Education
China
07.2016

Skills

  • Python
  • Rstudio
  • SAS
  • SQL
  • Tableau
  • Microsoft Office
  • Data Wrangling, Visualization and Analysis
  • Machine Learning
  • Statistical analysis
  • Communication
  • Team collaboration
  • Leadership
  • Presentation

Certification

  • Certified SAS Programmer, SAS. Issued May 2020, No Expiration Date. Credential ID: HTZ3RYUENYFW
  • Certified Data Science in Python, University of Michigan. Issued Mar 2020, No Expiration Date. Credential ID: CYH8FECXUEG7
  • Certified Clinical Epidemiology, Utrecht University Issued Sep 2021, No Expiration Date. Credential ID: W2YHWJNMXQD7
  • Certified Clinical Data Science, University of Colorado. Issued Mar 2022, No Expiration Date. Credential ID: 4D2EH2UEKLEK

Timeline

Research Data Scientist

Unity Health Toronto
08.2021 - Current

Data Analyst

IMBA Medical
12.2020 - 08.2021

Data Science Fellow

Sharpest Minds
08.2019 - 04.2020

Research Assistant Epidemiologist

University of Ottawa
08.2018 - 04.2019

Data Analyst

Dress for Success Ottawa
08.2018 - 11.2018

Master of Science - Mathematics and Statistics

University of Ottawa

Bachelor of Science - Mathematics

Tianjin University of Technology And Education
Shirley YouResearch Data Scientist