Summary
Overview
Work History
Education
Skills
Languages
Timeline
Generic

Jin Han

Victoria

Summary

Knowledgeable data scientist with strong foundation in data analysis, machine learning, and statistical modeling. Successfully developed predictive models and data-driven solutions to optimize business performance. Demonstrated proficiency in Python and SQL, leveraging data visualization tools to communicate insights effectively.

Overview

6
6
years of professional experience

Work History

Data Scientist

Budweiser Brewing Company APAC
09.2023 - 01.2024
  • Collaborated with the Indian team to compute monthly Asia ROI, analyzing brands index to inform next month's marketing strategy.
  • Implemented predictive algorithms to forecast beer ratings, identifying critical brewing process elements affecting taste.

Senior Data Analyst

VoxelCloud Inc
06.2019 - 05.2023

Data manipulating

  • Utilized the Selenium framework to simulate user actions for login, clicks, and pagination operations. Wrote automated web crawling scripts, including anti-scraping measures, successfully collecting over 20,000 skin images
  • Conducted research on various OCR technologies to extract and batch process text from prescription images and medical reports, Results were used by the algorithm team to upgrade and refine the skin classification model, achieving a significant improvement.
  • Designed and implemented data synchronization between two major internal databases using sqlalchemy, PostgreSQL plugin dblink, and cron jobs, enhancing data sharing efficiency across teams.
  • Formulated and improved data cleaning, anonymization, and storage guidelines. Led the development of an automated data anonymization and storage platform, improving data management and processing efficiency.
  • Managed the storage, backup, and retrieval of multimodal massive datasets for various product lines. Proficient in using NAS, minIO, and Alibaba Cloud storage services.

Data Mining

  • Developed an evaluation metric system for the company's multi-disease prediction models.
  • Independently completed a facial segmentation project based on Naive Bayes, with the model results in production use.
  • Conducted AB testing analysis for pricing strategies on mini-programs and performed funnel analysis.
  • Designed tracking plans for the annotation platform and wrote quality control analysis scripts using sqlalchemy and plotly.

Multimodal Annotation Platform Development

  • Utilized computer vision, machine learning, and unsupervised hierarchical clustering algorithms to deploy lesion annotation graphical fusion, reducing annotation and case cost by 30%.
  • Designed and optimized automated ETL processes for high-frequency annotation projects, leading the development of a one-click duplication feature for annotation projects.
  • Led and managed the application of quality analysis modules on the annotation platform. Implemented online quality control for annotators, resulting in cost savings of 5%-20% per project for the company.
  • Authored general-purpose script modules for data processing in various annotation platform workflows, shared across the entire data team, reducing response time and improving data processing efficiency.

Data Science Intern

NBCUniversal
06.2018 - 12.2018
  • Cleaned 1 billion rows of raw data for CNBC network with Pyspark
  • Built Predictive Regression Models on Pyspark with ml and MLlib packages
  • Contributed to R&D tasks to enhance currently implemented forecasting methodologies
  • Contributed to building data-processing algorithms and visualizations for dashboards and BI tools

Education

Master of Science - Computer Science

University of Victoria
Victoria, BC
12-2025

Master of Science - Data Science

New York University
New York,NY
05-2019

Bachelor of Science - Statistics

Shandong University
Shandong, China
05-2017

Skills

  • Python programming
  • Machine learning
  • SQL databases
  • Statistical analysis
  • Scikit-learn
  • Natural language processing
  • Big data analytics
  • Neural networks
  • Data mining
  • Data wrangling

Languages

English(Full Professional)
Chinese (Native)

Timeline

Data Scientist

Budweiser Brewing Company APAC
09.2023 - 01.2024

Senior Data Analyst

VoxelCloud Inc
06.2019 - 05.2023

Data Science Intern

NBCUniversal
06.2018 - 12.2018

Master of Science - Computer Science

University of Victoria

Master of Science - Data Science

New York University

Bachelor of Science - Statistics

Shandong University
Jin Han