Overview
Work History
Education
Skills
Internship Experience
Teaching Experiences
Timeline
Generic

AMIR PARIZI

Vancouver,BC

Overview

3
3
years of professional experience

Work History

Data Scientist II

Staples US Retail - Marketing Team
09.2022 - Current
  • LLM English-SQL Bot: Enhanced the performance of the LLM (Azure OpenAI - GPT 3.5) model by optimizing it through Snowflake database tuning. This optimization facilitated natural language database searching in Snowpark for the BI team.

Tools : Azure OpenAI , Python , Snowpark , Streamlit , Snowflake, GPT 3.5

  • Customer Segmentation 360 Dashboard: Developed a Customer Segmentation algorithm for a massive 900M transaction dataset, resulting in improved targeted marketing strategies and the creation of the "Customer 360 Dashboard" for Staples US stores.

Tools : Python , SQL , Snowpark , Clustering , Big Data

  • A/B Testing: Streamlined marketing A/B testing processes with a Python application, contributing to enhanced testing procedures.

Tools : Python , A/B Test , Bayesian Probability , Flask

  • Sentiment Analysis: Conducted Sentiment Analysis on a dataset consisting of 560K survey responses, achieving an impressive accuracy rate of 96.7% in labeling by employing TF-IDF and LSTM models.

Tools : Python , Snowflake , TF-IDF , NLTK , PySpark , Tableau , PowerBI

  • Time Series Forecasting: Provided mentorship to a Junior Data Scientist in the realm of Time Series Analysis and successfully implemented a production-ready LSTM model in Snowflake. This model enabled the generation of hourly flash reports and the detection of data anomalies.

Tools : Python , Snowflake , Neural Network (Keras) , Pytorch , Time Series Analysis ( SARIMAX ) , Tableau

  • Life Time Value Model: Transformed the Life Time Value (LTV) model from Pandas to Pyspark, dramatically reducing the training time from 2 days to just 40 minutes for modeling 17M customers using the XGBoost algorithm.

Tools : Python , Snowflake , XGBoost, Pyspark


Data Scientist I

Staples US Retail
05.2021 - 08.2022
  • Anomaly Detection in Staples IIS Logs: Leveraged Azure's built-in anomaly detection feature and Deep Learning models (RNN and LSTM) to identify anomalies in Staples IIS logs, thereby preventing prolonged error experiences on the Staples website

Tools: Python , Neural Network (LSTM) , Tableau , Azure DevOps , Django

  • Quality Check Function for Staples.com Upload Page: Developed a quality check function for the Staples.com upload page to ensure business cards met border criteria and prevented information cutoff during printing, achieving a precision rate of 91%.

Tools: Python , Neural Network (LSTM) , Tableau , Azure DevOps

  • Data Migration: Successfully migrated data from MySQL and DataBricks to the Snowflake and Snowpark environment.

Tools: Python , Snowflake , MySQL , DataBricks, SnowPark

  • NLP/Computer Vision Analysis in Fashion Industry: Conducted NLP and Computer Vision analyses on various fashion industry products to create machine learning models for consulting purposes, assisting new retailers.

Tools: Python , MySQL , Tf-IDF , NLTK , Deep Learning (LSTM)

Education

Master of Science - Engineering

University of British Columbia
Vancouver, BC
05.2021

Bachelor of Science - Engineering

Sharif University of Technology
Tehran
08.2018

Skills

  • Programming and Development: Python, C , C, Django, Flask , Streamlit
  • Data Analysis and Machine Learning:
    Snowpark, Machine Learning (Scikit-learn, Logistic models, word2vec, KNN, Tree-based models, k-means, SVC) , Deep Learning (Keras, TensorFlow, Pytorch)
  • Data Management and Processing: Hadoop, Spark, Azure , Snowflake , SQL
  • Data Analytics: PowerBI, Tableau
  • Statistical Analysis and Inference: A/B testing, Bayesian
  • Version Control and Project Management: Git,
    Jira Agile

Internship Experience

UBC Sauder Business School  Jun 2020 - Oct 2020


  • NLP Classification for Asset Transfer: Conducted an NLP classification task on a dataset consisting of 400,000 records from various firms. Utilized Word Embedding techniques and a Bidirectional LSTM neural network architecture to accurately label asset transfer or ownership transfer.
  • Health Clinic Patient Flow Optimization: Designed and implemented a stochastic model for a health clinic in C++ to optimize patient flow within elevators.
  • Elevator Utilization Program: Proposed individualized programs for each elevator based on the simulation results to achieve optimal elevator utilization during the COVID-19 pandemic.

Teaching Experiences

  • Instructor Concordia Education Edu: Lead instructor for the one-year data science bootcamp
  • Mentor/Instructor Lighthouse Lab: Held instructor and mentor sessions for data science bootcamp
  • Curriculum Developer Lighthouse Lab: Create content for Tableau and Machine Learning courses
  • Session Lead: Udacity: Session lead and instructor for 12 weeks AI with python nanodegree

Timeline

Data Scientist II

Staples US Retail - Marketing Team
09.2022 - Current

Data Scientist I

Staples US Retail
05.2021 - 08.2022

Master of Science - Engineering

University of British Columbia

Bachelor of Science - Engineering

Sharif University of Technology
AMIR PARIZI