Data Scientist with 7 years of experience specializing in predictive modeling and data processing. Proven leader in data science projects across shipping, banking, and real estate sectors. Proficient in Python, SQL, Hadoop, Numpy, Pandas, Ski-learn, PyTorch, and LightGBM. Committed to leveraging data to drive business strategy and performance.
Executed extensive data cleansing, feature engineering, exploratory data analysis, AB testing, and error analytics
Key Projects & Accomplishments:
Customer Demand Forecasting Project
• Developed and researched machine learning models to accurately forecast
customer demand for shipping services, refining the booking process and
enabling the engine to prioritize high-yield customers.
• Improved overall accuracy by 10% compared to previous methods.
Vessel Departure Time Prediction Project
• Developed machine learning models to predict vessel departure times,
enhancing customer satisfaction and aiding vessel advisors in minimizing
bunker costs.
• Improved computation time using Numpy vectorization and decreased
memory usage with Dask, resulting in a tenfold reduction in processing time
for the complete set.
Empty Container Forecasting Project
• Developed machine learning models to estimate the number of empty
containers needed for the engine to optimize the repositioning of empty
containers.
• Designed an automatic retrain pipeline for the model, leading to significant
improvements in the efficiency of the training process
Key Project & Accomplishments:
Document Classification using NLP
• Developed a natural language processing (NLP) model to categorize
documents based on their content.
• Utilized optical character recognition (OCR) tools to extract text from
documents. This enhancement in document classification saved one week’s worth of manpower with higher accuracy.
Python Programming
Machine Learning
SQL Databases
Statistical Analysis
Scikit-Learn
Neural Networks
Big Data Analytics
Agile Methodology
Data Mining
Feature Engineering
Business Forecasting