Third-year student at Santa Clara University, majoring in Computer Science with a minor in Mathematics since freshman year. Planning to declare an emphasis in Data Science. Developed a strong interest in programming, math, and working with data through years of systematic training as a computer science student, and bring the patience and enthusiasm to explore these areas further in real-world contexts.
Proficient in Python, fluent in C++, familiar with Java, with some experience in MATLAB and Scala. Experienced in simple game design, web app development, and data analysis.
Looking forward to both strategy- and diagnosis-oriented tasks, and to working on projects that build toward a career as a data scientist.
Explores deep learning by using a Multilayer Perceptron model to study how 180 features extracted from audio files map to 8 categories of emotion. The data came from the RAVDESS database, which contains 1435 speech samples performed by 24 actors, each labeled (numerically) with a particular type of emotion. The features were extracted with the librosa library and fall into three dominant groups: MFCC, chroma, and mel, each contributing its own array of sub-features for a total of 180 features. Achieved an accuracy of 56%.
Led a team project for the Web and Data Mining class that built an anime recommendation engine using class association rule mining on user data.
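As a rough illustration of class association rule mining (rules of the form "set of items implies class label"), the sketch below counts support and confidence by brute force over small antecedents. The transactions, item names, class labels, thresholds, and the function name `class_association_rules` are all made up for illustration and are not taken from the project.

```python
from collections import Counter
from itertools import combinations

# Toy, made-up transactions: (genres a user liked, class label for that user).
transactions = [
    ({"action", "comedy"}, "likes_shonen"),
    ({"action", "fantasy"}, "likes_shonen"),
    ({"romance", "comedy"}, "likes_shojo"),
    ({"action", "comedy", "fantasy"}, "likes_shonen"),
    ({"romance", "drama"}, "likes_shojo"),
    ({"action", "romance"}, "likes_shojo"),
]


def class_association_rules(transactions, min_support=0.3,
                            min_confidence=0.7, max_len=2):
    """Return rules (antecedent, label, support, confidence) above the thresholds."""
    n = len(transactions)
    antecedent_counts = Counter()  # how often each itemset occurs at all
    rule_counts = Counter()        # how often each itemset occurs with each label
    for items, label in transactions:
        for k in range(1, max_len + 1):
            for subset in combinations(sorted(items), k):
                antecedent_counts[subset] += 1
                rule_counts[(subset, label)] += 1

    rules = []
    for (subset, label), count in rule_counts.items():
        support = count / n                            # P(itemset and label)
        confidence = count / antecedent_counts[subset]  # P(label | itemset)
        if support >= min_support and confidence >= min_confidence:
            rules.append((subset, label, support, confidence))
    # Most confident rules first, ties broken by support and then by name.
    return sorted(rules, key=lambda r: (-r[3], -r[2], r[0], r[1]))


rules = class_association_rules(transactions)
for antecedent, label, support, confidence in rules:
    print(antecedent, "->", label, round(support, 2), round(confidence, 2))
```

A recommender could then rank candidate titles by the confidence of the rules whose antecedent matches a user's history; a real system would use an Apriori-style search rather than enumerating every subset.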
Applies PCA to international stock exchange data to uncover dynamics within the world stock market. The dataset contains stock exchange data for 9 countries across 536 time points from January 5, 2009 to February 22, 2011.
Link to database: https://archive.ics.uci.edu/ml/machine-learning-databases/00247/data_akbilgic.xlsx
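A minimal sketch of PCA via the singular value decomposition, assuming NumPy is available. The input here is synthetic data with the same shape as the dataset described above (536 time points by 9 series), generated from one shared "market" factor plus noise; it is not the actual stock exchange data, and the helper name `pca` is hypothetical.

```python
import numpy as np


def pca(X, n_components):
    """PCA by SVD: returns principal axes, projected scores, explained-variance ratios."""
    Xc = X - X.mean(axis=0)                      # center each column
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    explained_variance = s ** 2 / (X.shape[0] - 1)
    ratio = explained_variance / explained_variance.sum()
    components = Vt[:n_components]               # rows are principal axes
    scores = Xc @ components.T                   # data projected onto the axes
    return components, scores, ratio[:n_components]


# Synthetic stand-in for 9 return series over 536 time points (assumption).
rng = np.random.default_rng(0)
market = rng.normal(size=(536, 1))               # one shared factor
loadings = rng.uniform(0.5, 1.5, size=(1, 9))    # each series' exposure to it
returns = market @ loadings + 0.3 * rng.normal(size=(536, 9))

components, scores, ratio = pca(returns, n_components=2)
print("explained variance ratio:", np.round(ratio, 3))
```

With data built this way the first component absorbs most of the variance, which is the kind of shared movement across markets that PCA is suited to expose.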
This project applies the general linear model (GLM) to a dataset on bike sales in Seoul, with the aim of predicting future sales from selected features including season, rainfall, and temperature in the region. The highlights of the project can be summarized as follows:
Link to database: https://archive.ics.uci.edu/ml/machine-learning-databases/00560/SeoulBikeData.csv
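To illustrate the kind of model involved, here is a minimal sketch of fitting a general linear model by ordinary least squares with NumPy. The data are synthetic, the coefficients in `true_beta` are invented, and the column choices (season dummies, temperature, rainfall) only mirror the features named above; none of this reproduces the project's actual model or results.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 400

# Synthetic predictors (assumptions, not the Seoul data).
season = rng.integers(0, 4, size=n)            # 4 seasons coded 0..3
temperature = rng.uniform(-10, 35, size=n)     # degrees Celsius
rainfall = rng.exponential(1.0, size=n)        # mm

# Design matrix: intercept, 3 season dummies (season 0 is the baseline),
# temperature, rainfall.
season_dummies = np.eye(4)[season][:, 1:]
X = np.column_stack([np.ones(n), season_dummies, temperature, rainfall])

# Invented "true" coefficients used only to generate the toy response.
true_beta = np.array([200.0, 50.0, 120.0, 30.0, 8.0, -40.0])
y = X @ true_beta + rng.normal(scale=20.0, size=n)

# Least-squares fit of y = X @ beta + noise.
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)

predicted = X @ beta_hat
r_squared = 1 - ((y - predicted) ** 2).sum() / ((y - y.mean()) ** 2).sum()
print("estimated coefficients:", np.round(beta_hat, 2))
print("R^2:", round(r_squared, 3))
```

Encoding season as dummy variables with one baseline level keeps the design matrix full rank, so the intercept and season effects remain identifiable.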
This project explores image filtering, seeking an optimal way to filter Picasso's portrait of Stravinsky using two techniques: image convolution and low-rank approximation.
The second part of the project evaluates the effectiveness of SVD in denoising (projecting the noise out of an image) and involves the following steps:
Source of image: https://upload.wikimedia.org/wikipedia/en/1/1c/Stravinsky_picasso.png
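The low-rank approximation idea can be sketched with NumPy's SVD as below. This is an illustration under stated assumptions: the "image" is a synthetic rank-2 array with added Gaussian noise rather than the Stravinsky portrait, and the rank `k = 2` is chosen because the clean signal is known to be rank 2; the helper `low_rank_approx` is a hypothetical name.

```python
import numpy as np


def low_rank_approx(A, k):
    """Best rank-k approximation of A in the Frobenius norm (truncated SVD)."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return (U[:, :k] * s[:k]) @ Vt[:k]


rng = np.random.default_rng(0)
x = np.linspace(0, 1, 64)

# Synthetic rank-2 "image" standing in for a real grayscale picture.
clean = np.outer(np.sin(2 * np.pi * x), np.cos(2 * np.pi * x)) + np.outer(x, x)
noisy = clean + 0.1 * rng.normal(size=clean.shape)

# Keep only the two largest singular components; the discarded ones are
# dominated by noise in this toy setup.
denoised = low_rank_approx(noisy, k=2)

err_noisy = np.linalg.norm(noisy - clean)
err_denoised = np.linalg.norm(denoised - clean)
print("error before:", round(err_noisy, 3), "after:", round(err_denoised, 3))
```

On a real image the right rank is not known in advance, so one would typically inspect the decay of the singular values and compare reconstructions for several values of `k`.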
Developed an interactive web application that simulates a pizzeria using Django and deployed it on Heroku. The website is modeled on the Uber Eats platform and lets users create and edit their own pizza menus and enrich them with details about the toppings for each pizza.
The main challenge of the project lies in its overall architectural design, which can be summarized as follows:
Link to website: https://centaur-pizzeria.herokuapp.com/
The Alien Invasion project is a Python game developed using pygame. The player controls a ship that shoots bullets at descending alien fleets. While using the classic arcade game Galaxian as its benchmark, the project also introduces some improvements in the following aspects:
The overall game design follows the OOP methodology.
Data structures class project that develops a C++ simulation of office-hour visits, based on assumptions about the order of student arrival, the duration of office hours as dependent on the presence of students, the arrival rate, and the service rate. The project may be summarized as follows:
Data structures class project that develops a C++ program that searches a given word puzzle for all words of length 6 or more that appear in a given dictionary:
A data structures class project that develops a C++ program that solves doublets puzzles by representing the given information as a graph. A doublets puzzle is a pair of words of the same length. To solve the puzzle, one must produce a sequence of words such as hate, have, lave, love. In this sequence, the first and last words are the given words, and adjacent words differ at exactly one position (so all words must have the same length). The project was based on Donald Knuth's dictionary, which contains 4500 common 5-letter words. The problem was solved by representing the dictionary as a graph with 4500 vertices (one per word) whose edges connect pairs of words that differ at exactly one position. The algorithm performs a breadth-first search starting from the first word until the second word is found. The project demonstrates how much the choice of data structure matters for algorithmic complexity: with the graph representation and BFS, the running time was reduced from factorial order to O(V+E).
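The breadth-first search described above can be sketched in Python as follows (the project itself was written in C++). This version is an illustration only: it generates neighbors on the fly by trying every single-letter substitution instead of storing an explicit edge list, and the six-word dictionary and the function name `solve_doublet` are made up for the example.

```python
from collections import deque
from string import ascii_lowercase


def solve_doublet(start, goal, dictionary):
    """Return a shortest word ladder from start to goal, or None if none exists."""
    words = set(dictionary)
    if start not in words or goal not in words or len(start) != len(goal):
        return None

    parent = {start: None}          # also serves as the visited set
    queue = deque([start])
    while queue:
        word = queue.popleft()
        if word == goal:
            # Walk the parent links back to the start to recover the path.
            path = []
            while word is not None:
                path.append(word)
                word = parent[word]
            return path[::-1]
        # Neighbors are dictionary words differing at exactly one position.
        for i in range(len(word)):
            for c in ascii_lowercase:
                neighbor = word[:i] + c + word[i + 1:]
                if neighbor in words and neighbor not in parent:
                    parent[neighbor] = word
                    queue.append(neighbor)
    return None


# Tiny made-up dictionary built around the example from the text.
toy_dictionary = ["hate", "have", "lave", "love", "late", "lane"]
path = solve_doublet("hate", "love", toy_dictionary)
print(path)
```

Because BFS explores words in order of distance from the start, the first time the goal is dequeued the recovered sequence is a shortest one, although several equally short sequences may exist.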
A Java program that simulates tic-tac-toe: when run, it automatically produces a sequence of X/O boards that display each move of the two players. The challenge of the project lies in the following three aspects:
Python/C++/Java/MATLAB/Scala