An Economist turned Data Scientist turned Data Engineer turning Software Engineer 🤞. I have a track record of acquiring new skills while successfully solving problems and following my curiosity. Ambiguity does not stop me from shipping value, I love collaborating with others and having a great time at work (and life).
Technical lead on an internal web app (https://github.com/Shopify/tintin-gcp/tree/main) designed to bring curated data analytics to Shopifolk.
- Current Infrastructure: Python (Flask) Web App deployed with the help of prod-kit to Google Cloud Run. Nightly jobs that populate data in production database are scheduled in Cloud Composer. Some pages in the frontend require fresh data that comes from async live queries to BigQuery.
- Other relevant components: CloudSQL instance for storing production data, Memcached for caching html pages, GCS Bucket to store Search Index generated via whoosh library.
- Over the course of 6 months single-handedly migrated the app from old custom Shopify infrastructure to the current infra in GCP, dramatically improved costs by using async live queries instead of pre-computing analysis nightly for every piece of information shown.
- Worked on every single connected component of the app: frontend / backend, all GCP Infra components, all web app components.
- The app organically grew to 400+ Monthly Active Users
- Why it's valuable? Opinionated analytics provides stakeholders with better context about our App Ecosystem, allows them to ask better questions and reduces the number of time consuming but relatively low impact ad hoc data requests. As a result, the team can focus on more impactful work while also providing better and timelier service to our stakeholders. The content is curated by me and fellow data scientists on the team.
Responsible for building high quality data assets to power the web app as well as other data reports and pipelines in the Ecosystem org.
- DBT modelling
- maintaining Google Cloud Project for the team
- brainstorming, designing and prioritizing new data assets with the team
A full stack Data Scientist
- Design and development of an internal Python Web App
- Data modelling via internal spark based framework and DBT
- Data Visualization via various dashboards (Tableau, Mode) and presentations to leadership and stakeholders
- Defining KPIs for business areas with alignment with senior stakeholders and product management
- Mentoring others
Notable projects:
- Under the leadership of a fellow Data Scientist, helped build a web app (Tintin) that provided opinionated context around Shopify's App Ecosystem. This project was a bog bet, but it was hugely successful with 600 Monthly Active Users by the end of 2022. We were shipping really fast, kept iterating, explored new ideas quickly - kept them if they were successful, discarded otherwise. Conducted an extensive user interviews (link) to collect product requirements and feedback.
- Data Lead on a large multidisciplinary project (Merchant Homecoming) to convert merchants to using Checkout 1 vs apps hijacking Shopify checkout. Was responsible for determining KPIs, tracking the project success, helping prioritize which merchants to reach out to.
Love adventures in the outdoors! Skiing, downhill biking, cross country biking, hiking...