Summary
Overview
Work History
Education
Skills
Accomplishments
Timeline
Generic

ANJANI MANASA KALLURI

Toronto,ON

Summary

Dynamic and results-driven IT professional with a passion for Python development and data engineering. Seeking mid-level roles to leverage expertise in Python, Java, and data manipulation for driving innovative solutions and contributing to organizational growth. Eager to apply skills in diverse formats such as ETL frameworks, data profiling, and machine learning to make a significant impact in dynamic team environments. Committed to continuous learning and professional development in the field of IT.

Overview

5
5
years of professional experience

Work History

Software Engineer

Value Momentum
08.2021 - 11.2023
  • Spearheaded the development of an ETL framework leveraging PySpark, SQL, Azure Data Lake Storage (ADLS), and Azure Databricks (ADB) to manage end-to-end data processing from raw processing to loading into CosmosDB.
  • Orchestrated data flow across three distinct layers (Raw, Curated, Processed) to optimize data handling efficiency and ensure seamless processing.
  • Collaborated closely with clients to comprehend their data structures and requirements, translating them into effective data transformations within the framework.
  • Implemented new features, delivered enhancements, debugged issues, and fine-tuned codebase to align with client specifications, resulting in enhanced functionality and superior performance.
  • Designed and implemented a Schema Enforcement feature to validate records against predefined schemas across diverse file formats, ensuring data integrity and consistency.
  • Actively participated in the Databricks community, contributing insights, finding bugs in open-source projects such as Spark Excel and resolving recursive view errors in Spark 3.2.0.
  • Expertly extracted complex data from various formats including MHTML, CSV, JSON, and multi-sheet Excel files, developing an OLTP connector to facilitate seamless data insertion into Cosmos DB. Additionally, authored data profiling scripts to empower the Power BI team with insightful analytics.
  • Leveraged Azure ML Flow library within Azure Databricks to develop and deploy machine learning models for premium value prediction as part of Proof of Concept (POC), significantly contributing to successful business negotiations.

Software Developer Intern

Spottabl
02.2021 - 07.2021
  • Engineered RESTful APIs using Python with Django and Node.js, employing MongoDB for efficient data storage and retrieval.
  • Written services around various AWS clients offered by different AWS services.
  • Orchestrated the deployment of APIs on Amazon Web Services (AWS), ensuring seamless integration and accessibility for end-users.
  • Crafted Python scripts to extract pertinent data from diverse websites including LinkedIn, Crunchbase, and Traxcn, facilitating comprehensive data acquisition for analysis and processing.
  • Leveraged Selenium and Beautiful Soup for proficient web scraping, enabling the collection of relevant data from various online sources with accuracy and efficiency.

Research Intern

Carnegie Mellon University
07.2020 - 09.2020
  • Extracted data from diverse APIs including Fitbit and GoogleFit watches, integrating it into machine learning models for a smart application aimed at predicting the likelihood of contracting Covid-19.
  • Designed an intuitive and user-friendly interface allowing customers to input their health data conveniently, enhancing user engagement and facilitating data collection for analysis.

Associate System Engineer

Tata Consultancy Services
10.2018 - 07.2019
  • Initiated career as a Java developer, entrusted with the development of CRUD APIs to manage data flow from an Insurance Application to a Splunk dashboard, enhancing data visualization and analytical capabilities.
  • Automated Splunk dashboard report generation processes to ensure adherence to client specifications, streamlining operations and promoting efficiency in data analysis and presentation.

Education

Master of Science in Information Technology - Computer Science

International Institute of Information Technology
Hyderabad
07.2021

Bachelor of Engineering: Electrical And Electronic - Science

SRKR Engineering College
Bhimavaram,India
04.2018

Skills

Programming Languages:

Python, Java, Nodejs, C

Frameworks,Libraries and Databases:

PySpark, Flask, React, Native, SQL Alchemy, Django, ReactJs, MySQL, MongoDB, CosmosDB, Azure

Data Processing and Analysis Tools:

Apache-spark,SQL, Azure Databricks, Azure Datafactory, ADLS (Azure Data Lake Storage),Delta Lake,Azure ML-Flow,ML Lib

Other Tools:

Selenium,Heroku, Firebase,GitHub,GitLab,Azure Repos

Accomplishments

  • Significantly reduced writing time to CosmosDB from 16 to 4 hours by implementing the Spark OLTP library.
  • Developed code to compare and merge two data frames with highly complex nested JSON schemas.
  • Recognized as the 'Star Employee' twice: first for identifying the root cause of a data count mismatch in CosmosDB, resolving a critical issue that halted production for 6 hours after collaborating with the Microsoft CosmosDB Principal Engineer, and second for receiving exceptional feedback from clients for agility and commitment to resolving production bugs, contributing to high client satisfaction levels.
  • Promoted to Software Engineer within a year of joining ValueMomentum at the Associate level, reflecting outstanding performance and dedication.

Timeline

Software Engineer

Value Momentum
08.2021 - 11.2023

Software Developer Intern

Spottabl
02.2021 - 07.2021

Research Intern

Carnegie Mellon University
07.2020 - 09.2020

Associate System Engineer

Tata Consultancy Services
10.2018 - 07.2019

Master of Science in Information Technology - Computer Science

International Institute of Information Technology

Bachelor of Engineering: Electrical And Electronic - Science

SRKR Engineering College
ANJANI MANASA KALLURI