Data engineer with 5 years of experience in designing, building, and optimizing data pipelines using Azure, AWS, and Apache technologies, and holding a master's degree in computer science. Seeking to contribute to a forward-thinking organization by leveraging expertise in ETL development, data transformation, and workflow automation using tools such as Azure Data Factory, AWS Glue, Apache Spark, Apache Airflow, and Terraform. Committed to delivering scalable, efficient, and reliable data solutions in both collaborative and remote environments.
Operating Systems: Windows, Linux, Mac, Unix
Programming Languages: Python, PySpark, Pandas, Scala, Java, C, C, R, C#
Cloud Platforms: Azure, AWS
Databases: Oracle, MySQL, SQL Server, MongoDB, Cassandra, DynamoDB, PostgreSQL
Data Warehousing: Redshift, Snowflake, Azure Synapse Analytics
Big Data Technologies: Apache Spark, Databricks Hadoop, MapReduce, HDFS, PIG, Hive, Kafka, Zookeeper
Machine Learning: Scikit-Learn, PyTorch, XGBoost, Azure Machine Learning
Streaming Technologies: Apache Kafka, Amazon Kinesis, Apache Flink, Azure Event Hubs
Monitoring Tools: Apache Airflow, Amazon CloudWatch, Azure Monitor
Visualization/ Reporting: Tableau, SSRS, Amazon QuickSight and Power BI