Data Engineer/ETL Developer with 7+ years of experience in the IT industry. Specialized in Cloud platforms including AWS and Azure. Expertise in Data Analysis, Statistical Analysis, Machine Learning, Deep Learning, and Data mining. Skilled in handling large data sets of structured and unstructured data sources, including Big Data. Proficient in Python, SQL, and Tableau for end-to-end data science solutions. Experienced in using Spark with Scala for advanced analytics on Hadoop clusters and PostgreSQL for robust data engineering tasks. Domain expertise in Investments Management with Informatica Power Center for complex data extractions. Well-versed in ETL processes, Dimensional Data Modeling, SCD, Performance Tuning, and Data Warehousing. Familiar with big data technologies like Hadoop, Spark, and Hive. Strong communication and interpersonal abilities. Hands-on experience in AWS & Azure Cloud platform operations.
SQL
MYSQL
PostgreSQL
Big Data Processing Frameworks: Apache Spark
HADOOP
HDFS
Hive
JIRA
Cloud Platform: AWS
AWS EC2
AWS S3
AMAZON REDSHIFT
AWS GLUE
AWS Kinesis
AWS Lambda
AWS EMR
Languages: Python
Scala
Powershell
Reporting Tools: MS Office (Word/Excel/Power Point/Visio)
Azure Data Factory
Azure Data Lake Storage
Azure Synapse Analytics
Azure Data Bricks
Tableau
Power BI
Data warehousing
Data modeling
ETL pipeline design
Real-time processing
Data migration
Data cleansing
Big data processing
Data validation
Data profiling
Real-time analytics
API development