Summary
Overview
Work History
Education
Skills
Certification
Languages
Timeline
Generic

Naveen Maranayakanahalli Beluraiah

Woodstock,ON

Summary

Highly accomplished Big Data Architect and AI Data Engineer with 14+ years of progressive experience designing and implementing large-scale data solutions across Microsoft Azure, Google Cloud Platform (GCP), and AWS. Proven expertise in leveraging Generative AI (Gemini AI, Vertex AI) for complex code migration and modernization. Expert in leading cross-functional teams, architecting end-to-end ETL/ELT pipelines, and specializing in PySpark, Azure Synapse, Databricks, and BigQuery. Certified Azure Solutions Architect (AZ 305).

Overview

15
15
years of professional experience
1
1
Certification

Work History

Specialist, AI Data Engineer

Definity
04.2025 - Current
  • GCP Data Platform Implementation & Ontario Auto Reform:
  • Strategic Implementation: Led the engineering implementation of the Ontario Auto Reform data platform on GCP, ensuring compliance and readiness for new business requirements.
  • Multi-Source Ingestion: Designed and implemented pipelines to ingest data from core source systems including Guidewire, Mainframe, and Sonnet digital applications directly into GCS.
  • Streaming Data Flow: Established real-time and batch data pipelines using MQ and Kafka Connector to feed data into Google Cloud Storage (GCS).
  • Curated Data Architecture: Developed data processing layers to move information from GCS into curated BigQuery datasets, utilizing BigQuery SQL for transformation to the derived layer for business reporting.
  • AI-Driven SAS to BigQuery Migration Project:
  • AI Transformation: Leveraged Vertex AI and Gemini AI to convert approximately 15,000 lines of complex SAS code into optimized BigQuery SQL.

Lead Data Engineer

Nexus Cognitive Technologies-SBFE
07.2022 - 12.2024
  • Led the data engineering efforts for a major financial data platform (SBFE).
  • Data Ingestion & Reliability: Automated ingestion workflows to manage over 150+ financial data sources, ensuring data integrity and reliability in Azure Data Lake Storage Gen2.
  • Metadata Governance: Designed and implemented a robust metadata framework for 150+ member files, automating preprocessing and standardization routines.
  • Performance Optimization: Developed optimized PySpark ETL pipelines for Delta Lake, loading terabytes of historical data and achieving a 25% reduction in data processing time.

Big Data Architect

Squadron Data: Client -Kellogg's
12.2021 - 06.2022
  • Architected a modern ingestion and reporting pipeline on Azure.
  • Ingestion Architecture: Implemented secure SFTP data extraction using Cloudera NiFi on Azure VM, landing raw data into Azure Data Lake Storage Gen2.
  • Transformation & Quality: Utilized Azure Synapse Analytics Serverless SQL Pools for direct querying, data quality checks, and advanced transformations (deduplication, enrichment).

Big Data Architect

Squadron Data: Client-Blue Cross Blue Shield Association
06.2021 - 12.2021
  • Led the installation and security hardening of a CDP Private Cloud cluster.
  • Cluster Deployment: Installed a CDP Private Cloud base cluster in AWS, configuring core components including Knox, Atlas, Ranger, SOLR, and NiFi.
  • Security & Authentication: Implemented MIT Kerberos authentication and integrated Okta for user authentication, significantly enhancing the cluster's security posture.

Big Data Architect

Capgemini: Client-Manulife - Kitchener, Canada
08.2018 - 06.2021
  • Focused on data integration, automation, and disaster recovery using Apache NiFi.
  • ETL Development: Developed complex NiFi flows to meet specific customer requirements for financial data transformation.
  • CI/CD Automation: Developed RESTful APIs for NiFi deployment automation, supporting Continuous Delivery and Continuous Integration practices.

Big Data Architect (Team Lead)

Capgemini: Client-Standard Chartered Bank – India
03.2017 - 07.2018
  • Led the design and development of a Hadoop-based Data Lake.
  • Team Leadership: Led a team of 5 in implementing data solutions using NiFi (HDF).
  • Performance & Best Practices: Authored Spark jobs in Scala/Java and created a Spark best practices guide to enhance performance and efficiency across jobs.

Senior Data Engineer

Atos: Client-Marriot – India
03.2014 - 02.2017
  • Contributed to the design and operational support of a scalable data platform.
  • Data Streaming: Integrated Apache NiFi with Kafka to enable live data streaming and implemented workflows to ingest data from diverse sources.

Software Engineer

Maleotech Solution Pvt Ltd - Bangalore
02.2013 - 02.2014
  • Developed a critical performance dashboard for a client's IT team.

Commissioning Engineer

Suzlon - Bangalore, India
08.2010 - 11.2012
  • Specialized in the commissioning and maintenance of Wind Turbine Generators (WTG).

Education

Bachelor of Engineering - Electronics and communication

Malnad Collage of Engineering / Vishveshwaraya Technological University
Hassan, Karnataka, India
01.2010

Skills

  • Category: Cloud Platforms
  • Key Technologies & Expertise: Microsoft Azure: Azure Data Lake Storage Gen2 (ADLS Gen2), Azure Synapse Analytics, Azure Data Factory, Azure Functions, Azure SQL Database, Azure Virtual Machines, Azure Active Directory
  • Key Technologies & Expertise: Google Cloud Platform (GCP): BigQuery, Dataflow,Vertex AI, Gemini AI, Google Cloud Storage
  • Category: Big Data & AI
  • Key Technologies & Expertise: Apache Spark: PySpark, Scala, Spark SQL, Spark Streaming, Databricks (Runtime, SQL, Cluster Management), Delta Lake
  • Category: Data Engineering
  • Key Technologies & Expertise: ETL/ELT, Data Pipelines, Data Warehousing, Data Modeling, Data Reconciliation Processes, Data Governance
  • Category: Programming/DevOps
  • Key Technologies & Expertise: Python, Scala, Java, Bash, Shell Script, ARM Templates, Azure DevOps (CI/CD), Bitbucket, REST APIs
  • Category: Open Source
  • Key Technologies & Expertise: Apache NiFi, Hive, HBase, Zookeeper, Atlas, Ranger, Kafka, YARN

Certification

  • AZ 305 - Azure Solutions Architect Expert
  • AZ 104 - Azure Administrator Associate

Languages

English
Full Professional

Timeline

Specialist, AI Data Engineer

Definity
04.2025 - Current

Lead Data Engineer

Nexus Cognitive Technologies-SBFE
07.2022 - 12.2024

Big Data Architect

Squadron Data: Client -Kellogg's
12.2021 - 06.2022

Big Data Architect

Squadron Data: Client-Blue Cross Blue Shield Association
06.2021 - 12.2021

Big Data Architect

Capgemini: Client-Manulife - Kitchener, Canada
08.2018 - 06.2021

Big Data Architect (Team Lead)

Capgemini: Client-Standard Chartered Bank – India
03.2017 - 07.2018

Senior Data Engineer

Atos: Client-Marriot – India
03.2014 - 02.2017

Software Engineer

Maleotech Solution Pvt Ltd - Bangalore
02.2013 - 02.2014

Commissioning Engineer

Suzlon - Bangalore, India
08.2010 - 11.2012

Bachelor of Engineering - Electronics and communication

Malnad Collage of Engineering / Vishveshwaraya Technological University
Naveen Maranayakanahalli Beluraiah