Summary
Overview
Work History
Education
Skills
Languages
Certification
Timeline
BusinessAnalyst
John Wesey Subas

John Wesey Subas

Ottawa,ON

Summary

Cloud and Data Engineering Architect with extensive expertise in big data processing, performance optimization, and modern data processing frameworks. Proficient in building scalable, fault-tolerant systems leveraging Kubernetes, Hadoop, and Spark for distributed data processing and storage. Demonstrated expertise in telecommunications systems involving DOCSIS specifications, network analytics, and horizontal scaling strategies. Adept at resolving complex memory and performance issues in real-time and distributed systems.

Overview

20
20
years of professional experience
3
3
Certification

Work History

Principal Applications Developer

Oracle
08.2022 - Current
  • Optimized slow Apache Spark-based data processing jobs for multiple teams, achieving a 50% reduction in resource utilization.
  • Designed and deployed an end-to-end MLOps framework, improving the Mean Time to Reliable Insights (MTRI) and enabling seamless troubleshooting in production.
  • Established OCI Data Science as the standard for ML workflows across GBUs, enhancing accessibility and standardization.
  • Debugged and resolved performance bottlenecks, such as incompatibilities in CPU instruction sets affecting analytics pipelines.
  • Led a technical excellence forum to set direction for evolving data and ML processing pipelines.

Principal Platform Data Scientist

Oracle
08.2021 - 08.2022

Optimized analytics jobs using vectorization in Pandas and NumPy, reducing query processing time by 50% and addressing memory usage and performance issues.


• Improved ML model performance by addressing ensemble model weaknesses and applying hyperparameter tuning, K-fold cross-validation, and other optimization techniques.


• Developed a Python library to auto-generate HTML model cards for various use cases.

Senior Software Engineer

Guavus
05.2015 - 07.2021

• Backend/ML developer and architect at Guavus with expertise in Kafka, ElasticSearch, Postgres, HBase, Apache Spark, Kubernetes, and Spring Boot.
• Led the EDA of NTT Alarms data, driving the analytics team to create models with 90% accuracy and 80% precision/recall for alarm prediction.
• Spearheaded Scala-based ML data pipelines for AlarmIQ and OPSIQ, enabling network service providers to monitor and predict alarms and user experience degradation, handling 5K+ records per second.
• Developed a Scala-based load testing application using Gatling, integrated with Jenkins, supporting KeyCloak authentication and custom field generation, to be used across teams for testing.

Technical Lead

BlueRose Technologies Private
04.2013 - 04.2015

• Developed a management VM on Citrix Xen hypervisor for centralized control of all VMs in a Netscaler/CloudBridge instance.

Technical Lead

Motorolla Mobility
05.2012 - 04.2013

• As a DOCSIS 3.0 expert and C++ developer, I designed the multicast control plane and led the implementation of the cable modem registration FSM for Motorola CCAPs.

Member Technical Staff, Technical Lead

HCL - Cisco
12.2004 - 04.2012
  • Diagnosed and resolved complex memory-related issues, including memory corruption, segmentation faults, and memory leaks, in Cisco CMTS software.
  • Contributed to the stability and performance of critical networking features by optimizing memory usage and ensuring robust system behavior.
  • Developed and implemented the DOCSIS Multicast QoS feature for Cisco’s MC2020 linecards with efficient control-plane and data-plane implementations.
  • Fixed numerous complex bugs in Cisco CMTS, including customer-reported issues, earning multiple awards for contributions.

Education

Master of Science - Software Systems

B.I.T.S
Pilani
05-2015

Bachelor of Technology - Information Technology

Madras University
Chennai
04-2004

Skills

    Scala, Java, Python, C, C

    Apache Spark, Kafka, HBase, ElasticSearch, Postgres

    Kubernetes / Helm, Docker

    CI/CD, Jenkins

    Network Analytics, DOCSIS, TCP/IP, Netflow, BGP

    Machine Learning, Support Vector Machines, Logistic Regression, Random Forests, K-Means Clustering

    OCI Cloud, AWS

    Numpy, Pandas, Shap, DropWizard, SpringBoot

    Profiling, tuning CPU/memory-intensive workloads, horizontal scaling

Languages

English
Professional Working

Certification

  • AWS Certified Solutions Architect Associate (2024)
  • Oracle Cloud Infrastructure 2024 Generative AI Professional
  • Oracle Cloud Infrastructure 2024 Networking Professional

Timeline

Principal Applications Developer

Oracle
08.2022 - Current

Principal Platform Data Scientist

Oracle
08.2021 - 08.2022

Senior Software Engineer

Guavus
05.2015 - 07.2021

Technical Lead

BlueRose Technologies Private
04.2013 - 04.2015

Technical Lead

Motorolla Mobility
05.2012 - 04.2013

Member Technical Staff, Technical Lead

HCL - Cisco
12.2004 - 04.2012

Master of Science - Software Systems

B.I.T.S

Bachelor of Technology - Information Technology

Madras University
John Wesey Subas