Michael Mallia

Vancouver, BC

Summary

Software Engineer with extensive experience at Data Sentinel, focusing on high-performance data ingestion and machine learning. Proficient in cloud platforms and data governance, delivering solutions that improve data quality and compliance. Skilled in client relationship management, creating innovative architectures for optimized enterprise data management.

Overview

23 years of professional experience
1 Certification

Work History

Software Engineer

Data Sentinel
12.2019 - 02.2025
  • As a hands-on Software Engineer, played a key role in developing the backend processes for an enterprise sensitive data management platform.
  • This software platform leverages deep learning technology to identify, inventory, classify and tag sensitive data; rate the level of sensitivity in financial terms; uncover duplicate data; audit data quality; measure data risk and enable active data governance.
  • Developed high-performance data ingestion processes using Scala/Apache Spark against multiple data sources.
  • Sources included multiple RDBMS, NoSQL databases, cloud drives/applications, CRMs, email and messaging applications.
  • Developed metadata and text extraction processes against multiple document types using Scala/Apache Tika and Tesseract OCR.
  • These included Microsoft Documents, Google Docs and Adobe PDF.
  • Co-developed data classification processes using Python/LLM (NER-BERT).
  • Implemented Swarm/Docker platform on AWS/EC2.
  • Images/Containers included all backend processes, React web, PostgreSQL/OrientDB databases, and Apache Kafka.

Senior Solution Architect

Videotron
08.2019 - 11.2020
  • Senior Solution Architect for the implementation of Prolifics' 'ProLite' Master Data Management (MDM) solution.
  • The InfoSphere Information Server based solution improves data quality, increases data standardization, and de-duplicates address (location) and customer data while standardizing its metadata via a repeatable near-real-time or high-throughput batch process.
  • Architected, planned and implemented the baseline version of the Prolifics ProLite MDM solution.
  • This involved providing technical expertise and direction aligned with best practices, driving the architecture and implementation of a successful MDM solution, and defining necessary processes for project maintenance and life-cycle activities.
  • Designed and developed initial and repetitive ETL processes using DataStage to extract source system data into a consistent format for input to the MDM system.
  • Directed the implementation of data governance tools and processes to enforce data governance and compliance.

Revenue Quebec
07.2018 - 08.2019
  • MDM Solutions - Pilot
  • Utilizing InfoSphere Information Server, constructed processes to demonstrate the integration (ETL), Data Quality and Metadata Management functionalities as part of an IBM MDM pilot solution for Revenue Quebec.
  • Managed the client relationship as the main point of contact for IBM.
  • Designed and developed key processes identified for the pilot to demonstrate how to extract and transform source system data into a consistent format for input to IBM's MDM solution.

Innovapost
05.2019 - 07.2019
  • Created processes to standardize, match and apply survivorship to client data (name and address) for integration into Innovapost's MDM implementation for their Campaign Management Solution.
  • Designed and developed initial ETL processes using DataStage to extract source system data into a consistent format for input to the MDM system.
  • Made recommendations to improve process efficiency and effectiveness through design based on best practices.

Brenntag/DiGiB
10.2018 - 03.2019
  • As part of a global team on the 'DataLayer' project, responsible for the design and implementation of the application's integration framework.
  • This framework included the use of IBM's Cloud Storage (S3) and Db2 Warehouse (formerly dashDB) services loading into Salesforce.
  • Worked with different teams of Data Modelers, DBAs and developers to ensure an aligned approach to development of the infrastructure framework and to ensure business and technical requirements were fulfilled.
  • Designed and developed the on-premise use of InfoSphere DataStage for ETL processing, moving/transforming data from iSeries files into ..
  • Implemented process efficiencies for the use of IBM Information Server software.

Co-founder, Senior Partner and Vice President of the Data Integration Practice

Stream Integration
01.2002 - 02.2019
  • Mr. Mallia was a co-founder, senior partner and Vice President of the Data Integration Practice, covering worldwide Technical Pre-Sales and Service Delivery of IBM Information Server, IBM InfoSphere CDC and IBM Pure Data (Netezza).
  • He has a deep background in software engineering, management and professional services and was actively involved in a Project Management and Technical Architecture consulting capacity.
  • Mr. Mallia heavily contributed to overall technical direction of Stream Integration.
  • In his role as Practice Lead, Mr. Mallia forged a unique niche in the data integration market by providing clients with:
  • Integrated ETL/Metadata/Database/Quality Management/Information Delivery solutions
  • Multi-technology evaluation, recommendation, and deployment
  • An approach based on an interactive methodology and productivity tools
  • A strong mentorship, guidance, and education emphasis
  • A responsive, cost-competitive service model
  • Resource costing both Onshore and Offshore for data integration projects
  • Training in Data Modeling and IBM InfoSphere Server (DataStage, QualityStage, Glossary, Information Analyzer)
  • Developed product extensions for custom functionality on IBM InfoSphere Server toolsets: C++ libraries, XML/XSLT scripts, Java APIs

ETL Architect/Pure Data Architect

Mitsubishi UFJ Trust Bank
05.2017 - 06.2018
  • On a multi-phase project starting in May 2017, provided guidance and developed DataStage jobs to integrate operational data into the Pure Data data store for financial reporting.
  • Mr. Mallia was also responsible for assisting with the data architecture (data model) best suited for data analytics.

Municipal Group of Companies
01.2018 - 03.2018
  • Piloted a cloud-based BI reporting architecture integrated with an on-premise legacy accounting system.

B.C. Lottery Commission
01.2018 - 03.2018
  • Performed the installation, deployment, and configuration of IBM Information Server 11.7.
  • Also provided a plan and automated processes for migrating existing Cognos Data Manager code into DataStage.

ETL/Quality Architect for MDM Solutions

Rexel
10.2017 - 12.2017
  • Utilizing InfoSphere Information Server, architected and built the Data Quality environment for managing metadata, cleansing, matching and survivorship of source Vendor and Product data prior to loading into Rexel's IBM MDM CE application.

Michaels
06.2017 - 09.2017
  • As the technical lead, architected the solution design for the 'Tree Advisor Bot', a Watson Conversation chatbot integrated with both Michaels' Connected Retail Mobile application and the Demandware e-commerce platform.
  • The bot converses with Michaels Stores customers about Christmas trees and also handles conversations about tree skirts and strands of lights.
  • The Tree Advisor Bot is hosted on the IBM Bluemix cloud, utilizing a number of platform services.

Indigenous and Northern Affairs Canada
02.2017 - 03.2017
  • Provided guidance on the implementation of IBM's InfoSphere Information Server within the department's information infrastructure.
  • Developed Proof of Technology with INAC personnel using department metadata demonstrating how it could proceed with Data Governance and Metadata management.
  • Included were reviews of 'Best Practices and Guidelines' for Data Governance.

Technical Lead

Hilton Grand Vacations
01.2016 - 12.2016
  • Technical Lead for the migration of HGVC's Enterprise Data Warehouse, used for marketing analysis, from an MS SQL Server database platform to Pure Data Analytics (Netezza).
  • This included the migration of ETL code developed using InfoSphere Information Server from version 9.1 to 11.5.

Export Development Canada
11.2015 - 06.2016
  • Provided guidance on the implementation of IBM's InfoSphere Information Server within the corporation's enterprise information infrastructure.
  • Installed the full InfoSphere Information Server suite.
  • Provided guidance on Data Governance and Metadata management on numerous projects.

Canadian Blood Services
04.2015 - 10.2015
  • Developed DataStage jobs to integrate operational data into the Blood Services DataMart, tracking the collection, manufacturing and distribution of blood products.

RBS Citizens
03.2015 - 08.2015
  • Provided guidance on the implementation of IBM's InfoSphere Information Server while installing the full InfoSphere Information Server suite onto an IBM BigInsights (Hadoop) platform.
  • Installed InfoSphere Streams as a proof of technology for possible use as a 'stream time' decision processor.

Canadian Food Inspection Agency
07.2013 - 03.2015
  • Architected an autonomous MDM service architecture to integrate with various systems via SOA-based design principles, establishing a baseline and ongoing master data quality capability for CFIA.
  • At the highest level, the context for the MDM comprised the integration of master data from legacy systems (due to sunset, and others) as well as new source systems such as Microsoft Dynamics CRM.

Presbyterian Health Services
04.2017
  • Conducted an Assessment with Health Check for validation of Presbyterian's IBM Information Server environment(s), focusing on the Information Analyzer.
  • Included were reviews of 'Best Practices and Guidelines' for Data Governance.

Carolinas HealthCare System
01.2017
  • Conducted an Assessment with Health Check for validation of CHS IBM Information Server environment(s), assisting in the full installation of the InfoSphere Information Server.

Solution and ETL Architect

Caesars Entertainment
03.2014
  • Designed and implemented a grid-based InfoSphere Information Server architecture integrated with Teradata.

Technical Instructor / Trainer

  • An experienced Technical Instructor, delivering seminars and training courses in North America, Europe and Australia on various topics including Data Warehousing, Modelling a Data Warehouse, and IBM's InfoSphere Information Server product suite including the Netezza appliance (now IIAS or Sailfish).
  • Data Warehousing by Example
  • Data Warehousing - Data Modeling
  • IBM InfoSphere Information Server Suite (DataStage, Information Analyzer, QualityStage, Business Glossary)
  • Netezza (now IIAS)
  • Clients: Ajilon Canada, Capital One Services, Royal Bank Canada, Alberta Learning, Indian and Northern Affairs, Bass Pro Shops, Canada Customs and Revenue Agency, CGI, Compassion, Bell, Actimedia, Galileo, Kemet, Accenture, Ikea, Canadian Food Inspection Agency, Work Safe BC, Export Development Canada, H&R Block, Union Gas, Hilton Grand Vacations, Pensam, Praxair, Innovapost, Department of Social Security (U.S.), Australia Taxation Office, Fifth Third Bank

Education

Bachelor - Computer Science

Acadia University
Wolfville, NS
05.1985

Skills

  • Linux and Windows
  • Cloud platforms (AWS, Azure, GCP)
  • Container orchestration (Docker, Kubernetes)
  • IBM Watson services
  • Data integration tools (Talend, StreamSets)
  • Database management systems (DB2, SQL Server, Oracle, PostgreSQL, MySQL, MongoDB)
  • Big data technologies (Apache Hadoop, Snowflake)
  • File storage solutions (AWS S3, Google Drive, MS OneDrive)
  • Web development frameworks (Angular, Node.js)
  • Programming languages (Java, Python, JavaScript, TypeScript, C, Scala/Spark)
  • Software development methodologies
  • Machine learning techniques
  • Client relationship management
  • ETL and data quality processes
  • Data governance and classification
  • Project and IT management strategies

Certification

  • IBM Certified Solution Developer - InfoSphere DataStage v11.3
  • ELearning Badge - IBM Integrated Analytics System (SailFish) for Data Engineers
  • ELearning Badge - IBM Data Product Hub Technical Sales Intermediate
  • IBM InfoSphere QualityStage Fundamentals Technical Professional
  • IBM PureData for Analytics Sales Professional v1

Publications

  • Oracle 10g Beginners Guide, Contributing Author
  • Oracle Press 2004 (subjects: XML - Oracle implementation using XML DB)
  • Oracle 8i Data Warehousing, Contributing Author
  • Oracle Press 2000 (subjects: Oracle Warehouse Builder and Discoverer 3i)
  • Oracle 8i Beginners Guide, Contributing Author
  • Oracle Press 1999 (subjects: JDeveloper - Oracle's Java tool and Developer 2000)
  • Oracle 8 Data Warehousing, Contributing Author
  • Oracle Press 1998 (subject: Oracle Express)
