Mohammad Najmuddin
Director Data Engineering
*******@*****.***
Ashburn, Virginia
linkedin.com/in/mnajmuddin
Summary
A passionate, accomplished, hands-on Technology Leader in Data Technologies with more than 15 years of proven IT experience delivering end-to-end Enterprise Data Engineering, Analytics, Data Science, and AI solutions across multiple business domains
Built high-performance, impactful data teams (across shores/time zones) and mentored them to deliver high-quality enterprise-wide data solutions
Laid the design framework for a single source of truth that lets business executives and stakeholders make insightful decisions
Experienced in Technology Leadership, Multi-Cloud Platforms, Data Engineering, Data Governance, DW, Data Lake, Big Data, Architecture, BI, Data Science, Project Management (Agile Scrum/Kanban)
Subject Matter Expert across multiple Technologies, Architectures, Data Modeling, Dimensional Modeling (Kimball) and Business applications
Successfully implemented Data Lakes, Data Warehouses, ETL Data Pipelines, Business Intelligence and AI/ML Infrastructure across Multi-Cloud Environments.
Designed solutions to build AI/ML bots (Custom LLMs) to automate customer support, self-service reporting (SQL bot), product review analysis, HR doc. analysis (NLP)
Established standards and frameworks to maintain data quality, security, data lineage (source to destination layers), observability across data platforms and metadata
Improved Sales, Operational efficiencies, Marketing spending across channels, Ad Spending by providing quality data and forecasts in multiple organizations leading to increased profitability
Optimized Data Platform processes/applications, improving overall performance by up to 100%
Led efforts to reduce data infrastructure costs by 30+% through optimization and consolidation of cloud resources
Highly collaborative in managing and working with cross-functional teams, product managers, business stakeholders and the C-suite executive team, providing leadership in a matrix environment
A leader who inspires the team through example, communication, continuous learning and collaboration
Skills
Expertise: Data Governance & Security, Data Strategy, Road Map, Team Capitalization, CI/CD, Data Mining, ETL,
ELT, REST APIs, Agile Project Management, Data Science, NLP, Gen. AI/ML, DW, Data Lake, BI, MDM,
Star/Snowflake schema design, OLTP, Dim. Modeling, RDBMS, Columnar DBs, Graph DBs, ERP
Platform & Tools: AWS, Azure, GCP, Kubernetes, Airflow, Hadoop, Glue, Lambda, Google Functions, Azure
Functions, Composer, AWS Airflow (MWAA), Data Flow, Salesforce CRM, Big Data Hadoop, Hive,
Impala, GitLab, GitHub, UNIX/Linux, EC2, AWS CloudWatch, Grafana, Looker, Tableau, Power BI,
Splunk, BO, Google Analytics, Datadog, Opsgenie, NetSuite ERP
Languages: Python, C++, Spark, Perl, Bash, ANSI SQL, dbt, Ab Initio, SSIS, Talend, R, Ruby, Apache Gremlin
Databases: BigQuery, SQL Warehouse, Redshift, PostgreSQL, S3, RDS, GCS, GCP SQL DB, Azure Data Lake Store,
Cosmos DB, Athena, SQL Server, MySQL, Aurora, Netezza, Vertica, NoSQL DB, Snowflake
PROFESSIONAL EXPERIENCE
Unybrands LLC., Miami, FL 2021-Current
Director Data Engineering
Aligned data strategy with company vision to build Unybrands’ enterprise data platform, delivering a single source of truth for business stakeholders and the executive team to make metrics-driven decisions daily
Established the process for building the future roadmap and capacity planning, and led their technical execution
Built Data Engineering, Software Engineering & Analytics Teams at Unybrands, working across shores and time zones
Designed and Architected robust Data Platforms for a growing eCommerce company owning 30+ brands across Amazon FBA, Shopify, Walmart, Target, TikTok marketplaces
Built Enterprise Data Engineering platform on AWS, GCP Cloud platforms to support Enterprise Reporting, Data Analytics and Data Science/AI requirements using Redshift/BQ platform, Cloud Services and Tableau
Led the effort to migrate 100s of TBs of legacy AWS Data Lake and DW platform data to Google Cloud Platform (GCP), improving data availability & overall performance by 50-100% and reducing infrastructure costs by 30+%
Managed teams supporting 1000s of processes/pipelines building 100s of reports (Daily Operations, Weekly Business Review, P&L, Campaign Management, Sponsored Ads etc.) serving CS, Commercial, Marketing (Campaigns/Ads/PPC), Brand Management, Supply Chain, Finance, Product Management Teams to make data-driven decisions
Designed Dimensional Data Model (STAR) to build Daily Business Ops., Weekly Business Review, Campaign & Ad management, P&L reporting across Marketplaces (Amazon FBA, Shopify, Walmart, Target, TikTok, DTC)
Built robust Inventory/Purchase Orders (PO) management reporting and forecasting to help Supply Chain manage Inventory in 3PL/Amazon Warehouses for 30+ brands across multiple marketplaces
Led the effort to build applications doing AI/ML services (Custom LLMs) to automate customer support, Self-service reporting (SQL bot), Product review analysis, HR documentation analysis (NLP processing)
Cofense Inc., Leesburg, VA 2017-2021
Data Science Director
Established Data Engineering and Data Science Practices by building Data teams from scratch
Provided technical leadership and hands-on implementation in developing and maintaining the data platform, Data Governance, Data Quality, ETL processes, new insights, advanced modeling techniques, Data Mining, Visualization, Data Science
Increased sales by 15%-20% (of $200 MM) for Professional Services, SaaS products by automating PS services reporting and providing accurate product health metrics through the data platform
Curated and Centralized data as single source of truth to enable Business Leaders, Professional Services, Finance, HR departments to perform near real-time Data Analytics, BI Reporting (Power BI), Machine Learning and predictive analytics on Azure cloud platform
Architected Data Engineering/Analytics Platform for about a dozen SaaS products (multi-tenant), SFDC CRM, Finance, HR applications in a Cyber Security Organization to deliver metrics/KPIs for Product Health, Customer Health, Feature Health, HR, Legal, Research, Engineering, and external reporting for Professional Services, Marketing, Sales Teams
Designed Data Lake, organized structured/semi-structured data from disparate sources (OLTP Databases, Salesforce CRM, REST APIs, multi-format files etc.) and modeled the data warehouse to create domain-centric Facts, Dimensions, Departmental Data Marts for Data Analytics and BI utilizing services like HDFS, Azure Cloud, SQL Warehouse, Power BI, Splunk
Integrated Product reporting (like Repeat offender, BoD Reporting, Resiliency, Susceptibility Metrics, User Learned behaviors) for Professional Services and Financial reporting (like ARR, MRR, Billings forecast, Quarterly/Weekly Linearity Reporting) for Finance Department
Implemented software engineering best practices in code development, integrated testing, monitoring and code deployment via GitLab, Artifactory following Agile Development methodology
Developed innovative solutions to enhance Cyber Threat Analytics and Operations using NoSQL Graph Databases to identify complex relationships (in Cofense Products) across cyber threats and user interactions
Custom Ink Inc., Fairfax, VA 2014-2017
Data Engineering Lead/Manager
Built Data Engineering team from scratch and provided Technical leadership to the team
Started the practice of building Tech. Roadmap and team capacity planning to align with organization goals
Designed and Architected a new Data Technology platform (Data Warehouse and Operational Data Infrastructure) to deliver a scalable platform providing quality data for enterprise analytics & reporting, so that Executives and Stakeholders in various departments can make informed decisions
Designed Raw Data Lake, Star/Snowflake schemas based on Dimensional Modeling and created summaries, trends at various levels of granularity for BI reporting (Tableau, Metabase etc.), and key insights
Developed ETL applications/Pipelines to load data from disparate sources (RDBMS, Columnar DBs, NoSQL) to AWS Data services (S3/Redshift/Snowflake/EMR)
Led the initiative to train key stakeholders in self-service reporting for their departments and improve efficiency in Data Teams to deliver data products faster
Developed Pipelines to automate Predictive Analytics and Machine Learning to build reporting for Life-Time Value, Funnel Conversion Tranches, Marketing Channel Attribution, CAC, AOV by segments, A/B Testing Analysis, RFM Analysis and forecasting etc.
Increased overall Sales by 15% (of $250MM) due to data platform providing quality data for Product, A/B Testing, Feature, CAC, RFM reporting
Reduced Marketing spending by 30% due to development of accurate Marketing Channel Attribution, CAC, AOV by segments, Lifetime Value reporting through the Data Platform
Mentored Teams & Implemented best practices for Code Development, CI/CD, GitHub, Documentation
AOL Time Warner Inc., Dulles, VA 2003-2014
Tech. Lead, Principal Big Data/Data Warehouse Team
Led Data Warehouse/Big Data infrastructure consisting of 10K+ processes loading 20+ TB of data daily across hundreds of ETL/Hadoop servers holding 300+ TB of data
Worked extensively in Data Profiling and Data Modeling to design DW on the principles of Dimensional Modeling (Star Schema, Facts & Dimensions)
Led the Design and Development of AOL Data Warehouse by integrating Membership, Billing, CRM data using component-based Ab Initio tool, ingesting OLTP data into ODS and building Facts, Datamarts/Summaries on Netezza, Oracle, Vertica platforms
Developed critical Bill/Not Bill and Membership Adjustment reporting applications using Data Platform for Finance and Tax Departments for external FTC, Wall Street reporting
Developed code to load multi-terabytes of data into HDFS daily and applications to produce intermediate summaries for Content Data Warehouse using Hive SQL and Pig scripting
Developed critical ad hoc SQL scripts, stored procedures/packages to deliver data needed for Executive reporting, external auditing, SOX compliance etc.
Built ETL apps for Content Warehouse to create pageview reporting and KPI dashboard metrics to perform analytics for all web products across multiple AOL Time Warner brands
Developed tools in Ab Initio, Perl, and UNIX Shell for automating data migrations, transfer, archiving of DBs and self-help reporting utilities for business stakeholders across Finance, Accounting, Analytics
Led the technology platform migration and consolidation of thousands of ETL processes along with Data Warehouses/Datamarts from Legacy DBs (like Oracle) to Netezza Data server and Vertica DB cluster
Conducted Sarbanes-Oxley (SOX) and other Compliance Audits, Failover analysis and created High Availability/Disaster Recovery plans for Big Data/DW processes/application infrastructure
Standardized code deployment and software Change Control Management processes
Verizon Inc, Ashburn, VA 2001-2003
Developer Consultant
Developed Data Warehouse for Verizon by building data pipelines to load massive data volumes from disparate sources (20+) into the Operational Data Store. Further developed ETL processes to ingest data into multiple data marts, data stores, and MDBS (Multi-Dimensional Database Systems) for BI reporting. DW reporting helped Verizon improve order processing efficiency by 20% and raise overall business performance KPIs
EDUCATION & CERTIFICATIONS
Master's (M.S.), Information Technology (AIU IL, USA)
Bachelor's (B.S.), Computer Sciences & Engineering (OU T.S., India)
Data Science: Bridging Principles and Practice (UC Berkeley, USA)
Data Science/Programming R
Linux Certified Professional LPI