Neal Lee Webster Data Engineer - Database Developer - ETL Developer Page 1 5
Neal Lee Webster
Data Engineer / Database Developer / ETL Developer ************@*****.***
Dallas, Texas
Executive Summary
Visa Status / Work Authorization: United States citizen
Technical Proficiencies
Platforms: Microsoft SQL Server (on-premise and cloud), Oracle, Azure, AWS, Snowflake, GCP (cloud platforms)
Languages: SQL, T-SQL, Python, R, C#, CMD, PowerShell, EasyLanguage, JavaScript, HTML, CSS
Databases: Microsoft SQL Server, Snowflake, Teradata, Cassandra, MySQL, Redshift, Aurora, Oracle, Redis, Postgres
ETL Tools: SSIS (SQL Server Integration Services), DBT, Matillion, Fivetran, Databricks, Azure Data Factory (ADF), AWS Glue, AWS Data Pipeline, Informatica, Talend, IBM Data Stage, Pentaho Data Integration, Alteryx, Cloudera, Oozie scheduler, SQL Agent, Airflow, Cron, Hadoop, Spark jobs, Luigi, PySpark, Spark, Change Data Capture (CDC), Slowly Changing Dimensions (SCD type 2, 3, 4 and 6)
Development Tools: Jupyter Notebook, IDLE, SQL Server Management Studio (SSMS), SSRS, Visual Studio, VS Code, RStudio, CLI, Swagger, RedHat Decision Manager, Visio, Excel, Postman, AWS S3, AWS Lambda, WinSCP, Notepad++, WinMerge, API, SVN, TFS, Git, Idera, Redgate
SDLC: Agile, Scrum, Waterfall, Hybrid Project Management
Technical Strengths: Data Engineering, Data Integration, Data Profiling, Data Mapping, Source to Target Mapping, ETL Development, ETL Implementation, Data Warehousing, Database Development, Data Migration, Extract Transform Load, Incremental/Delta Load, Truncate and Load, Hashing and Checksums, Data Analysis, Data Integrity, Data Quality, Data Reconciliation, Change Data Capture (CDC), Slowly Changing Dimensions (SCD type 2, 3, 4 and 6), System Versioned Tables, Temporal Tables, Temporary and Transient Tables, Batch-Processing, Data Cleansing, Data Transformation, Distributed Systems Integration, Data Modeling, Relational Databases, Unstructured Database, Data Science, Machine Learning, Data Classification, Feature Engineering, Hyperparameter Optimization, Decision Trees, Random Forests, Python (Boto3, Matplotlib, NumPy, Pandas, PyODBC, PySpark, Scikit-Learn, Requests, Scipy, XGBoost), Data Governance, Requirements Analysis, Deployment Planning, Release Execution (CI/CD), Production Support, Root Cause Analysis (RCA), Artifact Documentation (BRD, AC, TRD, QA, UAT, PPV), Change Control Management, Version Control, Source Control, Advanced SQL Queries (CTEs, Dynamic SQL, Cross DB Joins, Subqueries, ROW_NUMBER, RANK, NTILE, PARTITION BY, Union vs. Union All, Except/Intersect, Aggregate Functions, Window Functions LEAD/LAG), Business Intelligence (BI), Data Mining, Data Visualization/Dashboards.
Data professional, database developer, and effective communicator with advanced working knowledge of SQL and Python for Data Warehouse, ETL development, data migrations, and data engineering.
Worked with multiple Data Warehouse and ETL tech-stacks on-premise, in the cloud, and hybrid.
10+ years providing production support for databases and ETLs: managing mission-critical enterprise data systems, triaging ETL failures, and performing root cause analysis while simultaneously participating in sprints to enhance existing data integrations and create new data pipelines.
10+ years of hands-on development experience building and deploying data pipelines, ETLs, data integrations, and data warehouses on various DBMSs, sourcing from numerous APIs, ERPs, CRMs, EDIs, flat files, and ODSs.
Implemented solutions for data integrity, data quality, data reconciliations, data governance, and analytics reporting.
Strong ability to lead and facilitate every phase of the SDLC process and produce project artifacts.
Experienced in every phase of the SDLC process, having worked in both Agile and Waterfall environments.
Effectively work with small local teams and large distributed offshore teams.
10+ years eliciting business requirements and writing functional documents, technical documents, and data mappings. Performed impact analysis, dependency analysis, data analysis, release planning, and post-production validation.
Effectively document and communicate current vs. future state, data flow diagrams, and sprint/project status.
Career Summary
Cigna/Express Scripts. Bloomfield, CT (REMOTE) February 2024 – Present (1 Year, 1 Month)
● Sr. Data Engineer - Data Warehouse / Compliance & Performance Analytics
Tredence, Inc. San Jose, California (REMOTE) February 2022 – February 2024 (2 Years, 0 Months)
● Sr. Data Engineer - Data Warehouse / Global Data & Analytics
Ulterra Drilling Technologies Fort Worth, Texas (REMOTE) December 2021 – February 2022 (3 Months)
● Sr. ETL Developer - Data Warehouse / Systems Integration
PennyMac - National Mortgage Acceptance Fort Worth, Texas (REMOTE) March 2014 – December 2021 (7 Years, 10 Months)
● Sr. Data Engineer - BI Systems / Servicing IT
● Sr. Database Developer - Enterprise Data Warehouse / Central IT
● Sr. Database Systems Analyst - Enterprise Data Warehouse / AppDev IT
Guitar Center, Inc. Westlake, California (ONSITE) February 2011 – March 2014 (3 Years, 2 Months)
● Sr. Systems Analyst/Programmer - IT ERP & Control Systems
● Technical Business Analyst - Sales Audit / IT Accounting
Lions Futures Management, Inc. Van Nuys, California (ONSITE) April 2008 – February 2011 (2 Years, 11 Months)
● Programmer Analyst - IT Data Integration / Trade Desk Automation
UBS Wealth Management Oxnard, California (ONSITE) June 2008 – August 2008 (3 Months)
● Intern, Analyst
Career Experience
Sr. Data Engineer - Data Warehouse / Compliance & Performance Analytics February 2024 – Present
Cigna/Express Scripts, Bloomfield, CT (REMOTE) 1 Year, 1 Month
● Created YAML configuration files and packages for transforming data with DBT. Implemented SCD Type 1 and Type 2 using DBT.
● Developed ETL/ELT data pipeline flows to load data from various data sources into the staging database and apply complex business logic to populate normalized and denormalized data structures using DBT.
● Worked on a Snowflake cloud-based project, designing a dynamic ETL solution to load data from on-premise systems to the cloud data warehouse.
● Built pipelines to capture data in Snowflake. Developed ETLs (SSIS packages and SQL Agent jobs) to stage IVR call data from multiple source systems (Genesys, Avaya, Five9, Cisco Systems) and incrementally load it into MS SQL Server for governance, regulatory audits, and performance analytics.
● Performed data modeling (Kimball approach) to conform data from multiple dissimilar systems of record into a consistent data model, and created star schemas (fact and dimension tables) in the new data warehouse for BI reporting across all IVR systems.
● Refactored and enhanced legacy truncate-and-load ETLs into incremental/delta loads, using row-level HASHBYTES across all columns to generate unique keys that identify new, updated, and deleted records where no primary key or timestamp field existed.
● Provided daily production support for legacy ETLs. Performed root cause analysis for failed jobs and improved job runtimes.
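The hash-based delta load mentioned in the bullets above can be sketched roughly as follows. This is a minimal Python illustration under stated assumptions, not the SSIS/T-SQL implementation used on the project: SHA-256 from `hashlib` stands in for SQL Server's HASHBYTES, and the function names (`row_hash`, `diff_by_hash`), the separator, and the sample rows are all hypothetical. Because the hash of every column is the only key, an updated record shows up as one delete plus one insert.

```python
import hashlib

def row_hash(row):
    """Hash every column of a row into one hex key (stand-in for HASHBYTES)."""
    # A separator that is unlikely to occur in the data keeps ("ab", "c")
    # distinct from ("a", "bc"); None is mapped to a marker so it hashes stably.
    joined = "\x1f".join("<NULL>" if v is None else str(v) for v in row)
    return hashlib.sha256(joined.encode("utf-8")).hexdigest()

def diff_by_hash(source_rows, target_rows):
    """Return (rows to insert, hashes to delete) by comparing row hashes."""
    source = {row_hash(r): r for r in source_rows}
    target_hashes = {row_hash(r) for r in target_rows}
    # Rows whose hash is missing from the target are new or changed.
    to_insert = [r for h, r in source.items() if h not in target_hashes]
    # Hashes present only in the target belong to deleted or changed rows.
    to_delete = sorted(target_hashes - set(source))
    return to_insert, to_delete

# Hypothetical sample data: (call_id, agent, duration_seconds)
target = [("c1", "ann", 30), ("c2", "bob", 45), ("c3", "cy", 10)]
source = [("c1", "ann", 30), ("c2", "bob", 50), ("c4", "dee", 20)]

inserts, deletes = diff_by_hash(source, target)
```

In this sketch the unchanged row `c1` is skipped, the changed row `c2` and the new row `c4` are returned for insert, and the hashes of the old `c2` and the removed `c3` are returned for delete. Fully duplicated rows collapse to one hash here, which a real load would need to handle separately.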
Sr. Data Engineer - Data Warehouse / Global Data & Analytics February 2022 – February 2024
Tredence, Inc., San Jose, CA (REMOTE) 2 Years, 0 Months
● Successfully migrated transformations in 150+ SSIS packages to DBT, re-architecting workflows to improve performance and scalability while reducing dependency on legacy tools.
● Refactored 200+ stored procedures into DBT models, optimizing SQL logic and leveraging DBT’s modular structure to improve transparency and maintainability.
● Implemented DBT for end-to-end data transformation, creating custom macros, automating tests, and reducing pipeline runtime by 30%.
● Used Python and SQL to parse large JSON payloads from APIs: flattened and deserialized the deeply nested lists of key-value pairs, inserted the results into the staging database, then cleansed and loaded them into the data warehouse.
● Used SQL and Python to extract data from the data warehouse, transform it, and serialize the tabular data into JSON to POST to external JSON APIs.
● Built data pipelines and workflows to ingest data from external APIs and delta load into the data warehouse.
● Snowflake database development. Created tables to stage data for ETL processes and store ODS and transactional data.
● Developed Python scripts to create custom API connectors that ingest data from external vendors (Capterra, StackAdapt, Rollworks, LeadScale, Xing, Demandbase, Facebook, Google) and parse the JSON API responses to load/insert into Snowflake.
● Created views in Snowflake for data presentation used by Power BI.
● Data Warehouse development using Microsoft SQL Server on RDS and Snowflake for the Consolidated Corporate Support Services (CCSS) program.
● Developed data pipelines with Matillion and Fivetran to load historical and daily delta loads of marketing analytics data.
● Built automated data pipelines to ingest/load data to/from AWS S3 via Python.
● Translated and converted stored procedures, view definitions, and functions from Oracle PL/SQL to Microsoft T-SQL.
Sr. ETL Developer - Data Warehouse / Systems Integration December 2021 – February 2022
Ulterra Drilling Technologies, Fort Worth, Texas (REMOTE) 3 Months
● Worked on multiple projects and provided production support simultaneously.
● Built Databricks ETLs and ELTs for on-premise and cloud MSSQL Server environments.
● Developed and enhanced SSIS packages to increase data throughput.
● Refactored existing Databricks ETLs to run via delta load rather than truncate and load.
● SQL development - created and altered stored procedures, enhanced existing stored procedures, and SQL Agent jobs.
● Data Modeling - created database requirements by analyzing APIs via JSON responses.
● Transferred data from sources such as flat files, XML, and JSON documents to RDBMS and vice versa.
● Developed data visualizations with SSRS, Power BI, and Tableau.
● Used Python for file processing to convert JSON files with nested elements to CSV files.
● Used Python to read data from databases (MS SQL Server and Cassandra) into Pandas DataFrames.
● Compared large files and identified differences.
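As an illustration of the nested-JSON-to-CSV file processing listed above, a minimal standard-library sketch might look like this. The function names (`flatten`, `json_to_csv`), the dotted-key naming scheme, and the sample records are assumptions made for the example and do not come from the original scripts.

```python
import csv
import io
import json

def flatten(obj, prefix=""):
    """Flatten nested dicts/lists into one dict with dotted keys."""
    flat = {}
    if isinstance(obj, dict):
        for key, value in obj.items():
            flat.update(flatten(value, f"{prefix}{key}."))
    elif isinstance(obj, list):
        for index, value in enumerate(obj):
            flat.update(flatten(value, f"{prefix}{index}."))
    else:
        flat[prefix[:-1]] = obj  # drop the trailing dot from the key path
    return flat

def json_to_csv(json_text):
    """Convert a JSON array of objects to CSV text with a unified header."""
    rows = [flatten(record) for record in json.loads(json_text)]
    # Union of all keys in first-seen order, so ragged records still fit.
    fieldnames = list(dict.fromkeys(k for row in rows for k in row))
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)  # missing keys are written as empty cells
    return buffer.getvalue()

# Hypothetical sample input with nested and ragged records.
sample = (
    '[{"id": 1, "well": {"name": "A-1", "depths": [100, 250]}},'
    ' {"id": 2, "well": {"name": "B-7"}}]'
)
csv_text = json_to_csv(sample)
```

Each nested element becomes its own column (for example `well.depths.0`), and records that lack an element get an empty cell in that column rather than failing.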
Sr. Data Engineer - BI Systems / Servicing IT February 2019 – December 2021
PennyMac - Private National Mortgage Acceptance Company, Fort Worth, Texas (REMOTE) 2 Years, 11 Months
● Migrated on-premise MSSQL Server databases to Snowflake.
● Provided critical production support for Snowflake DWH and on-premise data warehouse.
● Implemented bulk and delta load ETLs using SSIS.
● Documented data mapping for Data-Sources to Data-Targets.
● Collected business requirements to build rules into rules engine for automated loss mitigation reporting.
● Developed additional data quality checks via T-SQL and faster file processing via Python for investor reporting ETLs.
● Increased investor reporting rating from “C” to “A” by implementing improved data quality controls and automation.
● Reduced investor reporting run time from 12+ hours to less than 3 minutes by refactoring existing ETLs with T-SQL and Python.
● Developed Python scripts to upload and download data files from AWS S3.
Sr. Database Developer - Enterprise Data Warehouse / Central IT October 2015 – February 2019
PennyMac - Private National Mortgage Acceptance Company, Fort Worth, Texas (REMOTE) 3 Years, 5 Months
● Strengthened production and backend development across multiple reporting and application database servers by producing complex data workflows that adhere to the data management policy quality requirements.
● Spearheaded daily stand-up meetings with onsite and offshore developers, and championed bi-weekly Sprints to collaborate on how to enhance backend processes, and address and resolve any defects or bugs.
● Supported the self-service reporting capability by developing a pilot and supporting the user community in conducting ad hoc analytics projects.
● Worked on development of the self-service capability, collaborating with data stewards to identify user reporting needs, identify the data sources/systems, and integrate them into the analytics platform.
● Defined data connections and packaged data extracts. Responsible for collecting, collating, and triaging issues and problems with data.
● Communicated issues and other relevant information (e.g. root causes) to the staff members able to influence remediation.
Sr. Database Systems Analyst - Enterprise Data Warehouse / AppDev IT March 2014 – October 2015
PennyMac - Private National Mortgage Acceptance Company, Moorpark, California 1 Year, 8 Months
● Facilitated all phases of the SDLC process. Partnered with lines of business (LOB) on business requirement documents, development, deployment, and post-release production validation.
● Lead analyst in the kick-off and implementation of the enterprise-wide Data Governance program.
● Produced complex data workflows that adhere to the data management policy and data governance quality requirements.
● Data Profiling and Data Governance via SQL and Collibra.
● Spearheaded enhancements to backend processes and addressed defects or bugs.
Sr. Systems Analyst/Programmer - IT ERP & Control Systems February 2013 – March 2014
Guitar Center, Inc., Westlake, California 1 Year, 2 Months
● Served as liaison between onsite and offshore inter-department business users and development teams.
● Facilitated the management of completed projects and resolution of any issues and bugs for internal customers throughout the enterprise including Accounting, Treasury, Operations, EDW, POS, Contact and Distribution Centers.
● Collaborated with business stakeholders to obtain effective strategic, tactical, and operational insights of the business requirements to enhance critical financial systems.
● Improved ERP application that supported over 2,500 concurrent users across the entire enterprise from corporate offices to distribution centers located throughout the United States.
Technical Business Analyst - Sales Audit / IT Accounting February 2011 – February 2013
Guitar Center, Inc., Westlake, California 2 Years, 3 Months
● Implemented a balance sheet reconciliation application to strengthen financial controls, developing a semi-automated program that automated manual processing of over $5M in transactions per month and reconciled over $70M in transactions per month.
● Led Accounting and Financial Operations by documenting procedures for accurate reconciliations and efficient management of customer payment issues.
● Created test cases and scripted batch jobs for data conversions, data mapping, error handling, proper logging of issues, and validation of ERP bug fixes and enhancements in sub-ledger and general ledger.
● Facilitated QA and UAT for on-shore and offshore teams.
● Conceptualized the automated processes for mapping and conforming data extraction from internal and external remote services.
● Recipient of the Innovator of the Year award for automating systems to replace repetitive, manual, and error-prone reconciliation / accounting of balance sheets, accounts receivable, and accounts payable.
Programmer Analyst - IT Data Integration / Trade Desk Automation April 2008 – February 2011
Lions Futures Management, Inc., Van Nuys, California 2 Years, 11 Months
● Leveraged internal programs and third-party APIs to integrate high frequency trading applications into the proprietary trading software.
● Salesforce development and process automation.
● Converted trading systems from EL (EasyLanguage) to C# for retail and all non-retail customers.
● Managed offshore developer teams located in the Philippines and India by preparing functional and technical requirements documents.
● Overhauled and maintained cloud-based customer relationship management software and deployed customer specific software installation, maintained file schemas (XML), and identified, documented, and QA’d bug fixes and enhancements.
● Migrated from Microsoft Exchange Server to Google Apps Enterprise.
● Led in-person and online software training sessions for employees and clients.
Analyst, Intern June 2008 – August 2008
UBS Wealth Management, Oxnard, California 3 Months
● Collected and analyzed data for mid-year performance review for approximately 30 financial advisors.
● Worked closely with operations manager, gained understanding of administrative & back office operations for managing fixed income, equity & insurance products for clients.
● Partnered with the branch manager to facilitate a recruiting project for financial advisors.
● Compiled & ranked production statistics for over a dozen financial advisors.
● Created PowerPoint presentations for weekly, monthly, and quarterly sales meetings.
● Created Excel templates for office administrator & branch manager.
● Researched equity and fixed income markets culminating in a one-year investment outlook and potential investment strategies.
Domain Knowledge
Mortgage: Securitization, Hedging, Pricing, Correspondent, Servicing, Default Reporting, Loss Mitigation Reporting, Investor Reporting, Bankruptcy, Foreclosure, Positive Pay, Retail Origination, NMLS Quarterly Reporting
Capital Markets: Equities/Securities, Derivatives, Fixed Income, Futures, Liquidity Analysis, Depth of Market, Order Flow, Order Types, FIX Protocol, HFT, Algorithmic Trading, CTA, CPO, POA Accounts, Discretionary Accounts, Managed Accounts, Block Allocations, Order Execution and Settlement and Auditing, Backtesting, VAR, Leverage, Margin Requirements, CFTC/NFA Registration and Audit
Finance & Accounting: SOX, Balance Sheet Reconciliation, General Ledger Journal Entries, PnL, Capex (Capitalized Expenditures), AP/AR Reconciliation, Tax, COGS, Inventory, Treasury Positive Pay, Unclaimed Property, Aged Accounts, Obsolescence, ERP applications (PeopleSoft, Microsoft Dynamics 365, SAP HANA)
Insurance & Risk Management: Insurance Reporting and Analysis, Vital metrics for decision making, Capitalization and Leverage Measures, Case Management, Casualty, Claims, Exclusions, Insurable Interest, Pecuniary Sense, Loss Ratio, Risk Management, Secondary Market, Underwriting, Unearned Premiums, Reinsurance, Policy life cycle testing, Policy lapse and Reinstatement, Aging-run cycles, Premium due alerts, Valuation of NPV/NAV, Claims triage and assignment, Testing claims life cycle, Claims accounting/reserving, Third party EDI/messaging
Retail & eCommerce: ERP (Enterprise Resource Planning), Inventory Accounting, Sales Audit, Payment Gateways, Merchant Systems, Payment Settlement, Multi-Channel (PO, Contact Center, Distribution Center, Inventory, PPS), Fraud Detection, Procurement Analysis, Reverse Auctions, Three-Way Reconciliations (POS vs Merchant Processor vs Bank Settlement), CRM applications (Salesforce, HubSpot, SAP Sales Cloud)
Data Science & Machine Learning: Data Engineering, Data Integration, ETL, Database Development, Distributed Systems Integration, Enterprise Data Management, Data Modeling, Data Science, Machine Learning, Data Governance, Process Automation and Improvement, Requirements Analysis, Artifact Documentation (BRD, TRD, PPV), QA, UAT, Deployment Planning, Release Execution, Production Support
Education
Bachelor of Science, Information Technology - Western Governors University, Utah (not completed)
Bachelor of Science, Accounting - California Lutheran University, California (not completed)
Certificate, Enterprise Project Management - University of California, Santa Barbara, California - 2011
Associates Degree, Moorpark College, California - 2010
Soft Skills
Communication: Ability to communicate clearly and efficiently to team members and clients, verbally and in writing. Able to present ideas in a variety of ways depending upon audience and context. Excellent active listening skills.
Organizational Skills: Can plan and prioritize work. Follows tasks to their logical conclusion and makes sure that everything has been done to the right standard. Good attention to detail.
Team Work: Able to enthuse and maintain project interest. Comfortable working both individually and as part of a team. Prepared to challenge ideas within a group in a personable and constructive manner.
Quantitative Management: Ability to determine process measures and track to determine process effectiveness and efficiency.
Results oriented: Able to drive things forward regardless of personal interest in the task.
Coursework & Continued Education
Data Science Certificate - Coursera with Johns Hopkins University Online
Algorithms: Design and Analysis - edX with Stanford Online
Statistical Learning with Applications in R - edX with Stanford Online
Series 3 - FINRA/NASD Exam - National Commodities Futures
Series 30 - FINRA/NASD Exam - Futures Branch Manager
Series 31 - FINRA/NASD Exam - Managed Funds Futures
Series 63 - FINRA/NASD Exam - Uniform Securities Agent State Law
Series 65 - FINRA/NASD Exam - Uniform Investment Advisor Law
Security Futures and Arbitrator Training - National Futures Association
Motivation
Solving complex problems.
Using data to drive decision making.
Continuous learning and development.
Collaborating with others.
Making an impact.
Awards & Distinctions
Annual Spotlight – PennyMac
Quarterly Spotlight – PennyMac
Innovator of the Year – Guitar Center