Data Engineer Senior

Location:
Goodyear, AZ
Posted:
April 08, 2025

Resume:

Candi Westin

720-***-****

***********@*****.***

https://www.linkedin.com/in/candi-westin-6269069/

Experienced Senior Data Engineer with over 10 years of expertise in designing and implementing scalable data warehouse solutions, optimizing ETL/ELT processes, and migrating complex data environments.

Core Competencies:

●Data Engineering: ETL/ELT, Data Pipeline Development, Data Warehousing, Cloud Migrations

●Cloud Platforms: Google Cloud Platform (GCP), BigQuery, Apache Airflow, Hadoop Ecosystem

●Tools & Technologies: SQL, Python, DataStage, Oracle, Postgres, Apache Kafka, Snowflake

●Big Data: Hadoop, Spark, Scala, Avro, Parquet, YAML, JSON, Star Schema, Data Integration

●DevOps & CI/CD: Jenkins, Bitbucket, SourceTree, Control-M

●Software Engineering: Modern Software Engineering, Agile, Continuous Improvement

●Agile Methodologies: Scrum, Kanban

●Security: Interim Secret Security Clearance

Professional Experience:

Senior Data Engineer - Remote

Insight Global (Contract for BAE Systems) - Atlanta, GA

December 2024 - March 2025

Participated in the migration of data systems supporting the Minuteman III ICBM from Oracle to Postgres.

Developed and optimized database solutions supporting the U.S. Department of Defense (DoD) at Hill Air Force Base.

Senior Data Engineer - Remote

Medasource - Nashville, TN

June 2024 - August 2024

As a short-term contractor for HCA Healthcare, I worked with the marketing team to identify security risks in their data flow processes for both in-house applications and online services. I led the initial stages of a big data integration project, focusing on reviewing, analyzing, and optimizing ETL processes to enhance performance and scalability within the Data Warehouse and BI environment.

Collaborated with senior leadership to establish ETL best practices, designed a roadmap for scalable data management systems, and developed documentation to streamline integration and implementation. Provided strategic guidance on aligning big data integration with industry best practices.

Senior Data Integration Engineer - Remote

Surescripts - Arlington, VA

June 2020 - April 2024

Environment: DataStage 11.7, Oracle Exadata, IntelliJ IDEA, Visual Studio Code, JIRA, Jenkins, SourceTree, Bitbucket, Hadoop, HUE, Control-M, Google Cloud Platform, Google BigQuery, Apache Airflow, Python scripting.

Implemented and optimized ETL (Extract, Transform, Load) data pipelines within DataStage for the Enterprise Data Warehouse, designing new tables and developing jobs to transfer data through staging to the warehouse using star-schema principles. Managed data setup, validation, and test data support, ensuring robust integration and analysis through ETL processes and SQL for Surescripts' products and services.

As part of a big data transition, I gained expertise in Spark/Scala to build modern ELT pipelines for Hadoop, defining schemas with Avro files, transforming data to Parquet, and loading it into the Hadoop ecosystem. I utilized Bitbucket and Jenkins CI/CD for seamless code deployment.

Transitioned to cloud technologies, leading the migration of data and structures from on-premises Oracle and Hadoop systems to Google BigQuery. Using Python and Apache Airflow DAGs, I developed cloud-based processes and adjusted CI/CD workflows to ensure smooth data deployment. I also designed Airflow pipelines to migrate 10 years' worth of data from Oracle to BigQuery, handling billions of records across hundreds of tables in just 6 weeks.

Software Development Engineer - Data Warehouse

Mitchell Martin, Inc. at MetLife, Inc. - Cary, NC

June 2019 - June 2020

Developed and integrated new functionality into the data warehouse, supporting financial, employee, service level agreement, performance, and accountability data for stakeholder analytics. Automated the creation of materialized views in Oracle to aggregate key data sets, ensuring alignment with reporting standards and financial data requirements.

Utilized SQL to analyze and resolve issues with financial data across multiple fact and dimension tables. Managed release cycles, including both scheduled updates and issue resolution, using ServiceNow and adhering to standardized change control processes.

Oversaw daily data warehouse loads, proactively troubleshooting and communicating incidents to stakeholders. Collaborated with both local and offshore ETL teams to define new requirements, scope changes, and establish timelines for production releases.

Applied Agile methodologies through daily stand-ups and periodic ceremonies to ensure project alignment and efficiency.

Senior Lead Data & ETL Developer - Remote

McKesson - AxisPoint Health

November 2008 - May 2019

Environment: DataStage 9.1.2-11.7, Oracle 10g/11g, Windows 10, RHEL 7, Star Schema, Change Capture.

Led a team to implement a new data mart for the Care Management application, designing tables and developing ETL jobs to integrate staging, dimension, and fact tables using star-schema principles. Onboarded new clients and built automated processes to support reporting standards for client-facing data extracts.

Created reusable pipelines to load historical eligibility data for new client implementations. Managed DataStage administration, including project and access management, and oversaw the installation and upgrade of DataStage from version 9.1.2 to 11.5.

Implemented and maintained job execution schedules using DataStage and ProActive. Continuously evaluated and optimized load methodologies, reducing the data mart's weekly load time from over 20 hours to 6 hours, enabling faster troubleshooting, increased maintenance windows, and more efficient analysis.

Launched a new Cisco Mart to support telephony reporting and the Cisco Auto Dialer for automated nurse calling in disease management systems. Designed and implemented new data warehouse subject areas, including a triage system for a nurse advice line product.

Built complex Oracle SQL queries to analyze and test new datasets, and further improved pipeline performance.

Education:

Art Institute of Colorado - Associate's Degree in Advertising Design

NuCamp Backend, SQL, and DevOps with Python Bootcamp - 3/2025

Modern Software Engineering with DevOps - 3/2025

SQL and Data Modeling with Python - 1/2025

Data Structure and Algorithms with Python - 12/2024

Astronomer Certification for Apache Airflow Fundamentals - 6/2024

Coursera ETL and Data Pipelines with Shell, Airflow, and Kafka - 5/2024

Google Cloud Digital Leader Certification - 4/2024

Snowflake Data Warehouse & Cloud Analytics - 2022

Leading SAFe 4 Agilist - 2021

Hadoop and Big Data Foundations Level 1 badge

IBM InfoSphere DataStage Advanced Development for DataStage Server and Parallel


