Candi Westin
***********@*****.***
https://www.linkedin.com/in/candi-westin-6269069/
Experienced Senior Data Engineer with over 10 years of expertise in designing and implementing scalable data warehouse solutions, optimizing ETL/ELT processes, and migrating complex data environments.
Core Competencies:
●Data Engineering: ETL/ELT, Data Pipeline Development, Data Warehousing, Cloud Migrations
●Cloud Platforms: Google Cloud Platform (GCP), BigQuery, Apache Airflow, Hadoop Ecosystem
●Tools & Technologies: SQL, Python, DataStage, Oracle, Postgres, Apache Kafka, Snowflake
●Big Data: Hadoop, Spark, Scala, Avro, Parquet, Yaml, JSON, Star Schema, Data Integration
●DevOps & CI/CD: Jenkins, Bitbucket, SourceTree, Control-M
●Software Engineering: Modern Software Engineering, Agile, Continuous Improvement
●Agile Methodologies: Scrum, Kanban
●Security: Interim Secret Security Clearance
Professional Experience:
Senior Data Engineer - Remote
Insight Global (Contract for BAE Systems) - Atlanta, GA
December 2024 - March 2025
Participated in a migration of data systems for the Minute Man III ICBM system from Oracle to Postgres.
Developed and optimized database solutions supporting the U.S. Department of Defense (DoD) at Hill Air Force Base.
Senior Data Engineer - Remote
Medasource - Nashville, TN
June 2024 - August 2024
As a short-term contractor for HCA Healthcare, I worked with the marketing team to identify security risks in their data flow processes for both in-house applications and online services. I led the initial stages of a big data integration project, focusing on reviewing, analyzing, and optimizing ETL processes to enhance performance and scalability within the Data Warehouse and BI environment.
Collaborated with senior leadership to establish ETL best practices, designed a roadmap for scalable data management systems, and developed documentation to streamline integration and implementation. Provided strategic guidance on aligning big data integration with industry best practices.
Senior Data Integration Engineer - Remote
Surescripts - Arlington, VA
June 2020 - April 2024
Environment: DataStage 11.7, Oracle Exadata, IntelliJ IDEA, Visual Studio Code, JIRA, Jenkins, Source Tree, Bitbucket, Hadoop, HUE, Control-M, Google Cloud Platform, Google BigQuery, Apache Airflow, Python scripting.
Implemented and optimized ETL (Extract, Transform, Load) data pipelines within DataStage for the Enterprise Data Warehouse, designing new tables and developing jobs to transfer data through staging to the warehouse using star-schema principles. Managed data setup, validation, and test data support, ensuring robust integration and analysis through ETL processes and SQL for Surescripts' products and services.
As part of a big data transition, I gained expertise in Spark/Scala to build modern ELT pipelines for Hadoop, defining schemas with Avro files, transforming data to Parquet, and loading it into the Hadoop ecosystem. I utilized Bitbucket and Jenkins CI/CD for seamless code deployment.
Transitioned to cloud technologies, leading the migration of data and structures from on-premises Oracle and Hadoop systems to Google BigQuery. Using Python and Apache Airflow DAGs, I developed cloud-based processes and adjusted CI/CD workflows to ensure smooth data deployment. I also designed Airflow pipelines to migrate 10 years' worth of data from Oracle to BigQuery, handling billions of records across hundreds of tables in just 6 weeks.
Software Development Engineer - Data Warehouse
Mitchell Martin, Inc. at MetLife, Inc. - Cary, NC
June 2019 - June 2020
Developed and integrated new functionality into the data warehouse, supporting financial, employee, service level agreement, performance, and accountability data for stakeholder analytics. Automated the creation of materialized views in Oracle to aggregate key data sets, ensuring alignment with reporting standards and financial data requirements.
Utilized SQL to analyze and resolve issues with financial data across multiple fact and dimension tables. Managed release cycles, including both scheduled updates and issue resolution, using ServiceNow and adhering to standardized change control processes.
Oversaw daily data warehouse loads, proactively troubleshooting and communicating incidents to stakeholders. Collaborated with both local and offshore ETL teams to define new requirements, scope changes, and establish timelines for production releases.
Applied Agile methodologies through daily stand-ups and periodic ceremonies to ensure project alignment and efficiency.
Senior Lead Data & ETL Developer - Remote
McKesson - AxisPoint Health
November 2008 – May 2019
Environment: DataStage 9.1.2-11.7, Oracle versions 10g/11i, Windows 10, RHEL 7, Star Schema, Change Capture.
Led a team to implement a new data mart for the Care Management application, designing tables and developing ETL jobs to integrate staging, dimension, and fact tables using star-schema principles. Onboarded new clients and built automated processes to support reporting standards for client-facing data extracts.
Created reusable pipelines to load historical eligibility data for new client implementations. Managed DataStage administration, including project and access management, and oversaw the installation and upgrade of DataStage from versions 9.1.2 to 11.5.
Implemented and maintained job execution schedules using DataStage and ProActive. Continuously evaluated and optimized load methodologies, reducing the data mart's weekly load time from over 20 hours to 6 hours, enabling faster troubleshooting, increased maintenance windows, and more efficient analysis.
Launched a new CISCO Mart to support telephony reporting and the CISCO Auto Dialer for automated nurse calling in disease management systems. Designed and implemented new data warehouse subject areas, including a triage system for a nurse advice line product.
Built complex Oracle SQL queries to analyze and test data for new datasets and further improved pipeline performance.
Education:
Art Institute of Colorado - Associate's Degree in Advertising Design
NuCamp Backend, SQL, and DEV/OPS with Python Bootcamp - 3/2025
Modern Software Engineering with DevOps 3/2025
SQL and Data Modeling with Python 1/2025
Data Structure and Algorithms with Python 12/2024
Astronomer Certification for Apache Airflow Fundamentals - 6/2024
Coursera ETL and Data Pipelines with Shell, Airflow, and Kafka - 5/2024
Google Cloud Digital Leader Certification - 04/2024
Snowflake Data Warehouse & Cloud Analytics - 2022
Leading SAFe 4 Agilist - 2021
Hadoop and Big Data Foundations Level 1 badge
IBM Infosphere DataStage Advanced Development for DataStage Server and Parallel