Andrew Kocian
*************@*****.***
https://www.linkedin.com/in/andrewk01/
PROFESSIONAL SUMMARY
Data/ETL Engineer with nearly 10 years of experience in healthcare, financial, and marketing data management. Assisted in the overhaul and migration of a legacy SQL Server system to Microsoft Azure, resulting in a 53% reduction in processing time. Transformed a manual, monthly Excel report for executives into an interactive Power BI dashboard with a 24-hour refresh cycle, significantly improving data accessibility and timeliness for executive decision-making. I would consider myself an SQL expert.
SKILLS
SQL (T-SQL, MySQL, PL/SQL, Postgres)
SSIS, SSRS
Python (Pandas), C#
Microsoft: Fabric, Azure Synapse, Databricks, ADLS Gen2 Storage, Azure Data Factory, Power BI
AWS (S3 Storage)
Apache Spark (PySpark, SparkSQL)
REST API integration
EXPERIENCE
Empower Brands – Alpharetta, GA May 2024 - May 2025
Data Engineer/ETL Developer
Supported company initiative to develop cloud-native data warehouse on Microsoft Azure
Developed robust ETL pipelines to ingest and consolidate data from multiple SaaS-based CRM platforms into a cloud data lake (Azure Data Lake Gen2). Automated data ingestion using Azure Data Factory and REST API connectors to ensure reliable daily updates
Utilized PySpark (Python API for Apache Spark) and SQL to transform, clean, and aggregate large datasets, applying indexing logic for incremental loads and partitioned storage to optimize downstream performance.
Supported the implementation of a scalable, multi-tenant Data Vault architecture across separate Azure tenants, modeling brand-level domains to enable historical traceability and cross-tenant integration.
Designed a centralized semantic layer in Azure Synapse’s dedicated SQL pool using external tables and materialized views. Created advanced SQL-based aggregations and metrics to support governed, corporate-wide reporting via Power BI.
Vizient, Inc. 02/2024 Sept 2020 - Mar 2024
Software QE/Data Engineer (Remote)
Supported Pharmaceutical Data Warehouse system hosted in Microsoft Azure
Managed and optimized a cloud-based pharmaceutical data warehouse in Microsoft Azure, enhancing data reliability and scalability.
Built PySpark test automation in Azure Databricks, boosting test coverage and minimizing manual verification.
Developed Azure Data Factory ETL pipelines for seamless data integration between Delta Lake and SQL Server.
Validated datasets with SQL and SparkSQL in Hive Metastore and SSMS, ensuring data integrity and consistency.
Vizient, Inc. – Chicago, Il Dec 2015 - Aug 2020
Data Analyst (Remote)
Data Management and Implementation team
Supported data intake for several healthcare facility consulting engagements
Facilitated monthly encounter file package submissions across 20 clients of CERNER and EPIC data structure
Developed SSIS packages with C#, PowerShell, and VBA to automate ETL processes, reducing manual effort and ensuring accurate SQL Server data loads.
EDUCATION
Metropolitan State University of Denver
2012 - 2017
CERTIFICATIONS
Microsoft Certified: Azure Data Engineer Associate
Microsoft Certified Technology Associate (MTA) – Database Fundamentals