Nounika B
Data Engineer
***********@*****.*** 612-***-**** Bloomington, MN
SUMMARY
Data Engineer with five years of experience designing and maintaining cloud-based data platforms. Demonstrates high customer service orientation by delivering reliable data solutions that meet stakeholder needs. Communicates complex technical concepts clearly in written and oral formats to facilitate cross-functional collaboration. Works collaboratively within team-oriented environments, actively listening to diverse perspectives and fostering inclusive teamwork. Pays meticulous attention to detail when constructing data models, pipelines, and documentation. Applies strong analytical and problem-solving abilities to diagnose data quality issues and implement corrective measures. Balances business acumen with technical expertise to translate reporting requirements into scalable data solutions. Adapts quickly to dynamic work environments with competing priorities, maintaining consistent productivity. Supports safety initiatives by adhering to organizational health and safety policies in all project activities. Engages in continuous learning, seeking feedback and staying current with emerging data engineering technologies. Contributes to process improvement initiatives, refining data engineering practices for efficiency and reliability. Recognizes and celebrates contributions of peers, promoting a positive and collaborative culture. Participates actively in employee stock ownership plan (ESOP) activities and related events. Provides timely and accurate information to enable colleagues and clients to perform their roles effectively. Demonstrates reliable follow-through on commitments, ensuring project deliverables are completed as agreed.
SKILLS
Programming Languages: SQL, Python, PySpark, Spark SQL
Cloud Platforms: Microsoft Azure, Azure Data Factory, Azure Databricks, Azure Synapse Analytics, Azure Data Lake Storage Gen2
Data Engineering: ETL, ELT, Medallion Lakehouse, Delta Lake, Data Ingestion, Data Modeling
Database Systems: Azure SQL Database, Snowflake, Microsoft SQL Server, PostgreSQL, Oracle
DataOps & DevOps: CI/CD, Azure DevOps, Git, Terraform, Infrastructure as Code (IaC)
Analytics & Visualization: Power BI, Tableau, Jupyter Notebooks, DAX, Power Query
Data Governance: RBAC, ACLs, Data Lineage, Data Validation, Compliance (HIPAA)
Architecture & Design: Dimensional Modeling, Data Flow Design, Specification Documentation
Tools & Orchestration: Kafka, Azure Service Bus, Apache Airflow, Azure Monitor
Methodologies: Agile, Scrum, Test-Driven Development, Code Review
EXPERIENCE
Data Engineer PPL May 2024 – Present
•Developed scalable batch and real-time data pipelines using Spark and Kafka to process large payroll datasets.
•Implemented enterprise data ingestion workflows with Azure Data Factory, ADLS Gen2, Azure SQL, and Azure Databricks.
•Designed Medallion Lakehouse architecture using PySpark and Spark SQL to improve data quality and governance.
•Created automated ETL/ELT pipelines integrating Cosmos DB, SQL Server, and APIs into centralized platforms.
•Collaborated with data scientists to prepare feature datasets and operationalize machine learning models.
•Improved Spark job performance by tuning partitions, caching, and enabling cluster autoscaling.
•Implemented data governance frameworks including RBAC, ACLs, and lineage tracking.
•Created near-real-time streaming ingestion pipelines using Azure Service Bus and Spark Streaming.
•Designed data models aligned with business requirements and governance standards.
•Built CI/CD deployment pipelines with Azure DevOps and Terraform for reliable releases.
•Participated in Agile Scrum sprints to translate requirements into technical specifications.
•Supported monitoring and alerting frameworks using Azure Monitor to ensure SLA compliance.
Data Engineer CVS Health Mar 2023 – Apr 2024
•Developed large-scale ETL/ELT pipelines using Azure Databricks, processing terabytes of healthcare data.
•Implemented parameter-driven ingestion frameworks for dynamic pipeline execution.
•Designed scalable pipelines integrating batch and near-real-time data into Snowflake.
•Built ELT pipelines in Snowflake leveraging micro-partitioning and clustering for performance.
•Developed modular transformation workflows with dbt for version-controlled, testable SQL.
•Implemented dbt models, snapshots, and tests to enforce data quality and lineage.
•Designed Medallion Lakehouse architecture using Databricks and Delta Lake with Snowflake integration.
•Engineered metadata-driven ingestion using Azure Data Factory to load APIs and SQL Server data.
•Collaborated with analysts and data scientists to create curated data marts for analytics.
•Optimized complex SQL transformations and dbt pipelines to reduce execution time.
•Implemented data quality validation and anomaly detection frameworks with dbt tests.
•Established CI/CD pipelines using Azure DevOps and Git for automated testing and deployment.
Jr. Data Engineer CDAC Jul 2019 – Feb 2023
India
•Orchestrated data movement from databases, APIs, and flat files using Azure Data Factory.
•Implemented large-scale ETL/ELT pipelines with Azure Databricks, processing terabytes of data.
•Designed data lake and warehouse architectures leveraging ADLS and Azure analytics services.
•Created metadata-driven ingestion frameworks for scalable onboarding of diverse sources.
•Automated workflow orchestration with Apache Airflow to improve pipeline efficiency.
•Built reusable PySpark transformation frameworks to standardize data processing.
•Implemented data validation, reconciliation, and anomaly detection frameworks.
•Designed secure pipelines with encryption, data masking, and access control.
•Developed centralized monitoring, logging, and observability solutions.
•Collaborated with cross-functional teams to design high-availability, fault-tolerant platforms.
•Developed Power BI data models using DAX and Power Query for enterprise reporting.
•Supported machine learning workflows by preparing curated datasets and enabling feature engineering.
EDUCATION
JNTUH, India Master’s in Software Engineering
JNTUH, India Bachelor’s in Computer Science Engineering
CERTIFICATIONS
Azure Data Engineer Associate (DP-203)
Azure Data Fundamentals (DP-900)