Data Engineer Quality

Location:

Seattle, WA, 98194

Posted:

May 19, 2025

Contact this candidate

Resume:

Laya Tigulla

Irving, TX *************@*****.*** +1-940-***-**** linkedin.com/in/linkdinly

PROFESSIONAL SUMMARY

Data Engineer experienced in building and maintaining scalable data pipelines, optimizing ETL workflows, and deploying cloud-native solutions across Azure, GCP, and AWS. Proficient in transforming raw data into structured, analysis-ready datasets using tools such as Apache Spark, Airflow, SQL, and Python. Expertise in designing data lake and warehouse architectures, orchestrating data workflows, automating deployments, and ensuring data quality, security, and governance across enterprise-level systems.

CORE COMPETENCIES

ETL/ELT Pipelines: Airflow, dbt, Informatica, ADF, DataStage, Fivetran, Data Vault

Cloud Platforms: Azure (Data Factory, Synapse, Databricks), GCP (BigQuery, Dataflow, Cloud Storage), AWS (EC2, S3, CloudWatch, Sagemaker, EMR, Lambda)

Programming & Query Languages: Python, SQL(Advanced), TSQL, Scala, Go, Java, Unix and Shell Scripting

Big Data Ecosystem: Apache Spark, Hadoop, Hive, Presto, Flink, Beam, Kafka, HDFS, Oozie, PySpark, PyTorch

DevOps & IaC: Docker, Kubernetes, Jenkins, Terraform, GitHub, Azure DevOps, Agile, Datadog

Data Warehousing: Snowflake, Azure SQL DB, Cloud SQL, BigQuery, NoSQL (MongoDB, HBase), Postgre SQL, Kimball, DynamoDB, Medallion architecture.

Data Governance & Security: GDPR, HIPAA, CCPA, Role-based Access Control

Analytics & BI Tools: Tableau, Power BI, Mode, Pandas, Grafana, Qlik, NumPy, Lucid Chart, Microsoft 365, TensorFlow

PROFESSIONAL EXPERIENCE

Azure Data Engineer Santander Bank, TX Dec 2023 – Present

Designed and deployed scalable Azure Data Factory pipelines to automate ingestion and transformation of structured and semi-structured data from on-premise and cloud sources.

Developed complex SQL scripts, stored procedures, and partitioning strategies to support high-volume analytics in Azure Synapse.

Implemented CI/CD pipelines using Jenkins and Azure DevOps, automating deployments and version control for data workflows.

Enabled secure and compliant data operations with adherence to GDPR and CCPA, enhancing governance across sensitive datasets.

Optimized AKS (Azure Kubernetes Service) deployments for microservices architecture, increasing application uptime to 99.99%.

Collaborated with cross-functional Business Intelligence team to align data models with business requirements and reduce report latency by 35%.

GCP Data Engineer Royal Bank of Scotland India Mar 2021 – Nov 2022

Engineered data solutions using Google BigQuery, Cloud Dataflow, and Pub/Sub to build real-time data processing pipelines.

Orchestrated daily ingestion workflows using Apache Beam and Cloud Composer (Airflow on GCP), reducing manual intervention by 60%.

Built batch and streaming pipelines for financial transaction data, improving data pipeline reliability and throughput.

Integrated Sqoop with Cloud Dataproc to ingest legacy data into Hadoop clusters, reducing processing time by 30%.

Conducted performance tuning on Cloud SQL and BigQuery to support analytical queries over billions of records.

Data Engineer Qualcomm, India Jun 2019 – Feb 2021

Developed scalable ETL jobs using PySpark and HiveSQL, handling multi-terabyte datasets and achieving 99.9% data integrity.

Migrated on-prem pipelines to AWS cloud infrastructure using Terraform, increasing environment provisioning speed by 50%.

Deployed Docker containers orchestrated with Kubernetes, reducing infrastructure costs and improving system scalability.

Implemented monitoring and alerting via ELK Stack, CloudWatch, and Nagios, achieving 99.95% system availability.

Automated data preprocessing and transformations using NumPy and Pandas, improving pipeline run times by 30%.

PROJECTS

Student Mental Health Risk Prediction (Python, Scikit-learn, Pandas)

Developed a predictive ML model using Random Forest with 84% accuracy, analyzing student data to identify mental health risk factors.

Cleaned, engineered, and visualized features using Python, supporting timely academic interventions.

Life Expectancy Analytics Dashboard (Power BI, Excel, DAX)

Built a Power BI dashboard analyzing global life expectancy trends.

Cleaned and transformed raw datasets and applied DAX measures to uncover regional health disparities.

EDUCATION

Master of Science – Business Analytics University of North Texas Jan 2023 – May 2024

Related Coursework: Business Process Analytics, Predictive modelling, Data Mining, Programming Languages for BA

CERTIFICATIONS

Introduction to Python – Penn Engineering (University of Pennsylvania)

Databricks Accredited Generative AI Fundamentals

AWS Certified Solutions Architect – Associate (In Progress)

Contact this candidate