Vaishnavi S
Email: **************@*****.***
Mobile: +1-813-***-****
Data Engineer
PROFESSIONAL SUMMARY:
Over 4 years of experience demonstrating analytical thinking and attention to detail in designing and developing data pipelines, working effectively in agile/scrum teams with minimal supervision. I am a team player.
Expert in problem-solving and innovative thinking, building enterprise-grade data solutions, and proficient in connecting dots across applications to understand the end-to-end view, ensuring team success.
Proficient in ETL and ELT frameworks, with strong communication and presentation skills to both technical and non-technical audiences, and the ability to effectively communicate across the organization.
Designed scalable ingestion pipelines, demonstrating expertise with Microsoft Office suite usage, and willingness to ask questions and reach out for assistance as required, showing attention to detail.
Experienced in implementing data governance practices, with know-how in working in Agile/scrum teams for prioritization of work and resource assignments, and ability to influence and guide the team.
Skilled in optimizing Databricks clusters, identifying priorities, and managing multiple projects simultaneously, demonstrating analytical thinking and problem-solving skills in a team environment.
Integrated data from various sources, demonstrating proficiency with query tools to aid data analysis, including strong PL/SQL skills to write and analyze complex queries and stored procedures.
Delivered robust CI/CD pipelines, showcasing expertise with Oracle Exadata or 10g and above, and working well in a team environment with minimal supervision, ensuring team success.
Built streaming data pipelines, demonstrating good communication and presentation skills, and the ability to effectively communicate across the organization depending on the audience.
Developed automated testing frameworks, demonstrating know-how in effort and financials estimation, and willingness to ask questions and reach out for assistance as required, showing attention to detail.
Created modular Power BI dashboards, demonstrating analytical thinking and attention to detail, and working effectively in agile/scrum teams with minimal supervision, ensuring team success.
Migrated on-prem legacy ETL workflows, demonstrating problem-solving and innovative thinking, and proficiency in connecting dots across applications to understand the end-to-end view.
Collaborated with cross-functional teams, demonstrating strong communication and presentation skills to both technical and non-technical audiences, and the ability to effectively communicate across the organization.
Implemented monitoring solutions, demonstrating expertise with Microsoft Office suite usage, and willingness to ask questions and reach out for assistance as required, showing attention to detail.
Demonstrated working knowledge of Apache Airflow and Terraform, demonstrating know-how in effort and financials estimation, and willingness to ask questions and reach out for assistance as required.
Basic familiarity with GCP services, demonstrating good communication and presentation skills, and the ability to effectively communicate across the organization depending on the audience.
TECHNICAL SKILLS:
Databases - Azure SQL DB, Snowflake, PostgreSQL, BigQuery, Oracle Exadata
Languages - Python, SQL, Scala, Bash, PL/SQL
Tools - Git, VS Code, Terraform, JIRA, ServiceNow, Microsoft Office
Others - Agile, Scrum
PROFESSIONAL EXPERIENCE:
USF Health Apr 2023 – Present
Data Engineer
Responsibilities:
Demonstrated analytical thinking and attention to detail by building scalable ETL pipelines using Azure Data Factory and Databricks to ingest clinical EHR data, ensuring standardization. This improved data accessibility for research and care coordination teams.
Applied problem-solving and innovative thinking skills to design and implement patient-focused dimensional data models in Synapse Analytics, supporting operational dashboards and advanced clinical reporting, enhancing program effectiveness.
Utilized strong PL/SQL skills to analyze complex queries and stored procedures within Synapse views, enabling stakeholders to monitor research metrics and operational KPIs in real time, improving decision-making.
Leveraged expertise with Microsoft Office suite to create presentations and documentation for both technical and non-technical audiences, effectively communicating project status and findings across the organization.
Implemented DevOps practices using Azure Pipelines, showcasing attention to detail in deploying notebooks, SQL artifacts, and configurations into multiple environments with automated testing and rollback mechanisms, ensuring data quality.
Collaborated with HIPAA and compliance teams, demonstrating strong communication skills to implement RBAC, encryption, and data masking strategies, ensuring all pipelines adhere to healthcare data protection standards, mitigating risks.
Facilitated knowledge-sharing sessions and documented best practices for data platform onboarding, ensuring continuity and standardization across engineering and analytics teams, fostering a team player environment.
Conducted in-depth root cause analysis on data discrepancies using lineage tracing, showcasing problem-solving skills and implemented fix-forward logic to correct broken pipelines without disrupting downstream consumption, ensuring data reliability.
Defined alerting thresholds and diagnostic logging using Azure Monitor and Log Analytics, enabling proactive pipeline health monitoring and SLA enforcement, demonstrating attention to detail and analytical thinking.
Supported cloud cost optimization efforts by analyzing Databricks job usage, scaling policies, and storage consumption to adjust cluster configurations and reduce monthly spend, showcasing innovative thinking and problem-solving.
Value Labs Mar 2021 – Dec 2022
Data Engineer
Responsibilities:
Developed scalable ETL pipelines using Azure Data Factory and Databricks, demonstrating analytical thinking to ingest marketing, sales, and CRM data into Azure Data Lake Gen2 for unified analytics and reporting.
Implemented Slowly Changing Dimension (SCD) Type 2 logic and data deduplication in curated zones, ensuring accurate historical tracking of customer engagement across multi-channel platforms, showcasing attention to detail.
Applied PySpark for advanced data transformation, feature engineering, and enrichment tasks in preparation for ML pipelines used in churn and lifetime value prediction models, demonstrating problem-solving skills.
Integrated role-based access controls and dynamic row-level security in Synapse and Power BI, ensuring secure and compliant access to sensitive marketing and customer datasets, showcasing attention to detail.
Coordinated with cloud architects to implement cost-effective compute configurations using spot clusters, autoscaling policies, and job parallelization in Azure Databricks, demonstrating innovative thinking.
Developed automated validation scripts in Python and SQL to reconcile row counts, schema structure, and referential integrity across landing, staging, and gold layers, showcasing strong PL/SQL skills.
Assisted QA teams by integrating dbt and Great Expectations into CI/CD workflows for test automation, data profiling, and issue reporting through Azure DevOps, demonstrating a team player attitude.
Built archival pipelines and retention policies for older datasets based on SLA thresholds, optimizing storage usage and adhering to business lifecycle requirements, showcasing attention to detail.
Achieved 99.9% pipeline uptime by implementing monitoring dashboards with custom metrics, threshold-based alerts, and proactive support playbooks for operational excellence, demonstrating problem-solving skills.
Worked in Agile/scrum teams for prioritization of work and resource assignments, demonstrating the ability to effectively communicate across the organization and manage multiple projects simultaneously.
Certifications:
Databricks Certified Data Engineer Professional - August 2025
Educational Details:
Master of Science in Computer Science - University of South Florida
Bachelor of Technology in Computer Science - Vidya Jyothi Institute of Technology