Sanjana Ganesuni
Cincinnati, OH
*****************@*****.***
Junior Data Engineer | Azure & Databricks | SQL | Python
Professional Summary
Enthusiastic Data Engineer with 1 year of industry experience building cloud-native data pipelines and analytical dashboards. Adept at Azure Synapse Analytics, Databricks (PySpark), and Azure Data Factory, with a solid grounding in SQL optimization, big-data processing, and business-intelligence reporting (Power BI, Tableau). Known for quickly mastering modern data-engineering tools and translating business questions into reliable, scalable data solutions.
Core Technical Skills
Category: Tools / Technologies
Cloud & Big-Data: Azure (ADF, Synapse, Databricks, Data Lake, Blob Storage), AWS (Glue, S3, EC2, Redshift), Hadoop, Spark, Kafka
Languages: Python, SQL, Scala, Java, C/C++, PL/SQL
Databases: SQL Server, MySQL, PostgreSQL, Snowflake, Cassandra, Cosmos DB
ML / Libraries: Pandas, NumPy, Scikit-learn, TensorFlow, Keras, MLlib
BI / Reporting: Power BI, Tableau, SSRS, Power Apps, Power Automate
DevOps & VCS: Git/GitHub, Jenkins, Docker, Kubernetes
OS: Linux, Windows, Unix
Professional Experience
Data Engineer
● Azure Data Factory & Databricks: Designed and deployed five end-to-end pipelines (batch & event-triggered) that ingest ~200 GB/day from on-prem and SaaS sources into Azure Data Lake Gen2, then cleanse and transform with PySpark notebooks.
● Synapse Analytics: Modeled star-schema tables and materialized views, cutting report run times by 45%.
● Power BI: Built interactive dashboards with DAX measures for Ops KPIs; introduced row-level security and automated email alerts via Power Automate.
● CI/CD: Added YAML pipelines in Azure DevOps to unit-test notebooks and automate ARM template deployments.
● Collaboration: Participated in 2-week sprints, peer code reviews, and root-cause analyses for data freshness issues, trimming SLA breaches from 9% to <2%.
Academic Projects (University Capstone, 2023)
Real-Time Product-Sentiment Pipeline
Key Tech: AWS Kinesis, Glue, PySpark, Redshift, Tableau
Highlights: Stream-processed 50k tweets/hr, achieving 1.2 s end-to-end latency and 92% sentiment-classification accuracy.

Healthcare Readmission Dashboard
Key Tech: Azure Synapse, SQL, Power BI
Highlights: ETL'd 1.5 M EMR records; created drill-through visuals that helped clinicians identify top 5 readmission drivers.
Education
Certifications
● Microsoft Certified: Azure Data Engineer Associate (2024)
● Databricks Lakehouse Fundamentals (2024)