SREE HARSHA
858-***-**** **************@*****.*** www.linkedin.com/in/gudasreeharsha
Summary:
Results-driven Data Engineer with 5+ years of experience in building and optimizing data pipelines using Python, SQL, and Azure technologies. Skilled in transforming complex data into actionable insights, driving business growth, and enhancing operational efficiency. Adept at managing large-scale datasets and leveraging advanced Microsoft technologies to deliver innovative, scalable solutions that empower data-driven decision-making.
Technical Skills:
● Programming Languages: Python, SQL, T-SQL, PL/SQL, PySpark, NoSQL
● Cloud & Data Engineering: Azure Data Factory, Databricks, Azure Synapse Analytics, Azure Functions, Azure Logic Apps, EventHub, SSIS
● Databases: Azure SQL, SQL Server, PostgreSQL, Oracle
● Big Data & Analytics: Apache Spark, Snowflake, Spark SQL
● Data Storage & File Formats: Delta Lake, Azure Data Lake Storage, Parquet, Avro, JSON, ORC
● CI/CD & DevOps: Git, GitHub, Docker, Azure DevOps, Jira
● Data Visualization: Power BI, SSRS
Professional Experience:
Skyway Tech Systems (Client: Profit)
Data Engineer November 2024 – Till Date
● Designed and maintained core analytics datasets and data marts by integrating data from SQL Server, REST APIs, and file-based sources (CSV, JSON, Parquet), enabling reliable reporting and downstream analytics
● Built SQL- and Python-based transformations using Azure Data Factory and Databricks, applying engineering rigor such as modular design, version control, and CI/CD practices
● Implemented Medallion architecture (Bronze, Silver, Gold layers) using Delta Lake on Azure Data Lake Storage, enabling raw ingestion, cleansed transformations, and analytics-ready curated datasets for financial and transactional reporting
● Developed ETL pipelines for ingestion, cleansing, and transformation using SQL, Azure Data Factory, and Databricks. Designed data models for analytics, implemented CDC/SCD for tracking, and optimized storage with Parquet, Snappy, and Gzip in Azure Data Lake and Synapse
● Improved database performance through effective normalization, denormalization, and indexing strategies, reducing average query execution time from 10 seconds to under 2 seconds for reporting workloads
● Communicated complex analytical findings clearly through interactive visualizations on PowerBI, aiding data analytics in strategic planning and operational improvements.
● Ensured high availability and optimized performance for large-scale datasets across advanced database systems, including Azure SQL Database, PostgreSQL, and SQL Server
● Provided operational support to the Master Data Management (MDM) team, ensuring data consistency, accuracy, and adherence to quality standards
● Collaborated with cross-functional teams, leveraging multi-tasking abilities to deliver data-driven solutions. Streamlined processes and supported business objectives through innovative ideas Explorance Inc.
Azure Data Engineer June 2021 – July 2024
● Managed database systems, performed in-depth data analysis and optimized scalable data marts and data lakes to support survey, feedback, and experience analytics used by internal stakeholders and external customers
● Designed and implemented robust ETL pipelines using Azure Data Factory (ADF), Synapse Analytics, and Databricks, incorporating incremental loading and Slowly Changing Dimensions (SCD Type 1 & 2) to ensure efficient, reliable data workflows and historical data accuracy
● Built and enhanced scalable data pipelines to process batch and incremental survey data, supporting reporting use cases such as response trends, participation rates, and longitudinal feedback analysis.
● Assisted in optimizing query performance in Azure Synapse and Delta Lake through schema tuning, partitioning, and indexing, improving dashboard and report response times.
● Migrated on-premises SQL Server databases to Azure SQL Database using Azure Database Migration Service (DMS), ensuring zero downtime
● Implemented and followed data security best practices, including Azure Active Directory (AAD), RBAC, and environment-specific access controls to protect sensitive survey and user data.
● Worked with Azure DevOps and GitHub to support CI/CD pipelines, enabling controlled deployments of data pipelines and analytics artifacts across development, QA, and production environments
● Designed analytics data models using dimensional modeling and star schema principles (fact and dimension tables) to support performant reporting and self-service analytics in Azure Synapse and Power BI
● Collaborated with cross-functional teams, including data scientists and analysts, to design scalable solutions and ensure seamless production deployments
● Documented projects with High-Level and Low-Level Design standards to enhance clarity and collaboration
Spectrum Consultancy (Client: NVIDIA)
Database Support Engineer October 2017 – January 2019
● Streamlined technical issue resolution by troubleshooting and resolving support tickets within an agile framework, ensuring minimal downtime for critical systems
● Resolved database issues by troubleshooting and addressing support tickets, ensuring minimal downtime for critical systems
● Designed partitioned tables and implemented composite and bitmap indexing strategies for performance tuning
● Optimized database performance using normalization, denormalization, and indexing techniques, improving stability and response times.
● Developed and maintained ETL workflows using Oracle Data Integrator (ODI) to ingest, transform, and load data into on-premises data warehouses and analytical data stores.
● Created SQL queries, stored procedures, views, and triggers to support application development and reporting
● Collaborated with developers and analysts to prioritize development tasks and align with business requirements
● Assisted in implementing data engineering best practices to enhance ETL pipeline efficiency and maintain code quality
Education:
●Post Graduate Diploma
Montreal College of Information Technology, Montreal, Quebec
●Bachelor's Degree in Electronics
Jawaharlal Nehru Technological University,Hyderabad, India Certifications:
● Databricks Fundamentals accreditation