Sai Shreesh Josyula
Azure Data Engineer
Toronto, Canada +1 xxxxxxx @gmail.com xxxxxxxxxxxxx
Professional Summary
Over Nearly 4 years of experience in Azure Data Engineer, ETL pipeline development, Data flow processes and business intelligence, specializing in developing and optimizing data solutions to enhance organizational efficiency.
Proficient in T-SQL and SQL, utilized extensively to design and implement complex ETL pipelines, perform data transformations, and develop interactive dashboards. Experienced in relational and dimensional data modeling to optimize database performance and reporting efficiency.
Experienced in developing and publishing interactive Power BI reports on SharePoint to monitor project status from Jira, reducing manual reporting efforts and improving team collaboration efficiency. • Skilled in designing Power BI reports to demonstrate the functionality of tracking incremental data changes using Change Data Capture (CDC), reducing data discrepancy issues by 20% and enabling faster decision-making.
Proficient in engineering data flow pipelines in Azure Data Factory to flatten complex XML data structures upon new data insertion, improving data processing speed by 40% and ensuring seamless integration.
Adept at configuring event-based triggers in Azure Data Factory to automate data processing workflows, reducing manual intervention by 60% and enhancing responsiveness to data changes by 35%.
Experienced in developing and optimizing ETL pipelines using Azure Data Factory to seamlessly integrate data from Banner, Azure Synapse, and Blob Storage, resulting in a 25% improvement in data processing efficiency.
Skilled in implementing Change Data Capture (CDC) mechanisms within Azure Data Factory workflows to monitor and process incremental data changes, enabling real-time reporting and analytics, which enhanced decision-making speed by 30%.
Proficient in automating data validation and monitoring processes in Azure Data Factory pipelines, increasing data reliability and supporting large-scale processing with Azure Data Lake and Parquet file formats, leading to a 20% reduction in data inconsistencies.
Experienced in integrating predictive analytics workflows utilizing Azure Data Factory, Power BI, and Cognos Planning Analytics, delivering actionable insights that drove strategic business decisions and contributed to a 15% increase in operational efficiency.
Skilled in designing and implementing scalable ETL pipelines using Azure Data Factory for leading financial institutions, integrating data from on-premises SQL Server and external APIs, reducing data processing time by 35%.
Proficient in leading the ETL data migration of legacy databases to Azure Synapse Analytics, enhancing data accessibility and reducing infrastructure costs by 25%.
Experienced in developing and optimizing data transformation processes with Azure Databricks and PySpark while mentoring junior engineers on best practices in data engineering and ETL processes, improving processing efficiency and supporting advanced analytics.
Adept at establishing data governance protocols, including role-based access control and data masking, ensuring compliance with GDPR and enhancing data security.
Skilled in automating CI/CD pipelines for data workflows using Azure DevOps, incorporating automated testing and monitoring to reduce deployment times by 40% and minimize errors.
Proficient in integrating Azure Data Services with Power BI to create interactive dashboards and reports, enabling real-time business insights and enhancing data-driven decision-making across departments.
Skills
Azure Data Lake
Azure Data Bricks Azure Data Factory
Azure Synapse Analytics
Azure Cosmos DB Azure Blob Storage
Azure Data Explorer
Azure Streaming Analytics Azure HD Insights
Azure Event Hubs
Spark framework Azure SQL Data Base
SQL
Data warehousing Python
Data modeling
Data pipeline design PySpark
Work History
Azure Data Engineer, 09/2024 - Current
Wilfrid Laurier University – Canada
Designed, developed, and maintained scalable ETL pipelines using Azure Data Factory to ingest, transform, and load structured and unstructured data into Azure Data Lake Storage and Azure Synapse Analytics.
Implemented end-to-end data integration solutions across on-premises and cloud sources, improving data availability for analytics and reporting.
Developed and optimized SQL queries and stored procedures for Azure SQL Database and Synapse, enhancing performance and reducing query execution time.
Built real-time data streaming pipelines using Azure Stream Analytics and Event Hub, enabling live dashboards and timely insights.
Utilized Azure Databricks to process large volumes of data using PySpark, implementing machine learning models and advanced data transformations.
Created and maintained data models, data dictionaries, and documentation for data warehouse and data lake architectures.
Led the migration of legacy ETL processes to Azure-based solutions, reducing operational costs and increasing reliability.
Scheduled and monitored ADF pipelines, leveraging Azure Monitor, Log Analytics, and alerts to proactively manage failures and performance issues.
Implemented CI/CD pipelines for data workflows using Azure DevOps, automating deployment and improving development lifecycle efficiency.
Collaborated with data scientists, analysts, and business stakeholders to gather requirements, validate data accuracy, and deliver actionable insights.
Ensured compliance with data governance policies, including encryption at rest and in transit, data masking, and RBAC through Azure Key Vault and Azure AD.
Designed and deployed incremental and full-load strategies for batch and streaming pipelines using Watermarking and Change Data Capture (CDC) techniques.
Conducted unit testing, integration testing, and performance tuning to ensure robust and high-performing data solutions.
Provided technical mentorship to junior data engineers and documented best practices in Azure data engineering.
Cloud Engineer, 07/2021 - 08/2023
Tata Consultancy Service – India
Designed and implemented scalable ETL pipelines using Azure Data Factory and data flow processes for one of the leading financial institutions, integrating data from on-premises SQL Server and external APIs, reducing data processing time by 35%.
Led the migration of legacy databases to Azure Synapse Analytics, enhancing data accessibility and reducing infrastructure costs by 25%. Designed and optimized relational and dimensional data models to improve query performance.
Developed and optimized data transformation processes with Azure Databricks and PySpark, improving processing efficiency and supporting advanced analytics.
Established data governance protocols, including role-based access control and data masking, ensuring compliance with GDPR and enhancing data security.
Implemented CI/CD pipelines in Azure DevOps, cutting deployment times by 40% and reducing errors.
Integrated Azure Services with Power BI to create interactive dashboards and reports, enabling real-time business insights and enhancing data-driven decision-making across departments.
Collaborated with cross-functional teams and guided junior developers in optimizing ETL workflows and SQL queries for performance improvements.
Education
Master of Science
Master's Applied Computing Wilfrid Laurier Universe - Canada
Bachelor of Science
Bachelor's Computer Science Engineering Vellore In - India