GUZAL OZKASIM
TN, USA 1-615-***-**** *****.*******@*****.*** LinkedIn
Accomplished Senior Data Engineer with approximately 8 years of expertise in Data Modeling, Data Engineering, and Business Intelligence. Expert in developing robust Data Integration solutions, driving Cloud Migration projects, and architecting both ER and Dimensional Data Models to enhance OLAP, OLTP, Data Warehouse, and Data Lake infrastructures. Proficient in data platform and BI tools such as SQL Server, Azure Data Factory, Databricks, Azure Synapse, SSIS, and Azure Logic Apps for data integration, reporting, and presentation. Adept at engaging with business users to gather detailed requirements and implementing scalable Big Data solutions to meet complex business needs. Highly skilled in data visualization using platforms such as Power BI, SSRS, and Tableau.
TECHNICAL SKILLS
Scripting: C#, Python, Spark, PowerShell, T-SQL, PL/SQL, DAX, MDX
Databases: SQL Server, MySQL, Oracle PL/SQL, IBM DB2, AWS Redshift, SQLite, MongoDB, Kusto KQL, Azure Synapse, Azure SQL PaaS
Data Engineering: OLTP, OLAP, Data Warehousing, Data Marts, ODS, EDW using Kimball and Inmon methodologies. ETL and ELT dataflows using SSIS, Azure Data Factory, Databricks, Redpoint, Informatica, Talend, AWS Glue, Alteryx, and custom scripts in Python, SQL, and Scala.
Data Visualization: Power BI, Tableau, Cognos, SSRS, Crystal Reports, Domo, and others.
Infrastructure and Source Control: Azure DevOps, CI/CD Pipelines, Terraform configuration, GitHub, Git Bash, Visual Studio, Microsoft TFS
Methodologies: Azure DevOps Agile, Scrum, Waterfall, Kanban
WORK EXPERIENCE
DATA ENGINEER - Kearney NY, NY 07/2023-Present
Collaborated with Business Analysts, Users, and Subject Matter Experts (SMEs) to refine requirements.
Designed and implemented data lakes and warehouses, maintaining strict data governance and ensuring data integrity and quality.
Actively monitored and diagnosed issues in data pipelines, quickly resolving them to reduce downtime and improve data reliability.
Routinely completed assignments ahead of deadlines, achieving client satisfaction through timely deliverables.
Managed Azure Data Factory pipelines and datasets to streamline data integration workflows.
Used Azure Data Factory Monitor to oversee pipeline operations and address issues efficiently.
Leveraged Azure Data Factory, T-SQL, Spark SQL, and U-SQL in Azure Data Lake Analytics for ETL processes from various sources to Azure Data Storage solutions.
Updated existing stored procedures to reflect new business rules and client feedback, ensuring they met the updated requirements.
Executed performance tuning to enhance system efficiency.
Facilitated the transfer of data between Excel and servers using data flows to ensure smooth data exchange.
Resolved data inconsistencies and validation errors.
Successfully migrated CSV files from Azure Blob Storage to Azure SQL Server, optimizing data transitions.
Developed database objects such as tables, views, and indexes, and wrote queries using joins and subqueries, to meet specific needs.
Integrated ADLS with tools like Azure Data Factory and Azure Databricks for effective data transformation and integration.
Created and refined T-SQL scripts, validating their accuracy and functionality.
Utilized Python to develop logistic regression models for predictive analytics purposes.
Conducted data quality assessments and applied data cleansing methods to ensure the precision and trustworthiness of financial data.
Developed conceptual solutions and executed proofs-of-concept to validate the feasibility of proposed technological solutions.
Formulated and implemented strategies for migrating traditional systems to Azure, using methods such as Lift and Shift and Azure Migrate.
Took charge of data warehouse and business intelligence projects, employing Azure Data Factory for implementation.
Created comprehensive data solutions in Azure, covering aspects from storage and integration to processing and visualization.
Designed and deployed database solutions using Azure SQL Data Warehouse and Azure SQL.
Recommended cost-effective Azure architectures to optimize data infrastructure costs.
Maintained various Azure services including Azure SQL Database, Azure Analysis Services, and Azure SQL Data Warehouse.
Implemented Copy activity and custom activities within Azure Data Factory pipelines.
Built data ingestion pipelines using Snowpipe to load transactional data efficiently.
Applied data masking techniques in Snowflake to protect sensitive data.
Managed the transfer of data from traditional database systems to Azure platforms through methodical migration strategies.
Played a key role in data migration projects, utilizing tools like SQL, Azure Storage, Azure Data Factory, SSIS, and PowerShell.
Developed C# applications to facilitate data loading from Azure blob storage to Azure SQL, and automated data importation from web APIs.
Reengineered existing application logic to fit within Azure Data Lake, Data Factory, SQL Database, and SQL Data Warehouse environments.
Specialized in executing DWH/BI projects using Azure Data Factory and Databricks.
Architected, designed, and validated Azure IaaS environments.
Developed and maintained dashboards and visualizations using Microsoft tools like SSRS and Power BI to assist in data analysis and provide insights to management.
DATA DEVELOPER - Starbucks Seattle, WA 07/2019-07/2023
Worked closely with senior executives to capture their requirements and convert these into actionable technical solutions.
Engineered, executed, and launched ETL workflows using Azure Synapse and Azure Data Factory to facilitate data movement from various sources into the data warehouse.
Set up and managed data warehouses through Azure Synapse Analytics, focusing on scalable and efficient data storage and analytics capabilities.
Constructed a data pipeline within Azure Data Factory to import data from on-premises databases into Azure SQL Database.
Applied data governance measures, focusing on data security, access control, and adherence to industry standards and legal regulations.
Produced detailed documentation and analytical reports, providing stakeholders with summarized insights and analyses.
Implemented integration of Azure Active Directory with multiple cloud and on-site applications to streamline identity management and access controls.
Developed interactive dashboards and reports in Power BI, offering stakeholders critical business insights.
Successfully transitioned an on-premises data warehouse to Azure Synapse Analytics, cutting infrastructure expenses by 30% and enhancing query performance by 50%.
Created and deployed a real-time data ingestion system using Azure Event Hubs and Azure Databricks, supporting near-instant analytics for key business processes.
Formulated a data quality strategy using Azure Data Factory and Azure Databricks, cutting data discrepancies by 25% and enhancing accuracy.
Constructed Power BI visualizations including pie charts, treemaps, and matrix reports.
Authored T-SQL stored procedures, views, and Power BI reports to support project management and compliance reporting.
Integrated Role-Based and Row-Level Security in Power BI, boosting security protocols.
Maintained and optimized data warehouse objects, tuning PySpark jobs for faster data processing using Kubernetes clusters, Jenkins, and Git version control.
Transitioned data into RV Data Pipeline using Databricks, Spark SQL, and Python.
Crafted DDL and DML scripts in SQL and HQL for analytical applications in both RDBMS and Hive databases.
Designed shell scripts to parameterize Hive jobs within Oozie workflows and automate task scheduling.
Developed data pipelines for transferring data from on-premises systems to cloud-based solutions using Spark.
Utilized Spark to process raw data, fill staging tables, and store processed data in various formats like JSON, XML, and CSV within the Data Warehouse.
Created visualizations using Tableau, employing line, bar, map, and pie charts; worked with both Tableau data extracts and live connections.
Extracted data using SQL queries and stored procedures from RDBMS databases like MySQL and MS SQL Server.
Composed SQL scripts to move data from operational databases to simple flat text files.
Set up database user access permissions and data-level security measures.
Developed Tableau dashboards and reports for data visualization and analysis, presenting results to business stakeholders.
Designed and implemented Spark jobs using Scala for complete data pipeline execution in batch processing.
Built ETL pipelines in Databricks with notebooks, Spark DataFrames, Spark SQL, and Python scripting.
Authored PySpark code in Databricks notebooks for data extraction from diverse sources and loading into ADLS Gen2.
Parameterized the notebooks to craft a versatile, reusable solution for data entity loading.
BI DATA ANALYST - Wells Fargo NY, NY 07/2017-07/2019
Participated in requirement gathering calls, working closely with Product Analysts and Solution Architects to understand client needs.
Developed high-level technical and application design documents that accurately reflected the client requirements and detailed the design architecture.
Built various pipelines in Azure using Azure Data Factory v2, utilizing activities such as Move & Transform, Copy, Filter, ForEach, and Databricks to integrate data from multiple sources.
Configured tables in Azure Synapse using diverse distribution strategies like Hash, Round Robin, and Replicated, tailored to meet specific ETL needs.
Enhanced query performance in Azure Synapse by creating statistics post-data load.
Applied an ELT loading strategy in Azure Synapse's dedicated pool, using tools like PolyBase, Data Lake, external tables, and stored procedures.
Managed job orchestration in Azure Data Factory using different triggers, including Events, Schedules, and Tumbling windows.
Used Common Table Expressions (CTEs) to define multiple intermediate result sets in a single SQL query within a notebook.
Designed and organized data layers in ADLS, categorizing them into Raw, Refined, and Trusted.
Created SSIS packages in Business Intelligence Development Studio to migrate data from flat files to SQL Server.
Developed and managed ETL processes (using SSIS) for data extraction, cleansing, transformation, and loading into data warehouses.
Forward- and reverse-engineered data models using Erwin, providing comprehensive modeling solutions.
Defined SSRS report layouts with specific parameters and crafted queries for detailed drill-down reports.
Created and administered primary database objects like tables, triggers, and indexes, and developed necessary stored procedures and user-defined functions to support logical designs.
Documented the ETL process design (using SSIS) reflecting business needs and technical specifications, demonstrating advanced design techniques such as source-to-target mappings and transformation processes.
Developed interactive dashboards and visual reports using tools like Power BI, Tableau, and SSRS to enhance business intelligence reporting.
Executed DAX queries and applied functions within Power BI to manipulate and analyze data.
Tailored charts and calculations to meet specific business requirements.
Authored and tested stored procedures, views, functions, and triggers to support BI systems.
Crafted complex SQL queries utilizing aggregate functions, GROUP BY, CTEs, and OLAP concepts to manage and analyze large datasets effectively.
EDUCATION
Master of Engineering, Industrial Technology – Tashkent State University, Uzbekistan 2003-2005
Bachelor of Engineering, Industrial Technology – Tashkent State University, Uzbekistan 1999-2003
KEY ACCOMPLISHMENTS
Reduced query response times by 50% through advanced optimization techniques.
Built scalable data pipelines using Snowflake on Azure to manage and analyze large datasets efficiently.
Decreased infrastructure costs by 30% via cloud migration to Azure Synapse Analytics.
Built automated governance frameworks, cutting data discrepancies by 25%.