Julia Sharipova Sr Data Engineer
Springfield, VA ************@*****.*** +1-571-****-*** https://www.linkedin.com/in/juliasqlbi/ US Citizen
Senior Data Engineer with over 8 years of experience in designing, developing, and optimizing data solutions across multiple platforms including SQL Server, Azure, and Google Cloud. Expertise in ETL processes, data migration, and data warehousing using tools such as SSIS, ADF, Databricks, and GCP. Proficient in building and managing Power BI and Tableau dashboards, with advanced knowledge of DAX, Power Query, and report performance tuning.
●Expert in advanced database administration, proficient in designing and maintaining database objects such as tables, indexed views, materialized views, stored procedures, triggers, cursors, user-defined data types, functions, and indexes using T-SQL for high-performance data handling.
●Proficient in Azure Data Factory (ADF), using Copy Activity, Data Flow, Mapping Data Flow, and Integration Runtime (IR) to orchestrate complex ETL pipelines, enabling the seamless migration of data from on-premises SQL Server and Oracle environments to cloud platforms like Azure Data Lake Storage and Azure SQL Database.
●Expertise in DataBricks clusters, leveraging Apache Spark for distributed computing, optimizing resource allocation, and ensuring fast execution of batch processing, streaming jobs, and machine learning workloads.
●Automated deployment of machine learning models on DataBricks, utilizing MLflow for model versioning, tracking, and deployment pipelines in collaboration with data scientists to facilitate predictive analytics and large-scale experimentation.
●Experienced in setting up DataBricks job scheduling using Cron and Airflow, automating incremental data loads and recurrent job execution to maintain fresh datasets for business intelligence dashboards.
●In-depth knowledge of Spark SQL and PySpark, leveraging them for large-scale data transformation and advanced analytical queries, enabling faster insights from massive datasets stored across distributed file systems.
●Mastery of AWS Glue, utilizing its built-in ETL capabilities for data extraction, transformation, and loading into Amazon Redshift for scalable, high-performance data warehousing, enabling complex analytical queries on large datasets.
●Lead in implementing DataOps practices, building CI/CD pipelines for data pipeline development, monitoring, and testing using Azure DevOps, ensuring efficient and reliable data delivery.
●Extensive experience in Snowflake architecture, including the design and implementation of multi-cluster, shared-data environments with auto-scaling for dynamic workloads, ensuring cost optimization and performance.
●Optimized data storage and retrieval in Snowflake by leveraging techniques such as clustering keys, materialized views, and result caching, reducing query execution times and enhancing user experience for analytics teams.
●Proficient in implementing role-based access controls (RBAC) and data governance policies within Snowflake, ensuring secure access to sensitive data across different business units.
●Expert in Dimensional Data Modeling, implementing Fact and Dimension tables, SCD Type 1/2 handling, factless fact tables, and bridge tables in star and snowflake schema designs, to optimize data organization and retrieval in OLAP systems.
●Demonstrated expertise in SSRS (SQL Server Reporting Services) and Power BI, developing complex report types such as multi-level drill-through reports, sub-reports, cascading parameters, and paginated reports, enabling interactive reporting and decision-making.
●Expertise in crafting high-performance DAX expressions and measures within Power BI, optimizing report performance through calculated columns, quick measures, and aggregation techniques to ensure seamless user experiences.
●Advanced proficiency in Tableau, building sophisticated visualizations, dashboards, calculated fields, and LOD (Level of Detail) expressions to provide in-depth data insights and performance reporting for stakeholders.
●Skilled in integrating Snowflake with Power BI, utilizing Direct Query, import mode, and Azure AD single sign-on (SSO) for real-time reporting on large datasets.
●Proficient in ETL automation, using tools like SSIS, Azure Data Factory, and Python scripts to manage complex workflows, perform data cleansing, and ensure data lineage tracking for audit and compliance purposes.
●Extensive experience in Azure DevOps pipeline creation, streamlining the data integration and delivery process across cloud environments, improving efficiency through automated deployments and version control.
●Mentorship and leadership experience, guiding junior data engineers and analysts in using best practices for DataBricks, Power BI, and cloud-based data processing platforms, fostering skill development and promoting team success.
●Strong understanding of data governance principles, including GDPR and HIPAA compliance, ensuring secure handling of sensitive data and adherence to legal regulations during data migration and analysis processes.
EXPERIENCE
Sr Data Engineer Apr 2022- Precent
Chags Health Information Technology Columbia, MD
●Played an active role in migrating on-premise SQL Server schemas to Azure SQL Server DB and Azure VM, managing databases across on-premise and cloud environments.
●Imported data from OLTP systems, flat files, and other sources, implementing data flow solutions for data cleansing and validation before inserting or updating them in OLTP or OLAP databases.
●Proficient in SQL Server and T-SQL, handling stored procedures, triggers, nested queries, joins, views, user-defined functions, indexes, and database models, ensuring database consistency with DBCC commands.
●Conducted performance tuning and wrote queries for front-end applications.
●Implemented data cleansing, incremental data loading, logging, event handling, and error handling mechanisms.
●Developed SSIS Packages with various transformations and tasks like Pivot, Fuzzy Lookup, Derived Columns, and Condition Split.
●Configured SSIS packages using XML configuration files, environment variables, registry entries, and SQL Server tables.
●Designed efficient ETL packages using SSIS and Azure Data Factory ADF2 pipelines, managing complex transformations and slowly changing dimension (SCD) scenarios.
●Managed data migrations from on-premise and open-source repositories to modern Azure data services.
●Acquired experience and actively worked with Data Lakes and Business Intelligence tools in Azure.
●Generated operational reports and analytical dashboards on Azure Data Warehouse using Power BI and Azure Analysis Services.
●Regularly assessed and integrated new Azure data services, such as Azure Synapse Analytics and Azure Purview, to enhance scalability, performance, and cost-efficiency in cloud data architectures.
●Formulated secure ODATA API requests to SAP, integrating with SAP OData services for efficient data loading and retrieval while adhering to authentication mechanisms and security protocols.
●Developed complex ETL workflows in Azure Data Factory (ADF) and Azure Databricks, embedding error handling and monitoring capabilities to ensure high data quality, availability, and reliability.
●Leveraged a comprehensive suite of Azure Data Engineering tools, including Azure Blob Storage, Azure SQL Database, Azure Data Lake Storage (ADLS), and Azure Synapse Analytics, for end-to-end data ingestion, processing, and analytics pipelines.
●Utilized PowerShell and C# scripting for automation of data integration tasks, including API request formulation, data transformation, and retrieval, optimizing operational workflows.
●Implemented stringent security measures for OLTP systems on Google Cloud Platform (GCP), employing encryption, fine-grained access controls, and audit logging to ensure the protection of sensitive data.
●Designed and deployed dimensional data models following Kimball methodologies to optimize query performance and scalability for a wide range of business domains and analytical applications.
●Implemented role-based access control (RBAC) and row-level security (RLS) within Power BI and Cognos, ensuring data confidentiality and adherence to regulatory compliance standards.
●Conducted performance tuning and optimization of Cognos reports and SQL queries, leveraging caching strategies and optimizing execution plans to enhance report responsiveness and user experience.
●Provided extensive training on Power BI and Cognos reporting tools, creating user guides, conducting workshops, and empowering users to utilize self-service analytics for data-driven decision-making.
●Utilized Power BI to simplify data analysis by implementing semantic models and loading relevant data from Azure Synapse data warehouse.
●Designed, maintained, and delivered SSRS reports from central databases.
●Created various types of reports using Power BI, incorporating filters, calculations, expressions, and conditional formatting.
●Utilized Power Query to acquire data and developed calculated columns and measures using DAX in Power BI desktop for diverse visualizations.
Business Analyst Jan 2020 – March 2022 Center for Medicaid and Medicare Columbia Maryland
●Lead by interfacing directly with stakeholders to elicit, document, manage, categorize, analyze, and communicate requirements and ensure that they are consistent with ICT Unit standards.
●Conducted JAD sessions to develop architectural solution, business requirements, resolve open issues, and manage the required requirements changes. Lead and facilitated cross-functional teams for technical and non-technical business requirements. Created user stories and process flow diagrams.
●Successfully used Agile Scrum Method for gathering requirements and facilitated user stories workshop. Lead daily Scrum and Sprint Planning meetings to identify the tasks for the sprint and getting team members acceptance/commitment for the assigned tasks.
●Managed Sprint review meetings with the team and stakeholders to review the achievements from the sprint and get approvals.
●Documented the Requirements Management Process improvements to Agile Scrum processing operations. Proof-of-concept effort established a database-centered requirement gathering process, release-based execution, version controls, issues logging and improved traceability and transparency.
●Led to measurable improvements in ability to meet deadlines, increase quality and provide consistent throughput of system development processing.
●Proficient in Lotus Notes/Domino: Experienced in utilizing Lotus Notes/Domino for email communication, database management, and workflow automation, streamlining business processes and enhancing team collaboration.
●Utilized AWS services (e.g., S3, EC2, Lambda) to efficiently store, process, and manage large datasets, ensuring scalability and performance in data pipelines for real-time analytics.
●Led Sprint Planning meetings two weeks prior to start of sprint to define the scope, set goals, address any concerns, and plan out the sprint.
●Participated in Sprint Demos to solicit feedback from Project Stakeholders and capture any additional requirements.
●Played an integral role in Sprint Retrospectives by providing a detailed analysis of what the team did well and how the team may perform better in upcoming sprints.
●Worked closely with QA and DEV team to configure Okta to meet business requirements.
●Delivered a variety of artifacts, including User Stories (JIRA), Business Process Diagrams (MS Visio), Entity Relationship Diagrams (MS Visio), Activity Diagrams (MS Visio), Documentation Archives (SharePoint), User Manuals (MS Teams) throughout the Project life Cycle
●Developed and maintained internal Okta User Manual to educate Admin Staff on managing 1000+ members with Okta Identity Management Infrastructure
SQL ETL SSIS Developer Feb 2017 – Dec 2019 Bank of America New York, NY
●Developed indexing strategies for tables, views, and stored procedures, aligning with business requirements to optimize query performance. Crafted complex T-SQL queries, utilizing advanced operations such as multi-level joins, MERGE, and EXCEPT for efficient data processing.
●Performed query performance tuning on complex stored procedures, views, and user-defined functions (UDFs) by refining index usage and optimizing execution plans to enhance data retrieval efficiency.
●Contributed to the development and support of SSIS ETL solutions, integrating data from various sources, including Flat Files, Excel, SQL Server, and DB2, into central OLTP databases.
●Utilized C# scripting within SSIS Script Tasks to enhance ETL processes, ensuring high performance and maintainability in data integration workflows.
●Led the migration of Tableau dashboards and reports to Power BI, consolidating data from multiple sources, such as Excel, SharePoint Lists, SQL Server, Galaxy System DB2, and SSAS Multidimensional Cubes.
●Designed and maintained Power BI reports featuring complex DAX measures, drill-down capabilities, custom tooltips, and KPIs, implementing Row-Level Security (RLS) and Page-Level Security (PLS) to safeguard data access.
●Established data sourcing strategies for Power BI migrations, ensuring data consistency and standardization by leveraging Tabular Models built on SQL Server Data Warehouse and SSAS Data Marts.
●Responded to ad-hoc reporting requests, delivering custom reports for management with non-standard requirements, utilizing advanced data aggregation and visualization techniques.
EDUCATION
National University of Uzbekistan BS in Communication
National University of Uzbekistan MS in Communication
OS
Windows 2000, Windows Server 2003, Windows server 2008
Databases
SQL Server 2014/2012/2008/2005/2000, PL/SQL, Oracle 11g/8.0i, Microsoft Access 2000/8.0 Teradata14.0, SAS, Oracle (8i/9i/10g), MDS 2012
Languages
C, PL/SQL, TSQL, DB2, MYSQL, DHTML, VB.NET, ASP.NET, .NET Framework, C#, Python
Scripting Languages
HTML, CSS, XML, VB Scripting, JavaScript, React, TypeScript, Nodejs
Web Tools
C#.Net, ASP.Net, VB.Net, Visual Studio.Net, XHTML1.0, JavaScript, VB Script
ETL & Reporting
SSRS 2005/2008/2012/2014, SSIS 2005/2008/2012/2014
Hardware
IIS 6.0/5.0/4.0
Version & Defect Control
Microsoft TFS, MS VSS, Synergy, HP Quality Center & Jira
Others
AWS, MS Office, Erwin, Crystal Reports XI, Visio, Microsoft Business Intelligence Studio, Service Oriented Architecture (SOA) SQL Server Notification Services, SQL Server 2000/2005 Analysis Services (SSAS), Visual web developer, MS FrontPage, Windows Scripting Host, Erwin Data Modeler, MS Visual, SAS/Access, Crystal Reports SAP Business Objects