Cloud Data Engineering Architect

Location: Louisville, KY
Posted: April 23, 2025

Resume:

ASHUTOSH SHUKLA (“Ash”)

********@************.*** +1-502-***-**** https://www.linkedin.com/in/shuklaashutosh

PROFESSIONAL SUMMARY

Expert Database Administrator (DBA) and Data Engineering Architect on the major cloud platforms (AWS and Azure)

Experience spans data on cloud, RDBMS/NoSQL, AI/ML & Data Science

Manages cloud and on-prem databases such as AWS RDS, Oracle, Azure PostgreSQL / MySQL & MongoDB

Well versed in the complete DBA life cycle, including server/SQL performance tuning, issue resolution, backups and upgrades

Data protection, masking and security, Role based access

MySQL Workbench, DBeaver; manages databases using performance views and admin SQL queries

Experienced working with multiple Stakeholders and managing project requirements

Works in an Agile development environment using CI/CD (DevOps), GitHub, Terraform (TF), Confluence & Jira

Certifications: Databricks (DBX) Data Engineering, Azure (Data/AI) AZ-900/DP-900/AI-900

WORK EXPERIENCE

SOLUTION DATA ARCHITECT at COOLSOFT LLC 12/2024 – To Date Hybrid, Louisville, KY

Azure Cloud Database Architect for databases (Oracle, Azure SQL & PostgreSQL) and data architecture projects for US state government programs, viz. Idaho & Delaware

Leads Data & Big Data (Managed Spark) competency for Advanced Analytics and BI data pipelines

Develop and document TPC-DS performance benchmarking for a Data Lakehouse PoC on Apache Spark using various open table/file formats: Delta, Iceberg and Hudi

Tech Stack: Azure, PostgreSQL, Apache Spark, Trino, .NET Core, TPC-DS dataset

DATA ENGINEERING ARCHITECT at UNILODE AVIATION SOLUTIONS 12/2023 - 11/2024 Remote

Build, Enhance and Manage Enterprise Data Warehouse (EDW) on AWS

Transform external data using PySpark, Hive and Delta Lake, bringing speed, scalability & reliability

Build and maintain data processing pipelines performing large-scale transformations across the medallion layers using PySpark and Delta Live Tables (DLT) in Databricks ingestion workflows

Design, develop and maintain real-time / streaming pipelines leveraging Auto Loader, Data Ingestion Framework (DIF) & Fivetran tools, with Databricks Asset Bundles (DAB) for CI/CD

Streaming ingestion using Debezium, Managed Kafka (MSK), Amazon Firehose and AWS Glue from source applications / databases (SQL, MongoDB, PostgreSQL, AWS RDS & mobile apps)

Tech Stack: AWS, Databricks, Azure SQL, MSK Kafka, Debezium, Glue Catalog, MongoDB, PostgreSQL

ENTERPRISE DATA ARCHITECT at XIT SOLUTIONS INC (PWC) 06/2023 – 10/2023 Remote, Bengaluru, KA, India

Validation of data quality rules while ingesting data into the Cloud for Tax (CFT) application

Develop Engineering ETL using PySpark Notebooks, move data from/to Microsoft Dataverse

DQA Framework for CFT data checking KPIs related to Business Rules

Orchestration using Azure Data Factory

Tech Stack: Azure Databricks, Azure SQL, PostgreSQL, MS Dataverse, .NET

DATA ARCHITECT, ENT. DATA PLATFORM at ATRADIUS CREDIT INSURANCE 06/2022 – 05/2023 Amsterdam, The Netherlands

Creating / maintaining enterprise metamodels using ArchiMate / Sparx

Modeling data creating reusable Common Business Objects (CBO)

Azure Solution Architect focused on building a new Azure-native Big Data platform using Databricks EDW / DM with Data Vault 2.0, VaultSpeed & Apache Airflow

Implemented Medallion Architecture using Unity Catalog for superior data quality & governance

Architecture and engineering support for long-term data transformation programs and development of newer data products for the Credit Insurance domain

Tech Stack: Azure, Databricks, Oracle / MSSQL / PostgreSQL, Sparx, Astronomer, Data Vault 2.0, VaultSpeed, Azure Purview

PRINCIPAL TECHNICAL ARCHITECT at TECH MAHINDRA 08/2021 – 06/2022 Bengaluru, KA, India

Lead Azure / AWS Data on Cloud & Databricks competencies, data architecture for D&A

Consulting in the cloud Big Data space, developing solutions that leverage Big Data on Cloud tools/tech

Design and Development of data processing frameworks for data ingestion & Integration

Developing Accelerators/Sprinters to help build Ingestion/Quality frameworks faster

Manage certifications in the Analytics Pre Sales space with partners such as Azure (MS)/Databricks

Tech Stack: Azure Data Services, ADF, Synapse, Databricks, AWS EMR, Informatica, TechM Accelerator Sprinter, UDMS, Pre Sales: RFx process, Defense, Roadmap, Resource Loading, BoM

SOLUTION DATA ARCHITECT at TRELLEBORG 07/2019 – 08/2021 Bengaluru, KA, India

Created a reliable & highly available data platform that lets analysts easily derive insights to power decision making

Identified new customer & product opportunities with fresh data-driven analytics

Big Data Analytics using PySpark (Python / SQL)

Data visualization and data storytelling using Power BI; led a team of BI data engineers

Azure ML classification / anomaly detection models for QA

Tech Stack: Azure, Data Tables, Data Factory, Power BI, Azure Analysis Services OLAP Cubes

SENIOR DATA ARCHITECT at SAP SUCCESSFACTORS 10/2017 - 04/2019 Bengaluru, KA

Responsible for small to large projects, automation, task tracking, process execution & metrics for Enterprise Data Lake workloads; managed data / design for SAP HCM SaaS Cloud

Relational / NoSQL database design for SF Apps workflows, MySQL & SAP HANA

Led the team providing round-the-clock, global DB support

Tech Stack: SAP HANA, Oracle, MS SQL, MySQL, SAP HCM SaaS, Azure Data Lake, SF LMS

BIG DATA TEAM / WAREHOUSE CONSULTANT at AMAZON INDIA 12/2016 - 07/2017 Hyderabad, TS, India

Managed data ingestion ETL pipelines & transformations into Amazon Redshift

Managed multi-node big data Redshift & Oracle clustered DWs

Engineering support to update and maintain the data repository

Data guidance documents, road maps, data access design, policies & data governance for AWS

Tech Stack: AWS Redshift, Oracle RAC, PostgreSQL, AWS DMS, IAM

PRINCIPAL CONSULTANT at ORACLE INDIA 02/2012 - 12/2016 Hyderabad, TS, India

Oracle Database ACS SSE for premium North American customers

Enterprise Oracle Database 12c including RAC / XD, GoldenGate, WebLogic Server & OCI

Evaluated Oracle data warehouse solutions utilizing clusters / parallel RDBMS

CORE team support and preemptive assessments for configuration/performance

Tech Stack: Oracle ACS Core SSE, RAC, Exadata, GoldenGate, Streams, WebLogic, OCI

DATABASE SPECIALIST at IBM INDIA 09/2010 - 02/2012 Pune, MH, India

Enhancing and maintaining performance of databases & applications using Oracle & MS SQL

Assessing data infrastructure of client Application environments

Expert level Support for Mission critical Enterprise data warehouse/databases

Performance tuning DB servers and SQL queries

Tech Stack: Oracle RAC DB 10gR2 on IBM AIX, 2-4 Node RAC, Oracle Business DW, Infosphere

DATA ARCHITECT at EUROSOFT WLL, BAHRAIN 04/2009 - 06/2010 Manama, Bahrain

Re-engineered a legacy defense solution; developed and owned logical & physical data models

Provided system design & solution architecture for the TASMEEM application

Tech Stack: Mainframe VAMS, PL1, Java EE, Oracle Database 10g, Oracle Designer, IMP/EXP

TECH MANAGER at RELIANCE RETAIL LTD, NEW MUMBAI, INDIA Sep 2006 – Mar 2009

Administration & Maintenance of SAP BIW databases using SAP BR*Tools

Supported & managed database installations, upgrades, storage structures and database tools

Performed capacity planning / infra estimation & performance tuning

Tech Stack: Oracle, MS SQL, DB Support, Team Handling

SENIOR CONSULTANT at CAPGEMINI INDIA May 2005 - Aug 2006

The Policy Administration System (PAS) interfaces primarily with the Tax & Administration department, internal departments of UWV and other public-sector entities to deliver critical employee information, and is integral to the reintegration of work processes and to processing employee tax/income information

Performed Logical Modeling & Physical Design for PDM Module of Bank using Oracle Designer

Developed PL/SQL Application Programs, Use Case Support to J2EE Team on Oracle development

Tech Stack: Oracle SQL, PL/SQL, Oracle Designer logical modeling, Application DB Schema Design

SQL DEVELOPER at OFFICE OF STATE REVENUE, NSW, AUSTRALIA Feb 2003 – Apr 2005

MMDS - Prepared SRS & HLD for Development of code between JDA PMM and eMerchant

Office of State Revenue (OSR) - Developed a centralized module for interest calculation, accrual and imposition for return revenue types (Gaming Machine Tax (GMT) & Parking Space Levy (PSL))

Tech Stack: Oracle SQL, PL/SQL, Oracle Designer, Program Units – Procedure / Functions

EDUCATION

INTERNATIONAL INSTITUTE OF INFORMATION TECHNOLOGY (IIIT-B), BENGALURU, KA, INDIA

Post Graduate Diploma: Data Science & Machine Learning (PGDDS) 2019 - 2020

HARCOURT BUTLER TECHNOLOGICAL UNIVERSITY (HBTU), KANPUR, UP, INDIA

Degree: Bachelor of Technology (B.Tech.) in Chemicals 1995 – 1999

CERTIFICATIONS / PROFESSIONAL SKILLS

Enterprise Cloud Solutioning, Data Engineering & Architecture

Databricks Certified: Data Engineering Associate, Data Lakehouse Accreditations

Microsoft Azure Certified: Azure/Data/AI Fundamentals (AZ-900/DP-900/AI-900)

Relational DBs – Azure SQL, MS SQL Server, PostgreSQL, MySQL, Oracle 12c RAC / XD / GG

NoSQL Databases – MongoDB, Cosmos DB

AWS – Amazon S3, Lambda, RDS, Amazon Athena, Redshift, AWS Glue

Scripting Languages - Python, SQL, MS PowerShell, Unix / Linux Shell scripts

Big Data Analytics – Apache Spark, Trino, Kafka, Hive Metastore, HDFS / MR, Hadoop

Cloud Data Warehouses – Databricks, Synapse Analytics, BigQuery, Snowflake

Data / Dimension Modeling – STAR Schema, Data Vault 2, Data Integration & Harmonization

ETL, Azure Data Lake (ADLS), Azure Data Factory (ADF) & Analysis Services (AAS)

ArchiMate 3 Modeling for Enterprise Architecture (ADM), dbt, Apache Airflow

Data (Visualization) storytelling using Power BI, DAX, SQL Query Tuning / Optimizations

Azure DevOps, GitHub, Terraform, Confluence, Jira, ServiceNow etc.

Data Science and ML / Deep Learning Projects – EDA & Model Development

Azure ML Regression, Classification, Clustering Algorithms / Models using the Python SDK

Deep Learning Neural Network - CNN, RNN & Natural Language Processing (NLP)

Data Science – Inferential Statistics & Hypothesis Testing, Probability etc.

Time Series Analysis & Forecasting using Python / Spark ML libraries: Pandas, NumPy, Keras, TensorFlow & PyTorch, Scikit-learn, Matplotlib, Apache Spark MLlib

HOBBIES/INTERESTS

An Omnivorous Reader

Loves Music - Indian classical vocal - Ragas

Likes traveling and exploring new places, people & things

Flair for learning new languages & cultures - Nederlands (Dutch), Español & Deutsch (German)

A continuous Learner of Indian classical languages such as Sanskrit & others


