Amir Khan, Sr. Data Engineer
Detroit, Michigan
***********@*******.***
Professional Summary
●Senior Data Engineer and Cloud Platforms Data Warehousing Specialist.
●With over 18 years of experience, I design and implement scalable data solutions that bridge on-premise and cloud environments, transforming complex datasets into actionable insights that drive smarter business decisions.
●Expertise in data engineering, cloud platforms, and business intelligence: building scalable, efficient data ecosystems with robust data pipelines, optimized warehouses, and intuitive analytics that empower stakeholders.
●Optimized enterprise data warehouses and data lakes for enhanced performance, scalability, and seamless data accessibility for analytics and reporting.
●Experienced in building and supporting scalable data warehouses, data lakes, and pipelines using Cloud services and technologies. Proven ability to deliver high-performance analytics solutions leveraging modern cloud data platforms.
●Consistently deliver high-impact analytics and intuitive business intelligence capabilities that give stakeholders critical, real-time information, contributing directly to business value through carefully built, resilient data foundations.
Expertise
●Cloud & Data Platforms: AWS, S3, Redshift, Glue, Athena, Lambda, RDS, DMS, Kinesis, EventBridge, PostgreSQL, DynamoDB, Snowflake (DWH), Enterprise Data Warehouse (EDW), Data Lakes, Cloud ETL/ELT, and Data Pipelines.
●Data Engineering: ETL/ELT Development, Data Pipeline Orchestration, CI/CD Pipelines, Data Ingestion Strategies, SQL Optimization (for Real-time Analytics).
●Data Architecture: Enterprise Data Warehousing (EDW) Design, Data Lake Design, Data Quality Frameworks.
●ETL & Integration: Informatica (IICS, IDMC, PowerCenter), AWS Glue, Azure Data Factory, Talend, Apache Airflow, Apache Spark.
●BI & Data Visualization: Tableau, Power BI, AWS QuickSight.
Professional Experience
Cloud Data Engineer - INNOVATIVE Solutions - AWS Premier Services Partner
Aug 2024 - March 2025
●Designed and delivered interactive dashboards and reports in AWS QuickSight and Tableau for a respiratory healthcare system client and a top US HVAC service provider, presenting KPIs for sales, revenue, and regional performance trends derived from AWS data solutions to support strategic decision-making.
●Engineered and maintained robust data integrations between diverse data sources and enterprise analytics platforms, leveraging AWS services including Redshift, RDS, S3, Athena, Glue, and Parquet file format to support comprehensive, real-time data analysis.
●Contributed significantly to data modeling and ETL development initiatives, improving data preparation workflows, ensuring data consistency, and optimizing data structures for analytical consumption.
●Architected and developed scalable ETL and Data pipelines using AWS Glue and other core AWS data services for efficient data ingestion, transformation, and loading into AWS Redshift, S3, and RDS.
●Executed complex data extraction, transformation, and validation tasks utilizing AWS Glue, Redshift, RDS, and Athena to build resilient and accurate data pipelines.
●Optimized underlying AWS data warehousing solutions and ETL processes (especially within AWS Glue) to significantly enhance efficiency, performance, and usability of data for various business applications, including healthcare and sales/revenue analysis.
●Provided optimized data structures and integrated data sources to facilitate the creation of high-impact dashboards and reports in platforms like AWS QuickSight and Tableau, ensuring data accuracy and accessibility for strategic decision-making.
●Designed and implemented data architectures that facilitated Gen AI integrations (Amazon Q, Chatbot), optimizing data delivery and structure for enhanced data exploration within downstream BI tools.
●Collaborated with BI teams to identify and resolve data performance bottlenecks, ensuring optimal query and rendering speeds for analytical assets.
●Collaborated with data engineering teams to ensure data structures and ETL processes supported efficient BI reporting, consistent data presentation, and accurate insights.
Sr Data Engineer - Volkswagen Group of America, Michigan
March 2015 - April 2024
As a long-standing member of the Data Engineering team at Volkswagen Group of America, I gained extensive experience in designing, developing, and managing end-to-end data pipelines and Enterprise Data Warehouse (EDW) processes. My work spanned the entire data lifecycle, from ingestion to consumption, with a strong focus on EDW, Extract, Transform, Load (ETL), Informatica Intelligent Cloud Services (IICS), Snowflake, and cloud-based data solutions, including Business Intelligence and Data Operations.
●Designed, developed, and maintained robust Enterprise Data Warehouses (EDW) and Data Lakes, ensuring data consistency, quality, and accessibility for analytical purposes across large-scale automotive datasets within Volkswagen Group of America (manufacturing, vehicle, dealer, customer).
●Implemented end-to-end ETL and data pipelines from diverse source systems (Vehicle Parts, Dealers, Warranty claims) to Enterprise Data Warehouse and Data Lake environments using Informatica PowerCenter and IICS for Volkswagen Group of America.
●Developed and optimized complex ETL processes with Informatica PowerCenter 10.4.1 and IICS for high-volume automotive vehicle data integration across on-premise and cloud environments at Volkswagen Group of America.
●Designed and implemented scalable, resilient cloud-based data solutions using a wide range of AWS services (Redshift, S3, Glue, Athena, Lambda, RDS, EC2, EventBridge) together with PostgreSQL and Snowflake to meet the unique demands of the automotive sector.
●Orchestrated data ingestion and transformation into the Data Lake, managing diverse data formats (structured, semi-structured, unstructured) to support advanced analytics and machine learning initiatives.
●Developed complex SQL and ETL scripts for data validation, transformation, and extraction across Oracle, Snowflake, AWS Redshift, and SQL Server databases.
●Designed and maintained data quality frameworks and validation rules within ETL processes specifically for Data Lake environments, ensuring high data integrity and reliability for downstream consumption by Volkswagen business units and customers.
●Utilized Agile/Scrum methodologies (JIRA, Confluence) and contributed to CI/CD pipeline development, automating infrastructure provisioning and data pipeline deployments with AWS SDK, AWS CLI, Python Boto3, CloudFormation, and Terraform for rapid iteration.
●Led production support and day-to-day data operations, using strong Linux command-line skills to proactively monitor and resolve issues and to keep the Data Warehouse, ETL, and batch jobs highly available and reliable for real-time automotive insights and operational efficiency at Volkswagen.
Data Warehouse Consultant - Blue Cross Blue Shield of Michigan
December 2012 - February 2015
●Led Enterprise Data Warehouse (EDW) projects, both inbound and outbound, from design through implementation. Provided development estimates covering design through post-implementation to the Delivery Lead and Project Managers. Worked effectively across multiple projects and the ad-hoc needs of business users. Worked with business stakeholders to gather report requirements.
●Gathered business specifications and prepared technical specifications in close collaboration with business users. Led the development team and coordinated with data architects, data modelers, database administrators, and testing teams to ensure development work was completed on time.
●Managed project scope, set daily priorities, and ensured on-time delivery of project tasks and milestones by following proper escalation paths. Communicated project risks, issues, and weekly progress reports to the Delivery Lead and other business stakeholders.
●Developed Functional Design Documents (FDD) and Technical Design Documents (TDD), and reviewed the design, functional, and technical specifications with the ETL and report development teams. Designed complex mappings and code. Developed PL/SQL stored procedures, packages, and functions.
●Implemented PL/SQL code in ETL mappings and transformations. Performed data analysis by executing SQL queries using Cognos Analysis Studio and Query Studio, TOAD/SQL Developer, and Teradata SQL Assistant. Designed and developed a high-level Informatica ETL flow diagram.
●Developed complex ETL code and source-to-target mappings using Informatica PowerCenter 9.5, and performed unit testing. Added and set up UI configuration tables within the custom mappings and workflows, and completed end-to-end EDW/ETL testing in the QA1 and QA2 environments.
●Developed Informatica performance tuning, naming standards, and project guideline documentation. Developed a UNIX .ksh script to merge multiple files within the ETL process. Designed and developed an automated ETL process for monthly and weekly data feeds for BCBSM clients. Developed test cases for EDW/ETL testing and performed SIT and UAT testing.
●Developed IBM Tivoli job streams and flow process documentation. Developed and tuned complex SQL queries in the Source Qualifier. Created Lookup and Joiner transformations.
●Designed and developed multi-target mappings using the Router transformation.
Project Environment:
Oracle 11g, Informatica 9.5 (ETL), DWH, Informatica Data Profiling, Oracle SQL Developer, SQL*Plus, UI Configuration/IBM Tivoli, MS Visio 2010, Unix, WinSCP, PuTTY (FTP), Windows 8
Business Intelligence Application Lead - General Electric (GE Capital Americas), Michigan
February 2010 - December 2012
●One of the lead contributors to the Business Intelligence Center of Excellence (BI COE) at GE Capital Americas (GECA). Helped the BI COE team roll out OBIEE 11g (Analytics) for the first time across all of GE, laying a foundation for future success. Contributed strongly in all phases and areas of expertise, including SDLC for BI, OBIEE 11g project development architecture, OBI best practices, Informatica ETL best practices, QA, production support, and product evaluations. Regularly collaborated with and presented to customers, executives, and senior leaders.
●As BI COE and BI Application team lead, delivered interactive dashboard analytics and reporting solutions to end users using OBI and IBM Cognos, including analysis, design, gathering requirements from business users, and building proofs of concept (POC) to get designs and requirements approved.
●As BI lead, I performed and led UAT BI testing in all releases and project development phases in the staging environment (QA).
●Built the Informatica ETL mapping design process, Informatica ETL standards, and metadata documentation.
●Used Informatica 8.6.1 Mapping Designer to create and update mappings with the transformations required to move data into the data warehouse, and Informatica Workflow Manager to create sessions and batches that run the logic embedded in the mappings. Created full-load and incremental-load workflows.
●Handled the OBIEE 11g and IBM Cognos dashboard development process from installation through building each report and dashboard page, including OBIEE 11g dashboard and report customization, complex reports with BI Publisher integration, configuration, RPD development, and performance tuning across OBIEE 11g and BI Server components.
●Performed RPD development in the Physical, Logical (BMM), and Presentation layers, including setting up a Multi-User Development (MUD) environment and OBIEE 11g usage tracking. Implemented OBIEE 11g end-to-end performance tuning best practices.
●Handled OBIEE 11g administration tasks: BI Server and related configuration XML files, user- and data-level security on dashboards and reports, web catalog migration, RPD merge and development, web catalog troubleshooting, user management, BI Server and opmnctl services, and critical BI Server issues in Oracle 11g Enterprise Manager Fusion Middleware (EM) and the Oracle WebLogic Server 11g Admin Console.
●Developed geospatial visualization integrations using OBIEE 11g MapViewer.
Environment:
OBIEE 11g 1.1.6, IBM Cognos, Oracle 11g DB, Informatica 8.6, Teradata DB, Oracle WebLogic Server 11g, Oracle 11g Enterprise Manager Fusion Middleware, SAP BusinessObjects (BO) 3.0, Oracle SQL Developer, SQL*Plus, BI Composer, SSO Integrations, jQuery, Unix, Windows 7, OneNote, SnagIt 10.
BI Developer - Johns Hopkins University Applied Physics Lab (JHU APL), Maryland
February 2010 - November 2010
Project: EBSS
●The project aimed to establish a next-generation business intelligence/data warehousing (BI/DW) platform for the Applied Physics Lab (APL) business systems. The platform delivers ad-hoc query and reporting capabilities in OBIEE that extend OBIEE 10g, OBIA, and the Oracle E-Business Suite, Financials, and Procurement modules. Responsible for the end-to-end design of the extract, transform, and load (ETL) process that prepares data for the data warehouse.
●Responsible for establishing project design documentation and project standards. Developed OBIEE 10g/Siebel Analytics dashboards and the Admin Tool repository (RPD), configured custom dashboards, wrote metadata definition and mapping design documents, built DFFs, and documented the out-of-box mapping customization method.
●Designed source-to-target mappings using Informatica PowerCenter 8.6.1 and performed unit and regression testing. Responsible for customizing the out-of-box SDE/SILOS ETL mappings. Customized prebuilt ETL mappings and added new columns to the fact and dimension tables. Created new DFFs in the data warehouse facts to capture additional conversion data.
●Contributed to the design and development of the new JHU APL Business Analytics Warehouse.
●Created custom reusable Informatica PowerCenter 8.6.1 transformations, such as Expressions and Lookups, for faster development. Performed OBIEE 10g report and RPD development, and created new tables and custom flexfield X_column(s) in the Oracle Data Warehouse using Oracle SQL Developer. Created new custom workflows and sessions where needed to load the out-of-box SDE/SIL data. Managed SCD (Slowly Changing Dimension) mappings where the Lookup is affected and custom DFFs need to be part of the SCD. Created a new custom Lookup transformation where a DFF (flexfield) based lookup is needed.
Environment:
Informatica 8.6.1, Oracle E-Business Suite, Financials, Procurement, Oracle Business Analytics Warehouse (OBAW), OBIEE 10g, Oracle SQL Developer, MS Access, Flat Files, Unix, Visio, Windows XP 2003
ETL Developer, July 2005 - June 2009
Clients: Blue Cross Blue Shield, HealthNow, Horizon BCBS, NJ
Project: Business Intelligence (BI)
A multi-year project to build a data warehouse that identifies the most profitable and potentially profitable customers for future interaction and supports claims analysis. Played a major role in understanding business requirements and in designing, loading, and extracting data for the data warehouse from the Facets system, Pharmacy Claims, Provider, Med Claims, Members, BHI, CRMS for McKesson, and OPA files. Used Informatica PowerCenter 8.6 (Source Analyzer, Target Designer, Mapping Designer, Mapplet Designer, and Transformation Developer) to define source and target definitions, code the data flow from the source systems to the data warehouse, and load data from source to ODS. Created reusable transformations and mapplets based on business rules to ease development, and completed multiple end-to-end projects for healthcare clients.
Education
Bachelor's and Master’s in Computer Science
February 1993 - April 1997, Cambridge College of Computer Science
Trainings and Certifications:
Oracle Certified Database Administrator (DBA), Oracle Corp., USA
AWS Certified Cloud Practitioner (AWS Workshop)
AWS Certified Solutions Architect - Associate (AWS Workshop)
AWS Technical Essentials (AWS Workshop)
Certification in Oracle Developer and SQL