OBJECTIVE
Design, implement, and optimize data
structures to ensure accuracy,
efficiency, and accessibility. They
translate business requirements into
technical specifications, create
relational and non-relational models,
and collaborate with stakeholders to
support data-driven decision-making
and enhance overall data
management processes.
Avinash
Kumar
DATA MODELER
With a decade of experience, I specialize in data modeling, transforming raw data into structured insights. My expertise spans database design, data warehousing, and advanced analytics. I bridge technical and business realms, ensuring data integrity and accessibility, and driving informed decision-making and strategic growth. CONTACT
• *****.*******@*****.***
• https://www.linkedin.com/in/avina
sh1987/
• Frisco, Tx
EDUCATION
• Master of Science in Data Science,
Western Michigan University, USA.
• Master of Engineering in Computer
Science, RGPV, India.
• Bachelor of Engineering in
Information Technology, RGPV,
India.
SKILLS
• Erwin, ER Studio
• Power BI, Tableau, Python
visualization libraries.
• AWS, Azure, NoSQL, SQL
• ER/Dimensional/Data vault 2.0
modelling
• Agile, scrum, Iterative
Development, UML
DATA MODELER, TARGET
Oct. 2023 - Present
• Explained intricate data analysis and modeling concepts to non-technical stakeholders for data-driven decision-making.
• Developed and implemented a change management plan, ensuring minimal disruption during organizational changes.
• Communicated data experiment outcomes effectively to both technical and non-technical audiences.
• Worked on NoSQL databases MongoDB and Spark for data modeling, utilizing reverse engineering for model creation and documentation.
• Used NoSQL booster for MongoDB to analyze, profile, map, test, and validate data during reverse engineering.
• Converted JSON strings to native objects. Designed and developed NoSQL database models in Cassandra and
DynamoDB, and relational models using Erwin.
• Transformed monolithic data models into scalable, domain- driven models. Developed applications and migrated data from RDBMS to NoSQL databases like Cassandra.
• Used Cassandra stress tool for read-write latency testing and cluster fine-tuning, maintaining data dictionary and global maintenance processes.
DATA MODELER, CATERPILLAR INC.
Nov. 2021 – Sept. 2023
• Translated business requirements into functional and non- functional requirements using Workflow, Sequence, Activity Diagrams, and Use Case Modelling.
• Created Physical and Logical models using Erwin, and generated CSV reports and DDL. Built data models in Erwin, and designed grain of facts based on reporting needs.
• Conducted data analysis, identifying data sets, sources, metadata, definitions, and formats. Operated on AWS cloud platform (EC2, S3, EMR, Redshift, Lambda, Glue).
• Defined/tracked KPIs and metrics for data governance effectiveness and improvement.
• Trained personnel in relational database use and Data Governance policy/procedures.
• Implemented/maintained data governance frameworks (e.g., DAMA-DMBOK).
• Enforced referential integrity in the OLTP data model for consistent table relationships.
• Analyzed system specifications/business requirements, ensuring compliance with corporate rules and regulations.
DATA MODELER, T-MOBILE
Jan. 2020 - Nov. 2021
• Successfully migrated on-premises data to GCP using Azure Migrate tools for scalability and cost- effectiveness.
• Conducted thorough analysis of client's data architecture, implementing data governance policies compliant with regulations.
• Utilized Azure Data Factory to orchestrate data migration from Azure SQL databases and Blob storage to GCP's BigQuery and Cloud Storage.
• Configured secure network connectivity between Azure and GCP via VPC peering and VPN gateways.
• Used GCP Transfer Service to simplify migration, transferring large data volumes from Azure Blob storage to Google Cloud Storage.
• Leveraged Snowflake's data warehousing for optimized data storage and retrieval, enhancing accessibility.
• Implemented Azure Cognitive Services, increasing document processing efficiency by 25%.
• Enhanced document intelligence through Azure forms designer and custom vision, achieving a 20% accuracy boost.
• Worked extensively in Data Governance, focusing on metadata management, master data management, data quality, and security.
• Led cross-functional teams, prioritizing transparency and efficiency through agile methodologies, developing roadmaps and managing budgets.
DATA MODELER, HD SUPPLY
Aug. 2016 – Nov. 2018
• Utilized Azure Data Factory and Azure Databricks to streamline data audit trails, reducing platform usage costs by 15%.
• Constructed episodes using Medicare Advantage claims data with Python, SQL, and Azure Synapse Analytics, increasing program enrollment by 10%.
• Applied data intelligence techniques on Azure, resulting in a 30% surge in BPCI-Advanced program enrollment.
• Optimized ETL data pipelines in Apache Airflow on Azure, boosting data processing efficiency by 60%. Developed ETL workflows with Azure Data Factory, loading diverse datasets into Azure SQL Data Warehouse. Collaborated on Airflow-based data solutions on Azure, aligning with business needs and best practices.
• Integrated Azure Functions in Airflow to enhance ETL job orchestration and data pipeline optimization.
• Designed and implemented over 50 Airflow DAGs using Azure Functions for efficient ETL workflow management.
• Developed CI/CD workflows with Jenkins and Azure Functions, improving deployment speed and reducing manual errors.
• Crafted custom Airflow operators and sensors on Azure, enhancing development efficiency and data processing accuracy.
DATA MODELER, AXIS BANK
July 2012 – Mar. 2014
• Defined project scope, gathered business requirements, and performed GAP analysis. Implemented Data Lake using Hadoop architecture.
• Loaded data into Hive tables from HDFS to enable SQL access to Hadoop data. Created MDM rules using IBM Infosphere Master Data Management.
• Developed logical data models with IBM Infosphere Data Architect.
• Converted user requirements into business, functional, and technical specifications, and managed requirements using DOORS.
• Defined business logic for web services in SOA-based applications. Created physical data designs and first- cut data models for various projects.
• Performed performance tuning using IBM Infosphere DataStage 8.5. Worked with diverse data formats and developed ETL strategies for data warehouse projects, including running Hadoop streaming jobs and using Flume for data collection.
PL/SQL DEVELOPER, AMERICAN EXPRESS
Aug 2010 – jun. 2012
• Identified core systems' processes, workflows, and data flows.
• Conducted extensive user requirements gathering and gap analysis. Participated in the full development cycle: Planning, Analysis, Design, Development, Testing, and Implementation. Developed PL/SQL triggers and master tables for automatic primary key creation.
• Conducted data analysis for data conversion, including data mapping, specification, and writing data extract scripts. Created advanced PL/SQL packages, procedures, triggers, functions, indexes, and collections for business logic implementation using SQL Navigator.
• Built ETL packages using SQL Server SSIS, extracting data from XML files and loading it into databases.
• Designed and developed Oracle forms and reports, generating up to 60 reports. Performed data loading and extraction using SQL*Loader.
• Administered all database objects, including tables, clusters, indexes, views, sequences, packages, and procedures, and extensively used exception handling for debugging and error messaging.