Post Job Free
Sign in

Accounts Payable Data Architect

Location:
Glenelg, MD
Posted:
December 08, 2024

Contact this candidate

Resume:

Gulam Rasool Siddiqui PMP®

+1-410-***-**** **************@*****.*** Glenelg, MD

LinkedIn

(Continue) Page 1 of 8

About Me

I am a passionate and driven IT professional seeking a position as a Solutions or Data Architect/Engineer in the retail, manufacturing, or finance sectors. I am eager to analyze raw data and provide valuable insights for business groups and higher management. With my experience, I tailored Accounts Payable, Accounts Receivable, Contract & Pricing, Finance, FP&A, and Management to get the most out of day-to-day ERP and commercial activities. My interest is about finding revenue leaks with data cleansing, modeling, bringing ERP & non-ERP data into one system, and building bridge applications.

Through these experiences, I have developed skills in ETL/ELT, SQL optimization (including creating indexes and table partitions), content creation, and data analytics. I also possess a understanding of Data Governance and Big Data technologies, including Machine Learning (ML), Natural Language Processing (NLP), Artificial Intelligence (AI), HDFS, Spark, and Scala. Recently, I have expanded my expertise into Generative AI (GenAI), utilizing advanced techniques such as Retrieval-Augmented Generation (RAG) to enhance data-driven decision-making, particularly in the context of clinical trial document search and other business applications. Additionally, I have domain-specific knowledge in the Pharma industry, including EDI 852, 867, Drug Distribution Data (DDD), and various pharmaceutical landscape datasets, from drug distribution and patient access to claims processing and national metrics. Skills

Langages: Python, C#, Java, VB.Net, .Net, SQL, PL/SQL, NoSQL, Scala Databases: MySQL, MS-SQL, Oracle, Snowflakes, Redshift, Hive, MongoDB, PostgreSQL ETL Packages: SSIS, AWS Lambda, BODS and Informatica Know Skills: Data Architect, Data Engineering, Data Analytics, Big Data, Project Manager, Solution Architect, Business & Technical SME, Data Science, Gen AI and Airflow Front End Development: React, JavaScript, Django, and Angular Soft Skills: Product owner, Team Player, Delivering product on time within budget. Cloud & Big Data: AWS & Azure.Hadoop HDFS,Spark,YARN,TensorFlow,Kafka,MapReduce and Sqoop Strength: Diligent Problem Solving, Result & Data Driven, Adaptability, Collaboration, Application Integration, Time Management and Critical Thinking Visualization Tools: Tableau, Power BI, Qlik, Cognos, and Looker Domain/3PL/EDI: Pharma Commercial (Model N (Contracts, Pricing, chargebacks, Rebates, Medicaid and GP ), Medical (Veeva CRM DDS, ODS and Reports Databases), Gulam Rasool Siddiqui PMP®

+1-410-***-**** **************@*****.*** Glenelg, MD LinkedIn

(Continue) Page 2 of 8

UPS/Cardinal/Prasco/Knipper 3PL’s SFTP files ingestion, EDI 850, 856, 810, 844, 845, 849, 852, 867, 943, 944, 945 and third party’s IQVIA to transfer weekly 3PL sales files. Education

Master of Science: Master of Computer Applications Osmania University Bachelor of Science: Computer Science Osmania university – Hyderabad, India Certificate: PMP, Nov 2020-2026 Project Management Professional (PMP)® - Credly Post Graduate Program in Data Engineering. Cert:36332878 Purdue, USA 2021 Hands-On Essentials: Data Engineering Workshop Snowflake Education Services Data Engineering Capstone Project Coursera

Awards Interest’s

Best Employee 2018

Hacker Rank: SQL 5 Stars HackerRank

Self-Improvement: Committed to lifelong learning

through reading, online courses, and mindfulness

practices.

Helping Needy: Actively involved in community

service projects, such as supporting for education Food & Cooking: Hobby for experimenting with new

recipes; enjoy hosting dinner gatherings.

Summary

I am a business-focused Data and Solutions Architect who specializes in testing, architecting and implementing new ETL/ELT solutions. I am proficient in SQL,PL/SQL, C#, Python, Java, API, Airflow etc and I’ve a real-world experience data modeling, cleansing, visualization and using cutting-edge analytics to gain useful insights. I especially love working on high data volume (in past work with 6TB), high level complex projects where I can use my knowledge to address business issues and bring data driven solutions. My passion for innovation extends even into innovative technologies such as Generative AI (GenAI) and Retrieval-Augmented Generation (RAG) which I look forward to seeing new possibilities to drive business intelligence and decision making. I’m also open to taking on new projects that will help me continue my Data Architect/Engineering journey, working on impactful, futuristic projects. Experience

Idorsia Pharmaceuticals US Inc, Radnor, PA / September 2021 - Present

Worked directly with business and IT stakeholders to design, create and deliver solutions for North America Commercial Operations, Market Access and Revenue Management, Gulam Rasool Siddiqui PMP®

+1-410-***-**** **************@*****.*** Glenelg, MD LinkedIn

(Continue) Page 3 of 8

keeping alignment with business.

Product Owner for several applications and lead continual improvement and projects to achieve business goals and requirements.

Developed comprehensive business intelligence, such as understanding of Idorsia products, legacy systems and business operations to help with decision making and customer service.

Provided systems functional design, architecture, interface design, and data management subject matter knowledge for the best performance and integration of systems.

Develop great relationships with both internal and external stakeholders and tempered expectations with regular communication, frequent updates, and resolution of issues.

Collaboration with international IT teams and vendors to advocate technologies and capabilities, shape roadmaps, and prioritize projects that align with business activities.

Design, develop, test, deploy, maintain, and continuously improve software to support customer needs.

Clean, preprocess, and transform data to ensure its suitability for training AI/ML models.

Train AI/ML models using appropriate algorithms and techniques. Evaluate model performance and make improvements as needed.

Spearheaded the development of a clinical trials document search engine using GenAI and RAG to improve document retrieval and enhance decision-making in clinical research.

Integrated RAG models with clinical datasets, significantly improving information access and reducing time spent on manual document searches.

Designed and executed data preprocessing and transformation pipelines to optimize performance of the RAG-based document search system.

Published findings on how combining generative models with retrieval techniques can significantly improve search relevance and information accuracy in clinical research settings.

Conducted research on integrating speech recognition technology into presentable report to transform unstructured voice data into actionable insights for business intelligence.

Developed a speech recognition system to convert voice input into SQL query that extract MySQL database data for business intelligence reporting.

Integrated speech-to-text APIs (such as Google Cloud Speech-to-Text) with Django API to automatically convert voice recordings into analyzable data points.

Automated data updates through scheduled data refreshes, ensuring real-time analysis by using speech-recognition text convert it to SQL queries. Gulam Rasool Siddiqui PMP®

+1-410-***-**** **************@*****.*** Glenelg, MD LinkedIn

(Continue) Page 4 of 8

Served as an interface SME for the US headquarters and 3PL businesses, validating interface data to minimize corrections and manual financial postings.

Developed BI reports on the Veeva CRM (Medical) using the Redshift database.

Possess a clear understanding of Veeva, SAP (including stock movements, O2C, sampling goods, and accounts receivable), and CPI.

Created a data ingestion pipeline to load files from S3 to Redshift using AWS Lambda on a daily schedule and built a dashboard for reconciling data with monthly G2N Prasco metrics.

Collaborated with EY auditors to clarify business processes related to selected data points.

For IQVIA developed a process using the UNLOAD command in Redshift to extract data from the 3PL database and write it directly to an AWS S3 bucket. During this process, an issue arose with the unintended addition of extra zeros in the generated file names. To address this, I implemented an AWS Lambda function that created a corrected file in a separate folder, allowing for the updated file to be copied to the intended location.

Loaded the EDI 867 data file into Redshift to assist Market Access in identifying newly added pharmacies. This process enables effective segregation of data into the appropriate buckets for streamlined analysis.

The Project Environment: Python,boto3, Redshift,EDI (943,944,850,856,810,852,867), S3, AWS Lambda, speech_recognition,Django, Hugging Face’s transformers. Lupin Pharmaceuticals Inc., Baltimore, MD / September 2014 – September 2021

Model N Upgrade (Project Manager & Solution Architect): Managed the upgrade of Model N from version 5.2 to RMAAS, taking on the Project Management and technical lead roles from Lupin to reduce implementation costs. During the deployment phase, we encountered performance issues, which I resolved by implementing partition indexing to reduce I/O reads. I worked closely with both offshore and onsite vendor consultants to avoid any delays in the go-live process. I also took ownership of technical tasks such as loading historical data, building processes for data conversion, and making configuration changes.

Automation of DEA and HIN Customer Loads: I developed a monthly process for loading DEA and HIN data into a database. This daily process pulls kick-out customers from Model N and runs business logic in the database to create XML files for customers, which are then placed in an SFTP folder.

Automation of ASN (EDI 856 and 943): I developed a download agent in C# to retrieve the previous day's file from the UPS site and load it into MS SQL using SSIS. I incorporated business logic in the stored procedures to process and publish the EDI document using the MSSQL BCP command. Similarly, when a SAP user creates a PGI Gulam Rasool Siddiqui PMP®

+1-410-***-**** **************@*****.*** Glenelg, MD LinkedIn

(Continue) Page 5 of 8

file and drops it into a designated folder, the agent picks up the file, runs the SSIS packages, and generates the EDI 856 file, which is then transferred to the EDI vendor's SFTP for distribution to wholesalers and 3PLs.

Model N & SAP Credit Memo Validation: I built a validation process that compares Model N credit data with SAP aging data files, generating multiple reports (including deductions not submitted, discrepancies between the two systems, and pending credits). This process helped recover $9 million in chargebacks from our customers. The reports also triggered alerts regarding wholesalers or distributors submitting incorrect WAC pricing and rejecting system-generated credit/debit memos, which amounted to nearly $12 million.

Developed a process for conducting ad hoc audits on chargeback data using the EDI 867 to analyze total sales against chargeback quantities, aiming to identify resalable units that match chargeback returns.

Model N Commercial Subject Matter Expert

o Provided support to the Contracts and Chargebacks teams for day-to-day issues o Addressing rebate and pricing commitment issues, as well as membership issues on rebate contracts, particularly for strategy-based programs. o Creating and uploading data extracts (chargebacks, direct and indirect pricing) based on three months of average sales data into Model N. o Developing a non-negative chargeback sales strategy to prevent salable returns from being bucketed into rebates.

Reports: Developed and deployed several standard reports in Model N o Daily Rebates Release Report (aids Accounts Receivable and FP&A). o Chargeback Claims Report for SAP and Aggregated Monthly Chargeback Report

(supports FP&A for accruals vs. actuals).

o Chargeback Trend Report for management.

The Project Environment: Model N(PbN), C#,EDI (844,845,849,852,867 & 820),VBA, SSIS, SSRS, Oracle (SQL & PL/SQL) and MSSQL 2005/2008,jscript, Cognos, Qlik and .Net System Analyst Level III

Actavis, Parsippany, NJ / Nov 2013 – August 2014

BCS (Business Unit Scrubbing System - Java Web Application): At Actavis, I developed a system to streamline the validation of DEA and HIN for both direct and indirect customers stored in SAP. Recognizing that developing these validations in SAP was time-consuming and costly, I designed and implemented a user-friendly application that allows business users to log in and deduplicate customer entries. For instance, when SAP sends indirect customers with different identifiers (DEA, HIN, or 340B) but the same address, the BCS application categorizes these customers into a duplicate Gulam Rasool Siddiqui PMP®

+1-410-***-**** **************@*****.*** Glenelg, MD LinkedIn

(Continue) Page 6 of 8

address bucket. Users can establish a parent-child hierarchy between customers, consolidating all child attributes (identifier, COT) under one parent. This functionality enables users to resolve multiple customers sharing the same address through a single web form.

Technologies Used: Java, jQuery, SAP BODS for ETL, and Oracle stored procedures for business logic. The ETL process leverages SAP BODS, while business logic for adding COT, addresses, and identifiers, along with duplicate alerts (address, identifier, and member name), is encapsulated within an Oracle package using stored procedures and functions.

Indirect PO Price Call: The requirement was to retrieve the indirect least price for the past three years. The SAP development team initially wrote a SQL statement that combined NDC11 and DEA to extract pricing, resulting in a processing time of approximately 45 seconds per transaction. Model N flagged this as a performance issue and advised against implementation. The SAP team manager consulted me for an analysis and to propose a better approach. After evaluating the process, I recommended inserting distinct customers and NDCs from a PO into a temporary table and executing a stored procedure on this table, which significantly reduced the callback time for each transaction. I created a package to extract bid award price changes for the past three years, joining this data with DEA to return the least price efficiently. Project Environment: Model N, Java, and Oracle (SQL/PL-SQL). Principal Consultant - Technical Lead (Professional Services) Model N, Princeton, NJ / May 2013 – Oct 2013

Clients: Celgene / Watson / Model N

Watson: Conducted a comprehensive data audit to identify and resolve issues with master data, significantly reducing data validation errors and improving overall database performance. Implemented master data validation processes for customers

(validating DEA, HIN, CUST, and 340B IDs), ensuring no duplicates; for products (both internal and competitor), confirming valid NDC 11/9 codes and ensuring products were accurately categorized across multiple groups. Performed transactional data validation for chargebacks, verifying valid products and effective dates within contract periods, as well as validating members, COT, and invoice dates.

Celgene: Developed a function to establish product hierarchy levels for use in custom reports and data flows, facilitating the extraction of paid chargeback lines. Troubleshot data flow warnings, correcting issues to improve quality of service and eliminate warning messages. Demonstrated and implemented user relationships between Contract and Bid Award IDs for use in EDI 844 communications sent to customers as EDI 849. Extracted comprehensive customer master data from the database for the RME implementation team to validate against client records. Created custom user Gulam Rasool Siddiqui PMP®

+1-410-***-**** **************@*****.*** Glenelg, MD LinkedIn

(Continue) Page 7 of 8

access roles to manage data file loading, incorporating these roles into the build for RME V5.5. Identified issues with overlapping effective dates for primary and non- primary IDs within the customer data flow and initiated a Product Development ticket to address these concerns.

Additionally, I conducted a data audit to identify master data issues, reducing validation errors and enhancing database performance through the creation of functions to check for duplicates in master data (including DEA, HIN, CUST, and 340B IDs) and validating products with NDC 11/9 codes.

Key Responsibilities:

o Master Data and Transactional Data Validation

o Data Flow Troubleshooting

o Custom Reporting Development

o User Access Role Creation

Project Environment: Model N, Java, and Oracle (SQL/PL-SQL). Principal Consultant - Technical Lead (Professional Services) for Merck Paragon Solutions, PA / Oct 2011 – May 2013

Collaborated with Model N Revenue Management for Life Sciences application, extracting data from legacy systems and importing it into Model N to ensure accurate rebate creation. Managed data flow processes, including routing to PHUB for acknowledgment before forwarding to GP.

Verified UOW (Unit of Work) creation by the Model N team, ensuring SQL queries were accurate and free from hardcoded values or decodes. Communicated any discrepancies to the IT team prior to approval and validated the results of data flows.

Explained the relationships between Model N, Customer Master, and Product Master databases to cross-functional business analysts (BAs) and technical analysts (TAs) for improved understanding and collaboration.

Developed Functional Requirement Specifications (FRS) for integrating legacy systems with Model N, facilitating the flow of Medco utilization data back to CDVS and rebates data back to the GP system.

Executed performance testing for the Model N application during UBC data import and validation, implementing enhancements such as adding hints to Model N stored procedures to reduce processing time.

Created and presented SQL queries in meetings to streamline processes for Business, ADI, and Model N teams, facilitating the efficient creation of UOWs.

Enhanced the Contract Registry for business users, implementing improvements through object-oriented programming (OOP).

Provided data to system TAs for testing Model N application functionality and assisted other workstream TAs in understanding the Model N database, thereby reducing the time needed to create FRS, UOM, and IFS.

Converted CSV data to XML format for loading into Model N via data flows. Gulam Rasool Siddiqui PMP®

+1-410-***-**** **************@*****.*** Glenelg, MD LinkedIn

Page 8 of 8

Developed SQL queries to extract data from various systems, including CDVS and GenCo.

Project Environment: Model N, C#, VB, and Oracle.

Senior Consultant

Apotex Corp, Weston, FL / Jun 2006 – Oct 2011

Developed a GPO wholesaler application to capture chargeback data, allowing users to add wholesalers and their members while managing contracts with distributors, wholesalers, and third parties. The application features a user-friendly GUI built in C# and ASP.NET.

Created a rebate and sales validation application that enables users to execute ad-hoc queries and export data in various formats (Text, Excel 2003, and on-screen viewing) using C# and Windows Forms. This tool significantly aids in auditing customer- requested rebates, helping to preserve company revenue.

Designed a reconciliation application for checks from McKesson, AmerisourceBergen, and CVS using a VBA program in Excel (2003 & 2007), reducing processing time from weeks to hours.

Developed an invoicing system that generates PDF invoices and sends them via email to customers.

Created invoice reports using SSIS deployed in Reporting Services, accessible through a .NET web application.

Established an ETL process with SSIS to transfer chargeback data from the I-Many- CARS (Oracle) database to SQL Server 2005 for analysis and sales forecasting.

Traced Oracle sessions for troubleshooting and to automate manual processes, enhancing operational efficiency.

Maintained and extended the ICorp .NET application for the contracts team, aiding national account managers (NAMs) in reviewing the least prices for each NDC with other wholesalers and distributors.

Project Environment: ASP.NET, C#, Oracle, MSSQL 2005/2008, Visual Studio.NET

(2010/2008/2005), Crystal Reports, JavaScript, XML, IIS web server, iMany-CARS V3.12, Medicaid V8.4.2, SSIS & SSRS.

Team Lead

Perimeter E-Security, Weston, FL / Oct 2002 – Jun 2006 Application Architect

Domain Bank Inc, Bethlehem, PA / Jan 2002 – Sept 2002



Contact this candidate