SHIVAM VERMA
** ****** ******, *******, **-***** ******.********@*****.*** 857-***-**** LinkedIn GitHub
EDUCATION
Northeastern University, Boston, MA, USA Sep 2019 – Aug 2021 Master of Science in Engineering Management GPA: 3.57
Relevant courses: Data Warehousing & Business Intelligence, Data Mining & Machine Learning, Database Design & Management, Computation & Visualization for Analytics, Probability and Statistics
Jaypee Institute of Information Technology, UP, India May 2015 – Jun 2019 Bachelor of Technology in Electronics and Communications Engineering
TECHNICAL SKILLS
Programming Languages: Python, SQL, R, SOQL, PySpark, Bash (Linux)
Software & Tools: Tableau, Power BI, DOMO, AWS (Redshift, DynamoDB, S3, Athena, Kinesis, EMR, Lambda, EC2), Celigo, NetSuite ERP, Salesforce, Alteryx, Informatica, DataWrapper, Oracle BI, JIRA, Confluence, Docker, Git
Big Data: Hadoop, Hive, MapReduce, HDFS, HBase, YARN, HiveQL, Apache Spark SQL, Apache Airflow, Superset, Sqoop
Database Tools: MySQL, PostgreSQL, Talend, SSIS, SSRS, SSAS, Azure Data Studio, Toad Data Modeler, ER Studio
Packages: NumPy, Pandas, SQLAlchemy, Scikit-Learn, Plotly, Streamlit, ggplot, Statsmodels, Matplotlib, Seaborn, MLxtend
Certifications: AWS Cloud Practitioner, Tableau Desktop Specialist, Predictive Analysis using Python, Lean Six Sigma
EXPERIENCE
Barton Associates, Peabody, MA, USA Oct 2023 – Present Business Intelligence Developer
• Executed strategic data migration and integration initiatives, empowering company-wide self-service dashboards for KPI analysis.
• Designed DOMO dashboards analyzing 10 years of finance data using ETL SQL, resulting in a 15% efficiency increase and informing strategic decisions on underperforming locations.
• Developed Salesforce CRM and NetSuite ERP integration using Celigo, achieving a 70% reduction in manual record creation and saving over 20 hours per week in Finance operations.
• Implemented a daily automation that converts 100+ timesheet PDFs, loads the data into SQL Server, and integrates it with Salesforce using Lambda functions, resulting in a 40% increase in data accuracy.
• Extracted data from Marketo API to perform user engagement, market campaign, and traffic analysis using Python, to support targeted marketing for clinicians.
• Participated in Agile methodologies, translating business requirements into technical specifications within cross-functional teams.
ImmuneID Inc, Waltham, MA, USA July 2022 – Oct 2023 Data Engineer II
• Spearheaded development of a comprehensive conceptual, logical, and physical relational data model to support LIMS.
• Designed efficient data pipelines using AWS Glue to ingest and streamline EMR data from multiple vendors by storing harmonized data in AWS RDS, resulting in a responsive and dependable system for end users.
• Extracted data from Neo4j NoSQL database using Cypher query language to analyze protein & gene information with precision.
• Engineered AWS Lambda functions to automate daily data transfer between the Benchling API and AWS RDS, leading to an 80% reduction in manual effort for data extraction and transformation.
• Utilized Docker and ECR to package Lambda function code, dependencies, and runtime, streamlining deployment processes.
• Migrated 1 TB+ of data from on-premises servers to AWS S3, optimizing resource utilization.
• Implemented ML pipeline to flag abnormal pipetting events in real time, preventing scientific experiment failures.
• Developed and deployed multi-page R Shiny applications for scientists and lab personnel, enabling efficient lab data capture.
MultiPlan Inc, Naperville, IL, USA Oct 2021 – July 2022 Software Quality Assurance Analyst II
• Responsible for testing Data Warehouse, ETL and Health Claims solutions using agile software development processes.
• Validated 90M+ rows of healthcare data by testing the database from source to target in Oracle with SQL queries (using window functions, CTEs, and complex subqueries).
• Maintained 20M rows of historical data in dimension and fact tables using Slowly Changing Dimension (SCD) Type 1 and Type 2.
• Provided ad-hoc analysis solutions using Azure Data Lake Analytics and Hive QL to support end-to-end database testing.
• Created and maintained detailed, accurate, reusable test scripts and files to expedite regression testing.
• Logged and reported defects using JIRA; collaborated with developers to resolve issues so that the delivered product performed as expected.
• Created a Requirements Traceability Matrix, test plans, and detailed defect reports outlining major requirements, which increased product quality and expedited product release.