Data Engineer

Location:
Brooklyn, NY
Posted:
August 17, 2023

Resume:

Sagar Pradeep Shringarpure

ady0h2@r.postjobfree.com +1-929-***-**** www.linkedin.com/in/sag791s/

SUMMARY

Data Engineer with 3 years of professional experience and a strong background in designing, implementing, and maintaining complex data architectures. Proven track record of working with clients in Agile teams to build scalable data pipelines, integrate data from diverse sources, and derive insights through data analytics that support business goals. Passionate about solving complex data challenges and delivering high-quality solutions that enable data-driven decision-making and data forensics.

EDUCATION

New York University, Master of Science, Computer Science May 2023

Relevant Coursework: Big Data, Principles of Database Systems, Design & Analysis of Algorithms, Databases for Business Analytics

University of Mumbai, Bachelor of Engineering, Computer Engineering June 2018

Relevant Coursework: Data Warehousing and Mining, Machine Learning (ML), Artificial Intelligence (AI), Cloud Computing

SKILLS

Programming Languages: Python, Scala, JavaScript

Database Systems: PostgreSQL, MySQL, MongoDB, SQL Server, AWS Redshift, DynamoDB, BigQuery

Cloud Computing: Azure Cloud (ADLS, Blob, Database, Azure Apps), AWS (EC2, Kinesis, S3, Glue, Redshift Spectrum)

Tools: Apache Spark, Git, Jenkins, Talend, Informatica, Snowflake, Airflow, Shell Script

Visualization Tools: Tableau, Microsoft Power BI, Excel, Looker

Soft Skills: Communication, Critical Thinking, Problem-Solving, Creativity, Helpful Nature, Adaptability

EXPERIENCE

Graduate Teaching Assistant, New York University September 2022 – May 2023

●Provided technical support to 150+ students across 3 Big Data courses, including High-Performance Computing using Dataproc, covering Spark and Hadoop; facilitated class discussions and held office hours for individualized assistance.

Data Intern, Quantiphi Inc, USA June 2022 – August 2022

●Developed data pipelines using AWS Lambda and Kinesis streams, which improved data processing performance by 40%

●Designed the data lake transition using DynamoDB and Kinesis streams to perform ingestion, transformation, and data quality testing, which resulted in acquiring new business domains for the insurance client

Senior Data Engineer, Quantiphi Analytics Pvt Ltd, India October 2020 – July 2021

●Achieved a 50% improvement in ETL processing by restructuring the ingestion of event-based data on AWS, which ensured customer retention and helped clients process financial transactions on time

●Interacted with clients to gather business requirements and applied that understanding to model key insurance pricing and billing metrics

●Deployed stored procedures, views, and triggers in SQL for customized financial reporting

●Implemented custom programs to orchestrate batch jobs and streamlined the deployment of data solutions through documentation and logging

Data Engineer, Quantiphi Analytics Pvt Ltd, India July 2018 – October 2020

●Conceptualized and developed real-time data pipelines using Dataflow, Talend, and BigQuery for the analysis of insurance data

●Created, analyzed, and presented data visualizations in Tableau and Microsoft Power BI to analyze consumer data and surface key business metrics, which helped the team deliver crucial product updates to stakeholders

●Tested and validated data pipelines using Microsoft Azure and SQL Server to successfully execute a data migration POC from on-premises to the Azure cloud

PROJECTS

Web Application Development [Flask, SQLite, PHP, Python, UX Design] May 2022

●Created a question-answering website using Flask, designing the relational schema and the front end

●The website supported keyword search, user creation, password hashing, like/dislike counts, and database indexing

Tennis Analytics – Big Data Project [Apache Spark, Python, Data Analysis] December 2021

●Built a Tennis Analytics Big Data solution using PySpark and visualization libraries to present tennis analytics

●Computed head-to-head and historical player analyses, and integrated live data through a public API

Banking and Finance Data Project [SQL, Google Cloud Platform, Tableau] July 2018

●Led a group of 5 developers and delivered a finance project that conveyed data insights through illustrative Tableau graphs

●Developed and tested data ingestion to BigQuery using SSIS as an ETL tool

Prediction of Personality based on Handwriting [Python, CNN, OCR] April 2018

●Applied image processing and machine learning methods to predict Myers-Briggs Type Indicator personality types

●Achieved an accuracy of 85% in identifying personalities on data from 250+ participants at the University

CERTIFICATIONS

●Google Cloud: Big Data and Machine Learning Fundamentals (Coursera) 2019


