Post Job Free
Sign in

Data Engineer

Location:
Dallas, TX
Posted:
September 17, 2020

Contact this candidate

Resume:

Tej Kashiparekh (available **-Dec)

469-***-**** ***.***********@*****.*** linkedin.com/in/tejk

SUMMARY

Self-motivated Technologist with 4+ years of experience in Data Warehouse and Management. Collaborates effectively with multi-functional roles to identify and leverage areas for improvement in Data Systems. Bringing a comprehensive understanding of Product Operation, Strategy, and Documentation needs

BUSINESS EXPERIENCE

Data Engineer Intern Prospect Direct Jun 20 – Dec 20

•Improved Business Analytics infrastructure 3-folds by modeling star-schema Data Warehouse hosted on AWS

•Assisted in Product Growth Planning by developing Peak Sales vs Inventory Exhaustion forecast to target 1.5x sales lift

•Reduced data refresh to 1 day by introducing ETL data mappings to stream new Woodpecker CRM data using Zapier

•Analyzed and maintained spreadsheets using formulas to perform analysis on product metrics

•Optimized Database using indexes, constraints, joins, and clean-up reducing query execution time by almost 60%

Product Analyst ECLINICALWORKS Jul 16 – Jun 19

•As an SME, trained 4 engineers & drove module features through release management lifecycle with 90% on-time launch

•Performed root cause analysis and provided permanent resolution on 500+ product issues with 98.5% on CSAT survey

•Bolstered PM decision making for prioritizing bug-fix using SQL & Jira to increase Module usability by 20%

•Collaborated with managers to define productivity KPI using Business Intelligence and Visualization tools, realizing a 12% increase in average product support performance over 2 quarters

•Predicted the likelihood of employee churn and recommended strategies that reduced churn by 15%

SELECT PROJECTS

Big Data Analytics Road Accident Risk Mitigation on Truck Fleet Data

•Loaded, transformed fleet data in Cloudera using Hadoop Pipeline tools to identify risky drivers and curtail expenses

•Made interactive visualizations by connecting to Tableau for presenting stories to decision-makers

SDLC Web Application for Marketing Agency

•Acted as an Agile Project Manager to present a business proposal to Prospect Direct for scaling their operations 5x

•Designed BPMN, UML, Functional, Class, and ER diagrams as a part of 7-weekly sprints with a team of 4 students

Predictive Modelling Mining Insights for Coffee Brand using Scanner Data

Skills used: SAS, SQL, Tableau, Panel Regression, ARIMA Forecast, K-means clustering, Chi-sq test, T-test, f-test

•Used Panel Regression in SAS to determine effects of price, promotion, flavor, and shelf display on customer sale

•Developed forecast experiments to identify trends using input from the above model to offer loyalty rewards

Applied Machine Learning Predicting Repair Maintenance on Tanzania Water Pump Data

Skills used: Jupyter Notebook, Python, One-hot encoding, Logistic Regression, Random Forest, XGBoost

•Built reusable Python pipeline to clean, describe, normalize, and split data into test-train in Jupyter Notebook

•Using SciKit-Learn coded Logistic Regression and Random Forest to predict pump failure for minimizing water shortage

Data Modelling MySQL powered DBMS for City of Frisco Traffic Data

•Authored logical diagram in MS Visio, and Implemented physical schema with 3NF

•Integrated batch CSV data into MySQL using ETL jobs in Informatica PowerCenter

TECHNICAL SKILLS

Certificates: AWS Cloud practitioner fundamentals, Data Engineering with GCP, Data Warehousing for BI

Scripting: Python, SQL, R, SAS, HTML, JSON, CSV, XML, Shell CLI

Hadoop Data Pipeline: Sqoop, Flume, Kafka, Hadoop, MapReduce, Impala, Hive, Pig, Spark (Databricks), NoSQL

Data Visualization: Tableau, Power BI, MS PowerPoint

Business Intelligence: MS Excel, MS Visio, Cloudera, Talend, Zapier, Informatica, SSIS, SSRS, Looker, IBM Cognos

Product Management: A/B Testing, User Research, Roadmap, Prioritization, KPIs, CSAT, Jira, Agile-Scrum, User Stories

EDUCATION

M.S. Information Management University of Texas at Dallas May 21

Graduate Coursework: Data Mining, Database Management, Statistics and Data Analysis, Predictive Analytics, Applied Machine Learning, Programming for Data Science, Systems Analysis, Big Data Analytics, Object-Oriented Programming

B.S. Computer Science Gujarat Technological University, India May 16



Contact this candidate