Sign in

Data Quality Engineer

Richardson, TX
January 15, 2020

Contact this candidate


Palash Jain 682-***-****

Self-motivated and process-oriented professional with in-depth knowledge of database type and big data capture, curation, Data Visuals, Data mining, paired with Python Programming. I worked in the domain of quality assurance of Big data analytics tool that sparked my interest in Data Analytics. Furnish insights, analytics and business intelligence used to advance opportunity identification, process reengineering and corporate growth.


Programming: Java, Python, My SQL, R

Technology: HDFS, Hive, Hadoop

Tools: Jenkins, JIRA, Tableau

Certification: Python for Data science (IBM), Python Programming (Datacamp)


The University of Texas at Dallas May 2021

M.S., Information Technology and Management (Dean’s Excellence Scholarship) 3.89

RGPV- State University, Bhopal, India May 2018

Bachelor of Engineering, Computer Science and Engineering


Impetus Technologies — Indore, India

Associate Quality Engineer, 2018 to 2019

Conducted data modeling, business intelligence gathering, trending and benchmarking also gave presentation to Pre-Sales and product management about the feature understanding while working for testcase automation of product

Generated complex quantitative datasets from Hive tables to perform benchmarking of data modelling and data aggregation process and formulated a consolidated report of several concurrent multidimensional queries response time and data accuracy.

Performed third party testing, after building complex star schemas, on tools like Tableau and MS Excel and executed SQL queries to perform benchmarking in concurrent environment

Certified the product on cloud services like AWS and Azure data lake for proper loading, transformation and aggregation of data


Price prediction analysis of Berlin Airbnb (Python) September 2019

Performed exploratory analysis on data to make decision on cleaning the data for it to be injectable in the prediction models

Analyzed the data using pandas and matplotlib that led to critical insights such as the busiest times of the year to visit Berlin

Trained different regression model (Linear, Logistic, KNN, Random Forest Regressor) in python, evaluated model performance and implement cross validation. After improvements model accuracy increased by 20%

Data analysis using statistics libraries, on how much do prices spike and trends in reviews of Airbnb visitors to Berlin and formed reports

E-commerce website August 2017 – May 2018

Designed a web application for e-commerce to perform business operation flow from product marketing to product delivery

Cataloged and categorized all the products in an interactive UI using jQuery and CSS for attracting prospective client

Programmed all the major e-commerce modules which enabled Administrator and Customer users to perform query on cart, customer, product and orders databases.


N.G.O. (Rang De Zindagi) in Teaching and Resource Development (RD) – Intern July 2017 - April 2018

Contact this candidate