Data Analyst Engineer

Location:
Richardson, TX
Salary:
80000
Posted:
November 07, 2020

Resume:

Raunak Pandey

+1-815-***-****; ****************@*****.*** www.linkedin.com/in/praunak

EDUCATION

The University of Texas at Dallas May 2020

M.S., Business Analytics

M.S. Ramaiah Institute of Technology, Bangalore, INDIA June 2016

B.E., Instrumentation Technology

TECHNICAL SKILLS

Languages and Databases: C, C++, Java, R, Python, SQL, Scala, Oracle, MySQL, MSSQL, and MongoDB.

Hadoop Ecosystem: HDFS, YARN, MapReduce, Apache Spark, Sqoop, Hive, HBase, and Kafka.

Tools: Tableau, SAS, Power BI, Databricks, Hortonworks, and Advanced Excel.

Amazon Web Services: S3, EC2, EMR, RDS, Redshift, CloudWatch, and Lambda.

LICENSES & CERTIFICATIONS: Microsoft Certified: Data Analyst Associate, Microsoft Certified: Azure Data Fundamentals, Microsoft Certified: Azure Fundamentals.

BUSINESS EXPERIENCE

EPSILON, Bangalore, INDIA Aug 2016-Dec 2018

Data Engineer (Databricks and Informatica)

Developed data pipelines using Kafka, Sqoop, Hive, and Java MapReduce to ingest customer behavioral data (real-time and batch) and financial histories into HDFS for analysis.

Documented ETL test plans, test cases, test scripts, and validations based on design specifications for unit, system, and functional testing; prepared test data for testing, error handling, and analysis.

Developed business logic using Scala and managed data coming from different sources such as CSV and JSON files.

Developed Python scripts to automate end-to-end data management and synchronization across all clusters.

Communicated and presented insights clearly and compellingly to the senior leadership of the organization.

REDWOOD ALGORITHMS, Bangalore, INDIA Jan 2016-July 2016

Data Engineer (Hortonworks and Tableau)

Analyzed business-specific requirements by interacting with clients and reviewing business requirement specification documents.

Used Spark RDD transformations to implement business analysis logic and applied actions on top of the transformations.

Developed Sqoop scripts to import data into and export data from HDFS and Hive.

Designed 167 SQL QA queries from source to target tables based on transformation rules and lookup tables comparing business and production data with an efficiency of 99%.

Developed design documents considering all possible approaches and identifying the best of them.

Imported results into Tableau, a business intelligence visualization tool, to create dashboards.

ACADEMIC PROJECTS

Customer Segmentation Visualization – Marketing Analytics (Tableau) Jan 2020

• Created an interactive Tableau dashboard and story with specific insights for strategizing marketing campaigns.

• Used a map by region, histograms for customer distribution by age and balance, and a treemap for job classification.

Appliance Energy Prediction – Linear Regression, Gradient Descent, Logistic Regression Nov 2019

• Implemented both regression and gradient descent algorithms from scratch and optimized the cost function for different learning rates.

ADDITIONAL INFORMATION

Eligibility: Eligible to work in the U.S. with no restrictions for 36 months (STEM only) without sponsorship