Raunak Pandey
+1-815-***-****; ****************@*****.*** www.linkedin.com/in/praunak
EDUCATION
The University of Texas at Dallas May 2020
M.S., Business Analytics
M.S. Ramaiah Institute of Technology, Bangalore, INDIA June 2016
B.E., Instrumentation Technology
TECHNICAL SKILLS
Languages and Databases: C, C++, Java, R, Python, SQL, Scala, Oracle, MySQL, MSSQL, and MongoDB.
Hadoop Ecosystem: HDFS, YARN, MapReduce, Apache Spark, Sqoop, Hive, HBase, and Kafka.
Tools: Tableau, SAS, Power BI, Databricks, Hortonworks, and Advanced Excel.
Amazon Web Services: S3, EC2, EMR, RDS, Redshift, CloudWatch, and Lambda.
LICENSES & CERTIFICATIONS: Microsoft Certified: Data Analyst Associate, Microsoft Certified: Azure Data Fundamentals, Microsoft Certified: Azure Fundamentals.
BUSINESS EXPERIENCE
EPSILON, Bangalore, INDIA Aug 2016-Dec 2018
Data Engineer (Databricks and Informatica)
Developed data pipelines using Kafka, Sqoop, Hive, and Java MapReduce to ingest customer behavioral data (real-time and batch) and financial histories into HDFS for analysis.
Documented ETL test plans, test cases, test scripts, and validations based on design specifications for unit, system, and functional testing; prepared test data for testing, error handling, and analysis.
Developed business logic in Scala and managed data arriving from different sources, such as CSV and JSON.
Developed Python scripts to automate end-to-end data management and synchronization across all clusters.
Communicated and presented insights clearly and compellingly to the senior leadership of the organization.
REDWOOD ALGORITHMS, Bangalore, INDIA Jan 2016-July 2016
Data Engineer (Hortonworks and Tableau)
Analyzed business requirements by interacting with clients and reviewing business requirement specification documents.
Used Spark RDD transformations and actions to implement business analysis logic.
Developed Sqoop scripts to import data into, and export data from, HDFS and Hive.
Designed 167 SQL QA queries to validate source-to-target tables against transformation rules and lookup tables, comparing business and production data with an efficiency of 99%.
Developed design documents that evaluated the possible approaches and identified the best one.
Imported results into Tableau, a business intelligence visualization tool, to create dashboards.
ACADEMIC PROJECTS
Customer Segmentation Visualization – Marketing Analytics (Tableau) Jan 2020
• Created an interactive Tableau dashboard and story with specific insights for strategizing marketing campaigns.
• Used a map by region, histograms for customer distribution by age and balance, and a treemap for job classification.
Appliance Energy Prediction – Linear Regression, Gradient Descent, Logistic Regression Nov 2019
• Implemented both regression and gradient descent algorithms from scratch and optimized the cost function for different learning rates.
ADDITIONAL INFORMATION
Eligibility: Eligible to work in the U.S. with no restrictions for 36 months (STEM only) without sponsorship