.
.
VIRASHREE PATEL
Austin, Texas C: (***) *** - 5150 E: **************@*****.*** https://patelvirashree.wixsite.com/virahome Summary
Self-motivated individual with superior skills in working in both team-based and independent capacities. Bringing strong work ethic and excellent organizational and leadership skills to any setting. Excited to begin new challenge in data engineering and data science fields with a successful team. Skills
• Python, Scala, Bash, SQL
• Spark, Hive, Sqoop, Teradata, Solr, Oozie, Airflow
• Pandas, Natural Language Toolkit (NLTK), Gensim, NumPy
• R Studio, R Shiny
• MySQL, PostgreSQL, MS SQL Server
• PHP, CSS, HTML, JavaScript
• AWS, Snowflake
Experience
DATA ENGINEER 04/2020 to Current
Waystar (eSolutions) – Kansas City, KS
• Worked alongside Data Analysts and Business teams to build aggregates following the business requirements using Snowflake and Spark
• Took initiatives to improve communication and collaboration with offshore consultant team as well as established good coding practices to improve code quality and implement unit testing for the inhouse Spark Scala based ingestion framework application
• Improved the code review process and performed code reviews for offshore developers’ team on regular basis
• Assisted with ingesting data from legacy system to AWS S3 storage BIG DATA ENGINEERING CONSULTANT 03/2019 to 04/2020 Express Scripts - Kansas City, KS
• Delivered customer insight data to perform customer churn analysis by restoring a Java MapReduce application to use Spark capabilities using Scala API. Performed reverse-engineering to understand the application's functionality as well as improved the overall application performance by 33%.
• Rebuilt a Spark Scala application and gained performance improvement from 8 hours to 2 mins and 27 sec by researching and implementing Spark's inbuilt windowing functions capabilities.
• Achieved reusability of a Spark Scala application code by adding configuration enhancements
• Repaired a Spark Python application to be able to operate on the latest Hortonworks Data Platform environment to make sure to the timely delivery of data to the business users.
• Accomplished data migration from relational databases into Hadoop data lake by utilizing a configurable in-house Scala-based framework
• Leveraged the use of bash scripting to automate repetitive manual tasks to improve productivity as well as prevent errors.
.
.
BIG DATA INTERN 06/2018 to 08/2018
H&R Block - Kansas City, KS
• Improved productivity and ease-of-use of the Hadoop cluster, for the H&R Block data analytics team, by creating an interactive dynamic search dashboard using Hue with Apache Solr backend GRADUATE RESEARCH/GRADUATE TEACHING ASSISTANT 09/2017 to 05/2018 Kansas State University - Manhattan, KS
• Designed a Topic Model using Python's Gensim library to determine highly affected locations information from the tweets collected during a natural disaster. Detailed documentation regarding the project can be found at https://krex.k-state.edu/dspace/handle/2097/39337
• Mentored a class of 106 students for an undergraduate data management class through one-on-one meetings as well as grading assignments
SOFTWARE DEVELOPER 11/2016 to 09/2017
Network Computer Solutions - Manhattan, KS
• Maintained, developed and improved company's web-based applications with the use of PHP, JavaScript, HTML, CSS and MySQL
Education and Training
Kansas State University - Manhattan, KS Master of Science Computer Science, 2018
Kansas State University - Manhattan, KS Bachelor of Science Electrical Engineering, 2015