RHEA SHARMA
Arlington, TX ***** 682-***-**** ************@*****.** https://www.linkedin.com/in/rheasharma22/ Summary
Looking for full-time opportunities as Data Engineer/Software Engineer/Data Scientist/Data Analyst/Full Stack Developer. Experience
Database Engineer Intern - Acxiom Corp. Jun 2019 - Aug 2019 (Austin, USA)
● Wrangled test data & further executed queries to simulate actual data cluster. Scripted in shell to generate cluster reports displaying user usage statistics.
● Monitored & prioritized alerts & incidents on Cloudera Manager. Fixed bugs for alerts displayed on the same.
● Executed SQL queries using Hive, Pig & Impala creating Oozie workflows. Ran Spark & Yarn jobs like word count to understand basic functionality. Improvised monitoring dashboard for Hadoop team. Data Science Intern - Sakha Global May 2017 - Jul 2017 (Bangalore, India)
● Designed & developed a "Resume Parser" in Python and MongoDB, which extracts important skills of candidates from their resumes supporting two file formats (PDF/Word).
● Created pipelines & implemented different Machine Learning algorithms (Decision Trees, K-Nearest Neighbours - KNN, LinearSVC), to improve the accuracy of the model.
● Used Natural Language Processing - NLP techniques like tokenization, along with Regular Expressions for extracting data.
Education
Master of Science: Computer Science - The University of Texas At Arlington GPA: 3.7/4.0 - May 2020 Coursework: Machine Learning, Neural Networks, Advance Networking, Big Data, Statistics, Software Engineering, Design
& Analysis of Algorithms, Data Mining, Database Systems, Software Testing, Web Data Management, Data Warehousing and Business Intelligence.
Bachelor of Engineering: Computer Science - PES Institute of Technology GPA: 3.5/4.0 – May 2018 Coursework: Data Analytics, Machine Learning, Statistics, Data Structures, Web Development, Software Engineering. Skills
● Languages: Python, Java, SQL.
● Web Development: HTML, CSS, JavaScript,
PHP, Bootstrap, AJAX.
● Frameworks: Laravel, Flask, Django.
● Databases: MySQL, PostgreSQL, OracleDB, MongoDB, Hadoop.
● Data Visualization: SAP BI, SAP BusinessObjects, Tableau.
● Technologies: Unix/Linux, Windows, Mac, Jupyter Notebook, Atom, PyCharm, Git, Docker, Netbeans IDE, Eclipse, MS Office, MS Excel, MS Sharepoint.
Academic Projects
● Data Warehousing & Business Intelligence (Spring 2020): Visualization of data using SAP BI, SAP BusinessObjects.
● Neural Networks (Fall 2019): Implementing perceptron, multilayer NN, CNN using Tensorflow in Python
● Machine Learning in Python (Fall 2019): Classification of data using Decision Tree, Naïve Bayes and K-Means clustering algorithms. Linear Regression implementation using K-Folds Cross validation.
● Big Data (Spring 2019): Developed & implemented simple graph processing & matrix multiplication algorithms
(Hadoop Map-Reduce, Spark, Pig and Hive) in Java, Pig Latin Script and HiveQL on Cloud Computing platform: SDSC Comet.
● Car Parking System (Spring 2019): Designed & developed an android app for a Car Parking System using the underlying models of Software Engineering - Agile.
● Hospital Database Management System (Fall 2018): Backend in Java. GUI made use of Java JPanel & Swing classes through Netbeans IDE on OracleDB.
● Movie Booking and Review System (Spring 2016): Designed a movie booking & review website using HTML, CSS, PHP, Bootstrap, JavaScript, & AJAX techniques.