Engineer Intern Data

Location:
Rolla, MO
Posted:
December 15, 2022


Resume:

Vaishnavi Tumu

Rolla, MO ***** (flexible to relocate)

636-***-**** adt2u3@r.postjobfree.com LinkedIn: http://linkedin.com/in/vaishnavi-tumu

Education:

Missouri University of Science and Technology Rolla, MO

Master of Science in Computer Science Expected 12/2022

Relevant Coursework: Big Data and Cloud Management, Cyber Security and Data Science, Data Modeling and Visualization, Data Mining, Data Structures & Algorithms

Osmania University Hyderabad, India

Bachelor of Engineering in Computer Science 07/2020

Experience:

SmartBridge Hyderabad, India

Data Engineer Intern 06/2019 - 08/2019

• Developed a system to generate product likeability reports to enable product inventory prioritization.

• Deployed Apache Hadoop clusters to process the data, using HDFS as the underlying distributed filesystem.

• Developed MapReduce programs to pre-process and cleanse data in HDFS obtained from various sources, making it suitable for ingestion into a Hive schema for analysis.

• Used Pig to perform data transformations, event joins, filters, and pre-aggregations before storing the data in HDFS.

• Created Hive external tables with partitioning to store the processed data from MapReduce.

Projects:

Average Views for a Keyword in StackOverflow – MapReduce, Hive, Pig

• Developed a system to compute the average number of views for StackOverflow questions containing a given keyword.

• Deployed a Hadoop cluster in pseudo-distributed mode and used HDFS as the underlying file system.

• Ran MapReduce jobs on the dataset in HDFS to extract questions containing the given keyword.

• Created a Hive schema to enable easy querying with various keywords.

Smart Parking System – IoT with Data Science

• The project tracks the availability of parking spaces so that a car can be parked in a vacant spot, using an IR sensor installed in the parking lot.

• Used the Arduino IDE to program detection of free slots in the parking lot.

• The parking status is sent to the Blynk mobile app, which the admin uses to monitor the parking area.

Deploying Web Service on AWS – Python, AWS, IAM, EC2, RDS

• Developed a simple HTTP web server in Python to automate grade calculation.

• Deployed the server on an Amazon EC2 instance and managed access control using security groups to enable SSH and HTTP access to the instance.

• Used RDS as the underlying data store by creating the schema and table definitions.

• Created the necessary IAM roles to grant the EC2 instance access to the RDS instance.

• Gained a working understanding of VPCs, access control, and security groups.

• Used the AWS CLI to access AWS resources.

Skills:

Programming Languages: Python, C, C++, Java, JavaScript

Datastores: Apache HBase, HDFS, Hive, Pig, RDS, Spark, MySQL

Tools: Apache Spark, Apache Hadoop, MapReduce, Git, Weka

Libraries: NumPy, pandas

Others: AWS (EC2, IAM, S3, RDS, Lambda)


