Post Job Free

Resume

Sign in

Data Engineer

Location:
Nashville, TN
Posted:
March 28, 2020

Contact this candidate

Resume:

Iswarya Vardhini Vigneshwar

Data Enthusiast with 4+ years of experience.

ACADEMIC PROJECTS

Determining factors which affect Pneumonia immunization

**/**** - **/****

Developed a Linear Regression model for the Big data (BRFSS dataset) to determine the factors which influence Pneumonia immunization and its awareness for the past 5 years.

Data was stored in HDFS hosted in AWS EMR, developed MapReduce job using Apache Pig.

Data Cleanup and analysis were handled using pandas, NumPy and Scikit learn.

Developed dashboard using Tableau for visualization.

Prediction of Customer Churn in Telecommunication Company

05/2018 - 07/2018

Analyzed a telecommunication company to estimate the probability of customer churn in the next six months.

Developed a Logistic regression model using R to analyze 8000 current and old customer details.

WORK EXPERIENCE

Data Engineer

Wipro Technologies, India

PROJECTS:

Fujitsu Network Communication

01/2014 - 05/2017

Achieved fast performance by developing Apache Hive script to extract product data from HDFS to analyze the market trend.

Configured Hadoop clusters.

Performed data cleaning and data sorting to remove the irrelevant and missing values in the raw data set using pandas, numpy (EDA) in Python.

Responsible for designing and development of jobs to ingest data into clusters using Talend.

Identified trends, relationships, and metrics from raw data in the database (SQL Server).

Efficiently utilized Tableau to develop dashboards to explain and provide an in-depth analysis of the performance of the devices in the market.

Implemented PLSQL stored procedures to assign the values for shelves, racks, modify the results and perform functions.

Developed Web scraping script using Python-Beautiful soup to extract the product news.

Worked on a module to develop the logistic regression model to estimate the customer churn using Python.

Liaised with Onsite Team for knowledge sharing, documentation, deliverables.

SKILL SET

EDUCATION

Master of Science in Information Technology,

Middle Georgia State University, USA

01/2018 - 05/2019 4.0

Bachelor of Engineering, Electronics and

Communication Engineering,

Anna University, India

08/2008 - 05/2012

TOOLS

Scripting

Ipython Notebook

Database

SQL Server, MySQL, PostgreSQL, Oracle 11g

HDFS

ETL

Talend

Business Intelligence

Tableau, IBM SPSS

Project Management

GitHub, JIRA, Confluence, Jenkins

AWS

AWS Kinesis Data Stream, AWS Kinesis Fire Hose

AWS DynamoDB, AWS S3, AWS Redshift,

AWS Glue, AWS Athena,

AWS EMR, AWS EC2

ACHIEVEMENT

Awarded with Honors in Master of Science in Information Technology and qualified as a member in International

Society of Alpha Iota Mu.

Catholic Health Initiatives

12/2012 - 12/2013

Expertise in setting Automation Environment and Testing Web application using Selenium.

Implemented Keyword Driven (Robot) using Selenium with Python.

Independently developed Test Script for Modules in Agile Methodology.

Automated the testing of Healthcare Web application using Selenium and Python. Used Internal Framework for the execution.

Hands-on Experience in preparing Test plans, Test Cases.

Developed reusable functions and created driver scripts for batch execution.

Expertise in Microsoft Excel (pivot table, vlookup, etc.)

Reduced Regression Testing complexity by scheduling through Jenkins, effectively managed version control through GIT. Updated issues, bugs, and efforts in JIRA.

TRAINING AND CERTIFICATIONS

Google Analytics Individual Qualification

-Google Ads

AWS Certified Big Data Specialty 2020

-Udemy

AWS Cloud Practitioner Essentials

-AWS Training and Certification

Data Analytics Fundamentals

-AWS Training and Certification

Hadoop Starter Kit

-Udemy

adcicd@r.postjobfree.com +1-201-***-**** 135, Knolls Place, Nashville, TN 37211 linkedin.com/in/iswarya-vardhini-vigneshwar-8bb475135

Apache Hive

Apache Pig

Hadoop

SQL

Python

Data Analysis

Data Engineering

Big Data

AWS

Numpy

Pandas

Google Analytics

ETL

Agile Methodology



Contact this candidate