Post Job Free

Resume

Sign in

Data Analyst

Location:
Bentonville, AR
Posted:
January 24, 2021

Contact this candidate

Resume:

CONTACT

: adjnv6@r.postjobfree.com

: 201-***-****

In: www.linkedin.com/in/ashwini-bhoomi-

12b01182/

: Bentonville, Arkansas

SUMMARY:

Over 5 years of professional experience in Data Engineering and Data Analysis. Well versed in generating data driven solutions to improve efficiency of existing systems. Keenly interested in ETL and Business Intelligence to transform complex data into approachable insights. Robust knowledge in BigData, Hadoop EcoSystem, MapReduce, Hive, Spark and AWS. WORK EXPERIENCE:

Data Engineer

Cognizant Technologies Solutions Jan 2016 – Nov 2018

• Designed and implemented Data Integration and workflows on Big Data Technologies to implement business logic.

• Imported data into HDFS and Hive using Sqoop, created and loaded Hive tables, performed complex Hive queries.

• Successfully performed Hive tuning by using partitioning and bucketing.

• Performed complex joins, transformations, actions and optimized techniques in Spark on large datasets

• Developed Spark applications using Pyspark and SparkSQL

• Worked on Spark core, Data Frames and Paired RDD’s to develop PySpark applications

• Processed S3 data and created external tables on Hive and wrote scripts to ingest tables to be used across projects.

• Wrote shell scripts for Data Integration and to handle errors.

• Designed data models using Erwin for database development. Involved in data quality checks and auditing.

• Involved in developing data pipelines in Agile Scrum Methodology using JIRA and GIT Data Analyst

GlobalLogic Technologies Pvt Ltd., Hyderabad, India May 2014 – Jan 2016

• Developed SQL scripts for extensive Maps SQL database to ensure efficient data retrieval which improved the runtime by 10%.

• Designed and implemented ETL data process within SSIS package to implement business logic.

• Successfully created scalable data pipelines to ingest terabytes of data using SQL.

• Blended operations data from different data sources into Tableau Desktop to create complex reports, dashboards, story points for process improvement and reduced MTTR by 7%.

• Performed analysis on huge datasets including KPIs, data cleansing, validation, comparison, trend, predictive analysis to deliver data-driven insights that successfully were implemented to actions.

• Consulted with client product managers to record requirements and analyzed insights that lead to actionable and profitable growth.

• Optimized SQL scripts to efficiently handle partitioning and indexing, thereby reduced data load time by 5%.

• Provided support with the resolution of escalated tickets and acted as a liaison to business. Technical Analyst

IBM India Pvt Ltd., Hyderabad. India July 2012 – Jan 2014

• Administrated user support on Maximo Tool for Computer Maintenance Management System, IBM WebSphere to maintain and monitor client’s software to reduce labor cost.

• Responsible for planning defect prevention, root cause analysis and creating visual dashboards for the team that helped to achieve lean principles.

• Performed Maximo installations, configured, created instances in the database and modified custom Maximo applications.

• Experience in IBM’s remote desktop connections Software to provide privileges and access to clients, worked on data manipulation on active client accounts.

ACADEMIC PROJECTS:

Big Data Analytics:

• Ran MapReduce jobs in Hadoop to perform word, letter count and to find temperatures in NCDC Datasets.

• Optimized University database search by using Hive partitioning and reduced the search time by 10%.

• Converted Hive/SQL queries into Spark transformations using Spark RDDs and Scala. Implemented Spark using Scala to optimize the processing time and testing of data.

• Used Spark Streaming with Scala to track the most popular hashtags and average tweet length on twitter datasets. Titanic Survival Predictions:

• Predicted the survival rate of Titanic using the dataset available in Kaggle with 77% accuracy rate by applying KNN algorithm in Python and Decision trees in R.

• Used various Python Libraries – NumPy, Pandas, Matplotlib and Seaborn to preprocess, clean and visualize the data. Data Mining on Zoo Dataset:

• Worked on the Zoo dataset from UCI repository to predict animal “class type” based on the given variables that describe the animal.

• Built various classification models in R including Decision trees, Random Forest with less than 2% test error rate for best fit model.

Android Application Development:

• Developed an application in Android Studio in Java, XML, SQLite database called ‘The Right Vote’ which allows Students to participate in online voting for classroom elections. AWS:

• Conducted a presentation for a gathering of 150 people to discuss about technologies in cloud computing – Amazon SNS, SQS publish/subscribe models, how cloud outages can be handled and different open source support tools available for cloud computing.

EDUCATION:

Master’s in Computer Science 3.8 GPA

University of Central Missouri, MO May 2020

Bachelor’s in Computer Science 3.5 GPA

Jawaharlal Nehru Technological

University May 2012

DATABASES:

• MySQL, Oracle, MSSQL, Cassandra,

HBase

PROGRAMMING:

• SQL, Python, Scala, R, Shell scripting,

JavaScript, HTML, CSS & XML

ETL & REPORTING:

• SSIS, SSRS, Tableau

MOBILE DEVELOPMENT:

• Android Studio

BIG DATA TECHNOLOGIES:

• Hadoop Ecosystem, Hive, Spark, Pig,

Oozie, Airflow

STATISTICAL TECHNIQUES:

• Machine Learning algorithms

DEVELOPMENT TOOLS:

•Eclipse, RStudio, Jupyter, PLX, SQL

Developer and SSMS

CLOUD PLATFORM:

• AWS, Athena, Glue, Lambda,EMR,

Azure

CERTIFICATIONS:

• Python for Data Science and Machine

Learning (Udemy)

• Hive to Advanced Hive (Udemy)

ASHWINI BHOOMI



Contact this candidate