Data Developer

Location:

Mumbai, Maharashtra, India

Posted:

September 24, 2020

Contact this candidate

Resume:

MUMBAI

CURRICULUM VITAE

Khushboo Shivangi

Mobile: +91-750*******

Mailto: ********.********@*****.***

Dob: 27th Nov 1991

OBJECTIVE

Looking forward to pursue a dynamic and challenging career in IT industry, demanding critical thinking and innovation, with an organization of repute, a career that adds value to the organization and at the same time offers opportunity to enhance my technical skills

Experience Summary

Around 4+ years of rich experience of working in Vistaar Systems Pvt Ltd, Mumbai as a Software Engineer.

Good knowledge of Hadoop Architecture and various components such as PySpark, HDFS, Hive, Sqoop, MapReduce, Linux, python.

Experienced in python spark data transformations using groupByKey, reduceByKey, aggregateByKey and combineByKey

Experience in Spark Core, Spark SQL.

Developed and designed a 10-node Hadoop cluster for data analysis.

It involves end to end ETL (Extract, Transfer & Load) process.

Load and transform large sets of structured and semi structured data.

Experience in analyzing data using Hive QL.

Regularly tune performance of Hive queries to improve data processing and retrieving.

Implemented Partitioning, Dynamic Partitions, Buckets, indexing, replication in Hive

Experience in importing and exporting data using Sqoop from RDBMS to HDFS and vice-versa.

Collecting and Aggregating large amount of Log data using Apache Flume and storing in HDFS for further analysis.

Used python spark API to perform analytics on data.

Implemented Data Integrity and Data Quality checks in Hadoop using Hive and Linux scripts

Experience in designing and developing high performance utilities system.

Worked in complete Software Development Life Cycle (analysis, design, development, testing, implementation and support) using Agile Methodologies.

Hands on experience in creating various database objects like tables, stored procedures, functions using SQL, PL/SQL.

Good technical, communication, analytical and problem solving skills ability to get on well with people including cross- cultural backgrounds and trouble – shooting capabilities.

Developing and testing data Ingestion/Preparation/Dispatch jobs using spark and hive. Technical Skills

Hadoop EcoSystem HDFS, Hive, Sqoop, Spark, SQL, PL/SQL, Linux MUMBAI

Scripting Languages Shell Sripting, Java Scriprting, Python, pyspark Programming Languages Java, C Programming

Operating Systems Microsoft Windows, Linux

Database Oracle 11g

Tools Eclipse

Projects:

Project 1 Gallo Pricing

Client name E. & J. Gallo Winery

Period 1st Jul 2016 to 1st Jan 2019

Description The project was about price management solution for Gallo. Empowers business and product planners to establish product portfolios, market segment strategies, manage list prices, analyze market intelligence and forecast revenues.

Role/ Responsibilities 1) PL/SQL and JavaScript Developer

• Worked on Model related changes includes Views, MViews, Procedures.

• Performance tuning of Views and MViews using managing joins, Gather stats, Rebuilding indexes etc.

Onboarding projects – ETL Job using SQL code.

• Developed JavaScript code for enhancements:

Managing product group

CQD – Cumulative Quantity Discount

Dashboard Implementation

Solution Environment SQL,PL/SQL, Linux, JavaScript Project 2 Reimbursement Template

Client name Diageo, Brown Forman, Sazerac, Pernod, Treasury Wine Estates, SMWE Period 1st Feb 2019 to Till date

Description The project was about reimbursement generation for various clients like Diageo, Brown Forman, Sazerac,Pernod,TWE,SMWE. Includes matching with respect to plan deal, depletion & actual sell. MUMBAI

Role/ Responsibilities 1) Hadoop Developer

• Involved in importing inbound data (Ex- Metadata, Inbound Measure data) from RDBMS to HDFS using sqoop.

• Designed data warehouse using Hive, created and managed Hive tables.

• Hive tables with partitioning and bucketing concepts

• Regularly tune performance of Hive queries to improve data processing and retrieving

• Developed spark code using python for faster testing and processing of data.

• Live data streaming for pricing markets using spark MLLib

• Created Linux shell Scripts to automate the daily ingestion.

• Managing and reviewing log files

• Run Hadoop streaming jobs to process terabytes of XML data

• Designed data ingestion and transformation pipeline.

• Implemented Data Integrity and Data Quality checks in Hadoop using Hive and Linux scripts

• Used Spark SQL to process the JSON data and insert into the tables for processing

2) SQL and PL/SQL Developer

• Data integration with inbound and outbound data

3) Automation:

• Weekly user stats report generation

• Data validation report

4) Application Support

Solution Environment HDFS, Hive, Sqoop, Spark, SQL,PL/SQL, Linux Education Summary

DEGREE BOARD/UNIVERSITY YEAR OF

PASSING

PERCENTAGE

MCA TIMSCDR, Mumbai University 2016 7.54(CGPA)

BCA Patna University

2013 81.83%

H.S.C CBSE 2009 73.2%

S.S.C CBSE 2007 73.8%

Experience Details

Vistaar System

Experience

4 Year(s), 1 Month(s)

Prev. Experience 0 Year(s),0 Month(s)

Total Experience 4 Year(s), 1 Month(s)

MUMBAI

Declaration

I do hereby declare that the above information furnished by me is true to the best of my knowledge and belief.

- KHUSHBOO SHIVANGI

Contact this candidate