MUMBAI
CURRICULUM VITAE
Khushboo Shivangi
Mobile: +91-750*******
Mailto: ********.********@*****.***
Dob: 27th Nov 1991
OBJECTIVE
Looking forward to pursue a dynamic and challenging career in IT industry, demanding critical thinking and innovation, with an organization of repute, a career that adds value to the organization and at the same time offers opportunity to enhance my technical skills
Experience Summary
Around 4+ years of rich experience of working in Vistaar Systems Pvt Ltd, Mumbai as a Software Engineer.
Good knowledge of Hadoop Architecture and various components such as PySpark, HDFS, Hive, Sqoop, MapReduce, Linux, python.
Experienced in python spark data transformations using groupByKey, reduceByKey, aggregateByKey and combineByKey
Experience in Spark Core, Spark SQL.
Developed and designed a 10-node Hadoop cluster for data analysis.
It involves end to end ETL (Extract, Transfer & Load) process.
Load and transform large sets of structured and semi structured data.
Experience in analyzing data using Hive QL.
Regularly tune performance of Hive queries to improve data processing and retrieving.
Implemented Partitioning, Dynamic Partitions, Buckets, indexing, replication in Hive
Experience in importing and exporting data using Sqoop from RDBMS to HDFS and vice-versa.
Collecting and Aggregating large amount of Log data using Apache Flume and storing in HDFS for further analysis.
Used python spark API to perform analytics on data.
Implemented Data Integrity and Data Quality checks in Hadoop using Hive and Linux scripts
Experience in designing and developing high performance utilities system.
Worked in complete Software Development Life Cycle (analysis, design, development, testing, implementation and support) using Agile Methodologies.
Hands on experience in creating various database objects like tables, stored procedures, functions using SQL, PL/SQL.
Good technical, communication, analytical and problem solving skills ability to get on well with people including cross- cultural backgrounds and trouble – shooting capabilities.
Developing and testing data Ingestion/Preparation/Dispatch jobs using spark and hive. Technical Skills
Hadoop EcoSystem HDFS, Hive, Sqoop, Spark, SQL, PL/SQL, Linux MUMBAI
Scripting Languages Shell Sripting, Java Scriprting, Python, pyspark Programming Languages Java, C Programming
Operating Systems Microsoft Windows, Linux
Database Oracle 11g
Tools Eclipse
Projects:
Project 1 Gallo Pricing
Client name E. & J. Gallo Winery
Period 1st Jul 2016 to 1st Jan 2019
Description The project was about price management solution for Gallo. Empowers business and product planners to establish product portfolios, market segment strategies, manage list prices, analyze market intelligence and forecast revenues.
Role/ Responsibilities 1) PL/SQL and JavaScript Developer
• Worked on Model related changes includes Views, MViews, Procedures.
• Performance tuning of Views and MViews using managing joins, Gather stats, Rebuilding indexes etc.
Onboarding projects – ETL Job using SQL code.
• Developed JavaScript code for enhancements:
Managing product group
CQD – Cumulative Quantity Discount
Dashboard Implementation
Solution Environment SQL,PL/SQL, Linux, JavaScript Project 2 Reimbursement Template
Client name Diageo, Brown Forman, Sazerac, Pernod, Treasury Wine Estates, SMWE Period 1st Feb 2019 to Till date
Description The project was about reimbursement generation for various clients like Diageo, Brown Forman, Sazerac,Pernod,TWE,SMWE. Includes matching with respect to plan deal, depletion & actual sell. MUMBAI
Role/ Responsibilities 1) Hadoop Developer
• Involved in importing inbound data (Ex- Metadata, Inbound Measure data) from RDBMS to HDFS using sqoop.
• Designed data warehouse using Hive, created and managed Hive tables.
• Hive tables with partitioning and bucketing concepts
• Regularly tune performance of Hive queries to improve data processing and retrieving
• Developed spark code using python for faster testing and processing of data.
• Live data streaming for pricing markets using spark MLLib
• Created Linux shell Scripts to automate the daily ingestion.
• Managing and reviewing log files
• Run Hadoop streaming jobs to process terabytes of XML data
• Designed data ingestion and transformation pipeline.
• Implemented Data Integrity and Data Quality checks in Hadoop using Hive and Linux scripts
• Used Spark SQL to process the JSON data and insert into the tables for processing
2) SQL and PL/SQL Developer
• Data integration with inbound and outbound data
3) Automation:
• Weekly user stats report generation
• Data validation report
4) Application Support
Solution Environment HDFS, Hive, Sqoop, Spark, SQL,PL/SQL, Linux Education Summary
DEGREE BOARD/UNIVERSITY YEAR OF
PASSING
PERCENTAGE
MCA TIMSCDR, Mumbai University 2016 7.54(CGPA)
BCA Patna University
2013 81.83%
H.S.C CBSE 2009 73.2%
S.S.C CBSE 2007 73.8%
Experience Details
Vistaar System
Experience
4 Year(s), 1 Month(s)
Prev. Experience 0 Year(s),0 Month(s)
Total Experience 4 Year(s), 1 Month(s)
MUMBAI
Declaration
I do hereby declare that the above information furnished by me is true to the best of my knowledge and belief.
- KHUSHBOO SHIVANGI