Vaishnavi
Email: ******.***@*****.*** Phone: +1-973-***-****
PROFESSIONAL SUMMARY:
Around 7 years of professional experience in field of Information Technology that includes analysis, design, development and testing of complex applications.
Working knowledge on all phases of Software Development Life Cycle (SDLC). Ability to track projects from inception to deployment.
Excellent understanding / knowledge of Hadoop architecture and various components such as HDFS, Job Tracker, Task Tracker, Name Node, DataNode and MapReduce programming paradigm.
Experience in automating the Hadoop Installation, configuration and maintaining the cluster.
Experience in developing applications using Map Reduce, Pig and Hive, Sqoop, Flume, OOZIE, Spark and Scala.
Extending Hive and Pig core functionality by writing custom UDFs.
Experience in writing MapReduce Programs and using Apache Hadoop API for analyzing the logs.
Experience in designing both time driven and data driven automated workflows using Oozie.
Expert in importing and exporting data into HDFS and Hive using Sqoop.
Expertise in creating Hadoop Clusters using AWS like Amazon EMR, Amazon EC2 & Amazon S3 bucket.
Experience on Core java technology including multithreading, JDBC, RMI, and network programming
Expert in setting up SSH, SCP, SFTP connectivity between UNIX hosts.
Experience in Hadoop Shell commands, writing MapReduce Programs, verifying managing and reviewing Hadoop Log files.
Excellent Understanding of from Amazon Web Services (AWS) services like EC2, S3, EBS, RDS and VPC.
Experience in Hadoop cluster performance tuning by gathering and analyzing the existing infrastructure.
Experience in working with Flume to load the log data from multiple sources directly into HDFS.
Experience in design, development, Unit testing, integration, debugging and implementation and production support, understanding business application, business data flow and data relations from them.
Experience with Data Warehousing and Pentaho Data Integration Tool.
Experience in writing SQL and PL/SQL scripts & stored procedures for databases like Oracle 9i.
Ability to quickly ramp up and start producing results on given any tool or technology.
An individual with excellent communication skills and strong business acumen
Team player with creative problem solving skills, technical competency and leadership skills.
EDUCATION:
Masters in Information Systems
Bachelor of Technology (Electronics and Communication)
TECHNICAL SKILLS:
Hadoop Ecosystem
MapReduce, HDFS, Hive, Pig, Sqoop ZooKeeper, Oozie, Flume, HBase
Language
C, Java, J2EE, Python, UML
Framework
Spring, Hibernate, Struts, MVC
Web Technologies
JavaScript, JDBC, Unix/Linux Shell Scripting, HTML, XML
Methodologies
Agile, Scrum, UML, Design Patterns
Databases
Oracle9i, MS SQL server, MySQL, HBase
Application/Web server
Apache Tomcat, WebSphere, JBoss, AWS
IDE’s
Eclipse, NetBeans
ETL Tool
Pentaho
Version Controls
SVN, CVS, Visual SourceSafe(VSS)
Operating System
Windows 98/NT/2000/2003/XP/7, Linux
PROFESSIONAL EXPERIENCE:
The Doctors Company, Hoffman Estates, IL Feb 2016-Present
Role: Hadoop Developer
Involved in full development cycle of planning, Analysis, Design, Development, Testing and Implementation.
Launched and Setup of HADOOP/ HBASE Cluster which includes configuring different components of HADOOP and HBASE Cluster using on Linux.
Experienced in loading data from UNIX local file system to HDFS.
Developed MapReduce programs in Java for parsing the raw data and populating staging tables.
Created Hive queries to compare the raw data with EDW reference tables and performing aggregates.
Experienced on loading and transforming of large sets of structured, semi structured and unstructured data.
Installed and configured Flume, Hive, Pig, Sqoop and Oozie on the Hadoop cluster.
Managed and scheduled Jobs on a Hadoop cluster.
Migrated the existing RFISS system to Hadoop by extracting files from RDBMS through Sqoop and placed in HDFS and processed.
Performance tuning and troubleshooting of MapReduce jobs by analyzing and reviewing Hadoop log files.
Experienced in defining job flows.
Installed and configured Hive and also written Hive UDFs.
Involved in creating Hive tables, loading with data and writing hive queries.
Developed Hive queries for the analysts.
Cluster co-ordination services through ZooKeeper.
Collected the logs data from Web Servers and integrated in to HDFS using Flume.
Environment: Core Java 1.6, Hadoop 1.2.1, Linux, Hive, HBase, HDFS, Flume, Sqoop, MapReduce programming, Oozie.
HomeServe USA - Norwalk, CT Jan 2015 – Jan 2016
Role: Hadoop Developer
Responsible for coding MapReduce program, Hive queries, testing and debugging the Map Reduce programs.
Developed multiple MapReduce jobs in java for data cleaning and preprocessing.
Designed and developed Pentaho jobs and transformations to load data into dimensions and facts
Involved in loading data from UNIX file system to HDFS.
Installed and configured Hive and also written Hive UDFs.
Involved in creating Hive tables, loading with data and writing Hive queries which will run internally in MapReduce way.
Involved in identifying possible ways to improve the efficiency of the system
Gained very good business knowledge on health insurance, claim processing, fraud suspect identification, appeals process etc.
Environment: Hadoop, MapReduce, HDFS, Hive, Java (jdk1.6), Oozie, Pentaho.
Hitachi Solutions, IN Jan 2014 – Dec 2014
Role: SQL Developer
Involved in development of Software Development Life Cycle (SDLC) and UML diagrams like Use Case Diagrams, Class Diagrams and Sequence Diagrams to represent the detail in design phase
Created new tables, views, indexes and user defined functions.
Performed daily database backup & restoration and monitor the performance of Database Server.
Actively designed database to fasten certain daily jobs and stored procedures.
Optimized query performance by creating indexes.
Developed Stored Procedures, Views to be used to supply data for all reports. Complex formulas were used to show derived fields and to format data based on specific conditions.
Involved in Administration of SQL Server by creating users & login ids with appropriate roles & grant privileges to users and roles. Worked on authentication modules to provide controlled access to users on various modules
Created joins and sub-queries for complex queries involving multiple tables.
Developed stored procedures and triggers using PL/SQL in order to calculate and update tables to implement business logic.
Responsible for report generation using SQL Server Reporting Services (SSRS) and Crystal Reports based on business requirements.
Developed complex SQL queries to perform efficient data retrieval operations including stored procedures, triggers etc.
Designed and Implemented tables and indexes using SQL Server.
Environment: Eclipse, Java/J2EE, Oracle, HTML, PL/SQL, Oracle, XML, SQL.
Lloyds Banking Group, IN
Role: Programmer Analyst Jan 2012– Dec 2013
Developed SQL Scripts to perform different joins, sub queries, nested querying, Insert/Update and delete data in MS SQL database tables.
Experience on modeling principles, database design and programming, creating E-R diagrams and data relationships to design a database.
Experience in writing PL/SQL and in developing and implementing Stored Procedures, Packages and Triggers.
Responsible for designing advance SQL queries, Cursor and Triggers.
Build data connection to database using MS SQL server.
Worked on project to extract data from XML file to SQL table and generate file reporting using SQL server 2008.
Utilized Tomcat webserver for development purpose.
Involved in creation of test cases and performing unit testing.
Environment: PL/SQL, My SQL, SQL Server 2008(SSRS & SSIS), Visual studio 2000/2005, MS Excel
GM, India Jan 2008 –Dec 2011
Role: Java/J2EE Developer
Experience in requirements gathering, Analysis, Design and Testing phases.
As part of the Design phase, designed state, class, and sequence diagrams using Astah Professional.
Experience in working with Scrum Methodologies.
Coded Struts Action classes and Model classes.
Developed DAO classes using JDBC API and wrote SQL queries to interact with Oracle Database.
Handled all bug fixes and enhancements.
Hands on experience on JUnit framework and EasyMock.
Utilized Log4j for logging and Putty tool to check the server logs.
Used SoapUI tool to invoke the Web services.
Experience in working on Apache ANT as build tool and CVS as repository.
Used ANT as a build tool and developed Buildfile.
Environment: Java 1.5, J2EE, Struts 1.2, JavaScripts, JDBC, Log4j, SOAP, JUnit, WebSphere.