Post Job Free
Sign in

Data Project

Location:
Santa Clara, CA
Posted:
June 08, 2016

Contact this candidate

Resume:

Sowmya Pinnaka

Email : ***************@*****.***

Phone : +1-408-***-****

Experience Summary:

A professionally qualified HADOOP Developer and Sun certified Java Programmer (SCJP) & Sun Certified Web Component Developer (SCWCD) with sound academic credentials and having around 8 years of experience including 3 years in Big data ecosystem related technologies. My expertise includes J2SE, Web Technologies and implementation of HADOOP Technologies.

Key strengths:

Hands on experience in installing, configuring, and using Hadoop ecosystem components like Hadoop Map Reduce, HDFS, HIVE, PIG, SQOOP, Flume, Kafka.

HBase, Oozie and other Hadoop related eco-systems as a Data storage and retrieval systems.

Having experience in writing Map Reduce, PIG and Hive UDF’s to solve the purpose of utility classes.

Involved in writing Hive queries to load and process data in Hadoop File System.

Well-experienced Mapper, Reducer, Combiner, Partitioner, Sort and Shuffling along with Custom Partitioning for efficient Bucketing.

Involved in creating Hive tables, loading with data and writing hive queries which will run internally in map reduce way.

Involved in importing data to HDFS using SQOOP.

Good experience in design the jobs and transformations and load the data sequentially & parallel for initial and incremental loads.

Experience in designing both time driven and data driven automated workflows using Oozie.

Experience in loading logs from multiple sources directly into HDFS using Flume.

Strong Knowledge in Object Oriented Programming Concepts.

Proficient in Java, Servlets, JSP, JDBC, HTML, XML and Oracle 10g/11g.

Experience in web server like Tomcat.

Strong knowledge on Struts, Spring, Hibernate, Web services.

Proficient in Building, Deploying, Debugging and Testing of various project phases.

Experience in JUnit Testing Java/J2ee applications. TFS, Anthill Pro, Visual Studio Source Control, HP Service Manager, Squirrel sql client and TeamTrack.

Certifications:

Sun Certified Programmer for the Java Platform (SCJP), Standard Edition 1.5.

Sun Certified Web Component Developer for Java Platform (SCWCD), Standard Edition 1.5

Technical Skills Summary:

Web Technologies : Servlets, JSP, JSTL, JNDI, JDBC

Programming Languages : Java, JavaScript, C and C++

Big Data Technologies : Hadoop MapReduce, HDFS,

HIVE, PIG, SQOOP, Flume, and Ooozi.

RDBMS : Oracle 10g, 11g.

Servers : Apache-Tomcat, TFS.

Mark-up Languages : XML and HTML, CSS

Operating Systems : Windows 2008/NT/XP, UNIX.

Tools : JUnit, log4j, AnthillPro and Ant.

IDE’s : Eclipse, IntelliJ.

Educational Profile:

Master of Computer Applications (MCA) from Jawaharlal Nehru Technological University, Hyderabad, Andhra Pradesh, India – June 2008.

Bachelor of Science (B.Sc.) from Nagarjuna University, Guntur, Andhra Pradesh, India – Apr 2005.

Rewards and Recognition:

Recognized for work on Cite Advisor critical issues during Aug-2012.

Recognized for the project deliverables during Nov-2013.

Recognized with Bronze award for GAP project.

Professional Experience:

Project :

Calix Inc – xCarrier integration with Hadoop (Mar 2015 – till date), San Jose, CA

xCarrier integrates the information from existing data sources, consolidates them into a master data file,

feeds the information back to the sources and thus allows consistent and accurate data to be used across

the enterprise. Simple out-of-the box connectors enable seamless data transfer between on premise applications.

Responsibilities :

Configuration Management of a multi node Hadoop cluster for SQOOP and Hive.

Integrated xCarrier to connect with datastore using Streaming API’s.

Configured Flume to transfer the data to HDFS.

Involved in loading data from Linux file system to HDFS and bulk Loaded the cleaned data into HBase.

Migrated data from RDBMS to HBase to perform real time analytics.

Developing Map Reduce jobs, Hive & PIG scripts.

Involved in writing MR Unit test cases and results.

Involved in scheduling Oozie workflow engine to run multiple Hive and pig.

Importing and exporting data into HDFS and Hive using Sqoop. Creating Hive external tables using shared meta-store.

Technologies

: CDH 4.x, Linux, Oracle11g.

Project :

Thomson Reuters – Bermuda (Bermuda Deployment on Hadoop)(Sep 2013 – Mar 2015), Eagan, MN

Bermuda is a shared application developed in java to identify and validate legal citations and entity references in ampex and XML. To achieve the parallelization of data processing enabled Bermuda over multinode cluster for Ampex content.

Responsibilities :

Evaluated suitability of Hadoop and its ecosystem to the above project and implemented various proof of concept (POC) applications to eventually adopt them to benefit from the Big Data Hadoop initiative.

Estimated Software & Hardware requirements for the Name Node and Data Node & planning the cluster.

Configuration Management of a multi node Hadoop cluster for SQOOP and Hive.

Involved in creating Hive tables, loading data and writing hive queries.

Involved in importing data from relational database to HDFS using SQOOP.

Migrated data from RDBMS to Hbase to perform real time analytics.

Developing MapReduce jobs, Hive & PIG scripts.

Involved in writing MRUnit test cases and results.

Developed bash scripts to bring the log files from FTP server and then processing it to load into Hive tables.

Involved in POC phrase to replace traditional message broker using Apache Kafka.

Loaded the aggregated data into datameer for reports generation.

Worked with application teams to install operating system, Hadoop updates, patches and version upgrades as required.

Monitored System health and logs and respond accordingly to any warning or failure conditions.

Technologies:

Project :

CDH 4.x, Linux, Core Java (Jdk6), Oracle11g.

Thomson Reuters – Bermuda (Product Development and Support) (May 2010 – Mar 2015), Eagan, MN

Westlaw is one of the primary online legal research services for lawyers and legal professionals in the United States and is a part of Westgroup. In addition, it provides proprietary database services. Information resources on Westlaw include more than 40,000 databases of caselaw, statutes and federal statutes, administrative codes, newspaper and magazine articles, public records, law journals, law reviews, treatises, legal forms and other information resources.

Bermuda is a Java application designed to identify and validate legal citations and entity references in Ampex, XML or plain text other formatted content for our external customers. Bermuda is a shared system that creates explicit links to the legal documents and provides relationship information to data warehouse. It is used by Thomson Reuter’s products such as Westlaw, WestlawNext, WestKm, CiteAdvisor, BriefTools, WestCheck etc.

Responsibilities :

Performed project management routines.

Involved in daily technical status meetings and business requirement discussions.

Customization of existing tools according to requirement.

Working on codes and cases cites identification, bug fixes and enhancements of codes related engines. It includes enhancements in cases and codes.

Working on trackers to replace legacy systems (Mainframe).

Enhancement of Bermuda Test Client Tool.

Involved in Bermuda operations like Support, Builds & Release and Regression Testing. 24*7 production Support for critical production issues. Analyzed Java Core, Heap Dump.

Enhancement of Bermuda for Practical law contents and trademarks support.

Involved in queues enhancements for Bermuda job submission.

I have involved in development process and production release cycle.

Since all requirements come from US then I have to coordinate with stakeholders. Coordination between the team was the most challenging and critical task which I have efficiently managed.

GAP: provide solution and support to end users, enhancement in the product. I was module lead for this project for almost 2 yrs.

IPA: created new engines for Patents & Trademarks support.

WLN: DAO layer development.

Bermuda Monitor enhancements (tool to track the jobs submitted to Bermuda).

Technologies

:

Core Java (Jdk6), JUnit, Struts 1.2, XML, Tomcat 6, Oracle11g.

Project :

Thomson Reuters –CiteAdvisor-Formatter (Dec 2010 – Oct 2013), Eagan, MN

The CiteAdvisor identifies the cites in customer’s documents and suggest cite formats as required by the Bluebook, the ALWD Cite Manual and some local state jurisdictions.

The Formatter is a component developed for the CiteAdvisor project. The goal of the Formatter is to provide the formatted string according to a rule set defined by the user (Bluebook, ALWD, California etc…) for each citation found by Bermuda.

Responsibilities:

Communicated with customers about requirements and priority of tasks.

Involved in Cite Advisor operations like Support, Builds and Release.

Worked with the customers by understanding their business needs and provided an effective solution within the time limit.

For any issue, I have analyzed, discussed within the team for the impact via calls, e-mails as well as pre-testing with temporary code.

Collaboratively made the decisions with the help of risk leads for feasible solution among various applications to meet the needs of the client requirements.

Core Java (Jdk 6), Spring, JUnit, XML, Oracle 11g.

Technologies

:

Core Java (Jdk 6), Spring, JUnit, XML, Oracle 11g.

Project :

Thomson Reuters–Legal Professional Authority(Oct2010–Nov2010),Eagan, MN.

Legal Professional Authority is an application which is used to create, update and delete Legal Professional or Legal Organizations, namely, Attorney, Judge, Court, Law Firm, Patent Examiner, Patent Judge, Trademark Judge, Trademark Examiner, Arbitrator as well as Experts.

Responsibilities :

Enhancement of LPA Authority GUI page.

Communicated with in the team for the requirements.

Struts 1.2, JSP, Servlets.

Technologies

:

Struts 1.2, JSP, Servlets.

Project :

Infosage Systems India Private Limited – Market Detection & Response Management (Nov 2009 – May 2010),India.

The Market Detection & Response Management application is mainly focused on precise and constant monitoring of selling prices implemented by market competitors for the products at stores.

Application provides the functionality for the determination of groups / subgroups of items to be collected for the event of change in selling price, Updating the selling prices of multiple competitors at a time and preparing comparative reports for the selling prices with external & internal competitors.

Responsibilities :

Involved in Support, Builds & Release.

Analyzed and fixed bugs in the application as raised by the testing team.

Involved in writing Unit test cases and results.

Core Java(Jdk 1.5), Struts, JSP, Servlets, HTML, XML, JUnit,Oracle 10g.

Technologies

:

Core Java(Jdk 1.5), Struts, JSP, Servlets, HTML, XML, JUnit,Oracle 10g.

Project :

Infosage Systems India Private Limited India – Promotions Authorizing System (Aug 2008 – Oct 2009),India.

Promotions Authoring System centrally manage the definition of all promotions, special redemptions, Loyalty events and product updates that will be applied at point of sales by the promotions engine. A new function at point of sales is the ability to print coupons on customer receipt. The detailed definition of these coupons will also be managed by message libraries. Promotion Load has communication, converter, persistence, common layers for receiving, parsing, persist them into database. The primary function of PAS is to create and maintain promotion events that will be issued/exported to all stores that are associated with store group.

Responsibilities :

Involved in daily status meetings and business requirement discussions.

Involved in development of CPA User Interface, converter workbench and enhancements.

Involved in writing Unit test cases and results.

CoreJava 1.5,Struts, JSP,HTML,XML Servlets, JUnit, Oracle 10g.

Technologies

:

Core Java(Jdk1.5), Struts, JSP, Servlets,HTML,XML,JUnit,Oracle 10g.

Workshops/Trainings Attended:

Hadoop Training – Internal training in Thomson Reuters during Aug-2013

Hibernate Training - Internal training in Thomson Reuters during April-2013

Spring Training – Internal training in Thomson Reuters during Jan-2012

Legal domain Training in Thomson Reuters.



Contact this candidate