Project Data

Location:

Hyderabad, AP, India

Posted:

January 27, 2014

Contact this candidate

Resume:

JAYANAGA PRASAD B

+91-903******* Email:

*******.******@*****.***

Professional Summary:

. 3.3 years of IT experience in Application Development in Java and

Hadoop.

. 1.7 years experience in Hadoop and its components like HDFS, Map

Reduce, Apache Pig, Hive, Sqoop, and HBase

. Extensive Experience in Setting Hadoop Cluster

. Good knowledge with Map Reduce Programming

. Involved in writing the Pig scripts to reduce the job execution time

. Have executed projects using Java technologies such as Core Java,

Servlets, Jsp, JavaScript

. Very well experienced in designing and developing both server side and

client side applications.

. Experience in using development tool like Eclipse IDE 3.x and Net

Beans 6.5

. Excellent communication, interpersonal, analytical skills, and strong

ability to perform as part of team.

. Exceptional ability to learn new concepts.

. Hard working and enthusiastic.

. Knowledge on Apache OOZIE and FLUME

Professional Experience:

. Working with Virtusa India Pvt Ltd, Hyderabad as a Hadoop Developer

since JULY 2010 -to till date.

Qualifications:

. Bachelor Of Computer Science from Andhra University, Andhra Pradesh,

INDIA, since: 2006-2009.

Skills:

. Languages: HTML, XML, SQL, Java, No Sql

. Operating Systems: Linux, Cent OS, Windows 95/98/NT/2000/XP

. Databases & Tools: Oracle 9i/8i, MySql, Sql Plus, Oracle Toad,

Eclipse, Net Beans

. Languages: HTML, XML, SQL, Java, NOSQL

. Hadoop Technologies: HDFS, Map Reduce, PIG, Hive, Sqoop, Hbase,oozie

. Others: Tortoise SVN, Win SCP, Putty

. Web Server: Tomcat

Project Details:

PROJECT #1:

Project Name : Target - Web Intelligence

Client : Target Minneapolis, Minnesota, USA.

Environment : Hadoop, Apache Pig, Hive, SQOOP, Java, UNIX,

MySQL

Duration : Feb 2012 to till Date

Role : Hadoop Developer

Project Description:

This Project is all about the rehousting of their (Target) current

existing project into Hadoop platform. Previously Target was using

mysql DB for storing their competitor's retailer's information.[The

Crawled web data]. Early Target use to have only 4 competitor

retailers namely Amazon.com, walmart.com etc

But as and when the competitor retailers are increasing the data

generated out of their web crawling is also increased massively and

which cannot be accomodable in a mysql kind of data box with the same

reason Target wants to move it Hadoop, where exactly we can handle

massive amount of data by means of its cluster nodes and also to

satisfy the scaling needs of the Target business operation.

Roles and Responsibilities:

. Moved all crawl data flat files generated from various retailers to

HDFS for further processing.

. Written the Apache PIG scripts to process the HDFS data.

. Created Hive tables to store the processed results in a tabular

format.

. Developed the sqoop scripts in order to make the interaction between

Pig and MySQL Database.

. Involved in gathering the requirements, designing, development and

testing

. Writing the script files for processing data and loading to HDFS

. Writing CLI commands using HDFS.

. Developed the UNIX shell scripts for creating the reports from Hive

data.

. Completely involved in the requirement analysis phase.

. Analyzing the requirement to setup a cluster

. Created two different users (hduser for performing hdfs operations and

map red user for performing map reduce operations only)

. Ensured NFS is configured for Name Node

. Setting Password less hadoop

. Setting up cron job to delete hadoop logs/local old job files/cluster

temp files

. Setup Hive with MySQL as a Remote Metastore

. Moved all log/text files generated by various products into HDFS

location

. Written Map Reduce code that will take input as log files and parse

the logs and structure them in tabular format to facilitate effective

querying on the log data

. Created External Hive Table on top of parsed data.

PROJECT #2:

Project Name : Outsourced data privacy

Client : CoreLogic, USA

Environment : java, java Script Jdbc, and Oracle Eclipse using

weblogic

Duration : July 2010 to January 2012.

Project Description:

a cloud computing setting in which similarity querying

of metric data is outsourced to a service provider. The data is to be

revealed only to trusted users, not to the service provider or anyone

else. Users query the server for the most similar data objects to a

query example. Outsourcing offers the data owner scalability and a low-

initial investment.

The need for privacy may be due to the data being

sensitive (e.g., in medicine), valuable, or otherwise confidential.

Given this setting, the paper presents techniques that transform the

data prior to supplying it to the service provider for similarity

queries on the transformed data. Our techniques provide interesting

trade-offs between query cost and accuracy. They are then further

extended to offer an intuitive privacy guarantee. Empirical studies

with real data demonstrate that the techniques are capable of offering

privacy while enabling efficient and accurate processing of similarity

queries.

Roles and Responsibilities:

. Developed User Outsourcing Data

. Generate the utilities to Brute and Anonymization - based Solution

. Deployment on Weblogic applications.

Contact this candidate