JAYANAGA PRASAD B
+91-903******* Email:
*******.******@*****.***
Professional Summary:
. 3.3 years of IT experience in Application Development in Java and
Hadoop.
. 1.7 years experience in Hadoop and its components like HDFS, Map
Reduce, Apache Pig, Hive, Sqoop, and HBase
. Extensive Experience in Setting Hadoop Cluster
. Good knowledge with Map Reduce Programming
. Involved in writing the Pig scripts to reduce the job execution time
. Have executed projects using Java technologies such as Core Java,
Servlets, Jsp, JavaScript
. Very well experienced in designing and developing both server side and
client side applications.
. Experience in using development tool like Eclipse IDE 3.x and Net
Beans 6.5
. Excellent communication, interpersonal, analytical skills, and strong
ability to perform as part of team.
. Exceptional ability to learn new concepts.
. Hard working and enthusiastic.
. Knowledge on Apache OOZIE and FLUME
Professional Experience:
. Working with Virtusa India Pvt Ltd, Hyderabad as a Hadoop Developer
since JULY 2010 -to till date.
Qualifications:
. Bachelor Of Computer Science from Andhra University, Andhra Pradesh,
INDIA, since: 2006-2009.
Skills:
. Languages: HTML, XML, SQL, Java, No Sql
. Operating Systems: Linux, Cent OS, Windows 95/98/NT/2000/XP
. Databases & Tools: Oracle 9i/8i, MySql, Sql Plus, Oracle Toad,
Eclipse, Net Beans
. Languages: HTML, XML, SQL, Java, NOSQL
. Hadoop Technologies: HDFS, Map Reduce, PIG, Hive, Sqoop, Hbase,oozie
. Others: Tortoise SVN, Win SCP, Putty
. Web Server: Tomcat
Project Details:
PROJECT #1:
Project Name : Target - Web Intelligence
Client : Target Minneapolis, Minnesota, USA.
Environment : Hadoop, Apache Pig, Hive, SQOOP, Java, UNIX,
MySQL
Duration : Feb 2012 to till Date
Role : Hadoop Developer
Project Description:
This Project is all about the rehousting of their (Target) current
existing project into Hadoop platform. Previously Target was using
mysql DB for storing their competitor's retailer's information.[The
Crawled web data]. Early Target use to have only 4 competitor
retailers namely Amazon.com, walmart.com etc
But as and when the competitor retailers are increasing the data
generated out of their web crawling is also increased massively and
which cannot be accomodable in a mysql kind of data box with the same
reason Target wants to move it Hadoop, where exactly we can handle
massive amount of data by means of its cluster nodes and also to
satisfy the scaling needs of the Target business operation.
Roles and Responsibilities:
. Moved all crawl data flat files generated from various retailers to
HDFS for further processing.
. Written the Apache PIG scripts to process the HDFS data.
. Created Hive tables to store the processed results in a tabular
format.
. Developed the sqoop scripts in order to make the interaction between
Pig and MySQL Database.
. Involved in gathering the requirements, designing, development and
testing
. Writing the script files for processing data and loading to HDFS
. Writing CLI commands using HDFS.
. Developed the UNIX shell scripts for creating the reports from Hive
data.
. Completely involved in the requirement analysis phase.
. Analyzing the requirement to setup a cluster
. Created two different users (hduser for performing hdfs operations and
map red user for performing map reduce operations only)
. Ensured NFS is configured for Name Node
. Setting Password less hadoop
. Setting up cron job to delete hadoop logs/local old job files/cluster
temp files
. Setup Hive with MySQL as a Remote Metastore
. Moved all log/text files generated by various products into HDFS
location
. Written Map Reduce code that will take input as log files and parse
the logs and structure them in tabular format to facilitate effective
querying on the log data
. Created External Hive Table on top of parsed data.
PROJECT #2:
Project Name : Outsourced data privacy
Client : CoreLogic, USA
Environment : java, java Script Jdbc, and Oracle Eclipse using
weblogic
Duration : July 2010 to January 2012.
Project Description:
a cloud computing setting in which similarity querying
of metric data is outsourced to a service provider. The data is to be
revealed only to trusted users, not to the service provider or anyone
else. Users query the server for the most similar data objects to a
query example. Outsourcing offers the data owner scalability and a low-
initial investment.
The need for privacy may be due to the data being
sensitive (e.g., in medicine), valuable, or otherwise confidential.
Given this setting, the paper presents techniques that transform the
data prior to supplying it to the service provider for similarity
queries on the transformed data. Our techniques provide interesting
trade-offs between query cost and accuracy. They are then further
extended to offer an intuitive privacy guarantee. Empirical studies
with real data demonstrate that the techniques are capable of offering
privacy while enabling efficient and accurate processing of similarity
queries.
Roles and Responsibilities:
. Developed User Outsourcing Data
. Generate the utilities to Brute and Anonymization - based Solution
. Deployment on Weblogic applications.