Post Job Free
Sign in

Engineer Data

Location:
India
Posted:
August 10, 2014

Contact this candidate

Resume:

Dijin Thomas Mob-+91-905*******

***********@*****.***

Objective

To work for an organization, which provides continuous exposure to cutting

edge technologies, where I can learn continuously, implement my knowledge

and be a value addition to the company.

IT - Experience Summary

Having a total 3.6 years of experience with Infosys Technologies Limited.

(Feb 2012-Present)

Worked as trainee in Cognizant Technologies for 3 months.

(Nov 2011 - Jan 2012)

Skill-Set

Big Data : Hadoop (HDFS), Pig, Hive, Sqoop, Map Reduce,

Oozie, MR Testing

Programming Languages : Java (core), PL/SQL

RDBMS : MySQL, Oracle

Operating System : Windows XP/7, MS-DOS, Ubuntu, CentOS.

Tools : Eclipse (3.3), SQL, Windows Server 2000,

CICS, TSO, Lotus notes, Clear Quest,

FileZilla,

Cloudera 5(VM)

Soft Skills : Quick learner, discipline, effective team

player,

Efficient and eager to learn

DETAILS OF THE PROJECTS WORKED ON

01 Project Name : Re-Hosting of Web Intelligence Project for

WallmartLabs

Client : WallmartLabs

Role : Systems Engineer/Senior Systems Engineer

Responsibility ( Moving all web crawling data flat files

generated from

various retailers to HDFS for further

processing.

( Written the Apache PIG scripts to process

the HDFS data.

(Created Pig UDFs to extract and transform

the data.

( Created HIVE tables to store the

processed results in

tabular format.

(Make the data fix to resolve the issue.

( Developed the Sqoop scripts in order to

make

interaction between HDFS and MySQL.

Software : Hadoop (HDFS), Mapreduce, Pig, Hive, Sqoop,

Java

02 POC(Internal) : Stack Overflow Data Analysis POC

Client : Infosys(Internal)

Role : Systems Engineer/Senior Systems Engineer

Responsibility (Written MapReduce jobs to count the number

of

posts & comments on a particular day from

XML

data set(posts.xml,comments.xml,users.xml)

from

Stackoverflow website.

(Written MapReduce jobs to determine the

first and the last

time a user commented and total number of

comments by

user.

(Written MapReduce job to find average and

median of

comment length per hour per day.

(Written Mapreduce job to count the number

of user from

each state using custom counters.

(Written Mapreduce job to extract the top

10 users based

on reputation

(Written Mapreduce job to create a

structured XML

hierarchy to nest comments with its related

posts.

(Written a Mapreduce job to perform a self

join to create a

question,answer and comment hierarchy

(Written Mapreduce job to partition the

records(user.xml)

based on the last access date,one

partition per year

from 2008-2014.

(Stored all the above Mapreduce job results

to the

corresponding Hive external tables .

(Moved the output data from HDFS to MySQL

using Sqoop.

(Created Hive tables using Sqoop commands.

Software : Hadoop (HDFS), Mapreduce, Pig, Hive, Sqoop,

FileZilla.

03 POC(Internal) : Retail Data processing and Data Analysis

POC.

Client : Infosys(Internal)

Role : Systems Engineer/Senior Systems Engineer

Responsibility (Written Apache PIG scripts to process the

HDFS data.

(Written Pig scripts to extract the product

information,

maximum sale price, minimum sale price

for the

different competitors (different Retail

Ids).

(Sliced the data based on the Product Type.

(Segregated the data based on different

Vertical Ids and

Retail Ids.

(Created HIVE tables to store the processed

results in tabular

format.

(Developed the Sqoop scripts in order to

make the

interaction between HDFS and MySQL

database.

(Created Hive tables directly using Sqoop

commands.

Software : Pig, Hive, Sqoop.

04 Project Name : Horizon Health Care

Client : Horizon BCBSNJ

Role : Systems Engineer/Senior Systems Engineer

Responsibility Writing SQL queries and Procedures as per

requirement

(Analyzing each issue and determining if

data fix is required.

(Basic analysis of each and every issue

coming to NMS

And assigning it to appropriate

applications.

(Interact with Horizon Business users in

case of any

question or clarifications.

(Make the data fix to resolve the issue.

(Planning and tracking of all the issues

fixed and

their turnaround Time.

(Identify the priority of the issue raised

and work accordingly.

(Work with configuration application

development teams

to determine enhancement and fix

priorities.

(Communicate workarounds or knowledge base

fix schedule.

(Providing daily and weekly reports for

Clients.

(Imparting the knowledge as well as making

the

counterparts aware of the new production

issues

Software : Windows Server 2000, CICS, TSO, Lotus

notes, Clear Quest

CERTIFICATIONS

# Certification Type

1 Infosys Certified Big Data - Hadoop Developer

2 Infosys Certified Big Data Hive Developer 202a

3 Infosys Certified Hadoop Core Developer

4 AHIP : Fundamentals of Healthcare - Part A

(America's Health Insurance Plan-Part-A)

5 AHIP : Fundamentals of Healthcare - Part B

(America's Health Insurance Plan-Part-B)

6 IMS ITIL Problem and Change Management

7 IQ Foundation Certification

8 STAR INFOSCION CERTIFICATION

Achievements

Awarded Employee of the Month: October, 2012 Infosys Technologies

Nominated for Employee of the Month: September, 2001 Infosys Technologies

Division level football player.

Educational Qualification

Course Institute Year of Marks

Passing

B.E. (I.T) SDBCT Indore (M.P) 2010 70.09%

Higher Secondary CBSE Mandsaur (M.P) 2006 69.4%

Secondary CBSE Mandsaur (M.P) 2004 63.4 %

DETAILS OF TRAINING UNDERGONE

# Trained On Period(in Days)

1 Business Tier using POJO 5.00

2 Client Tier using HTML and 3.00

Java Script

3 CS - Intermediate 2.00

Comprehensive Examination

4 Design and Analysis of 2.00

Algorithms - FPINT

5 Integration n Deployment 2.00

of Enterprise Application

6 Intro to Enterprise Appn 4.00

Dev

7 Introduction to Web 3.00

Technologies

8 IP Security and Legal 0.50

Issues

9 J2EE Comprehensive Exam-LC 3.00

10 J2EE POST Project-LC 10.00

11 Knowledge Management at 0.50

Infosys

12 Object Oriented 5.00

Programming - FPINT

13 Peristence Tier using 7.00

Oracle and JPA

14 Presentation Tier using 8.00

JSP and JSF

15 Programming 6.00

Practices-FPBRDG

16 RDBMS - Essentials - FPINT 9.00

17 RDBMS-FPBRDG 4.00

18 Software engineering and 3.00

IQS - FPINT

19 STAR FP - Business 0.50

Etiquette

20 STAR FP - Competition I 0.50

21 STAR FP - Competition III 0.50

22 STAR FP - English Lab 4.00

23 STAR FP -Basics of 0.50

Business

24 STAR FP -Global 0.50

Effectiveness

25 STAR FP -STAR Introduction 0.20

and Concepts

26 STAR FP-Professional Work 0.50

Culture

27 Unix - FPINT 4.00

28 User Experience Design - 2.00

FPINT

Personal Information

Address : C/o Mr. Y. Chinna Goud Flat No.203

Raidurgam, Gachibowli

Hyderabad (T.S)

Phone : +91-905*******

Email : ***********@*****.***

Nationality : Indian

Marital Status : Single

Languages Known : English, Hindi, Malayalam

Permanent Address

Address : House No.66 phase-2

Yash Nagar

Mhow - Neemuch Road

Mandsaur (M.P)

Phone : +91-905*******



Contact this candidate