Post Job Free
Sign in

Data Project

Location:
United States
Posted:
February 26, 2015

Contact this candidate

Resume:

Jesu Darison Gerard

607-***-**** **********@**********.*****45,Seminary Ave Binghamton -13905

EDUCATION

Binghamton University, State University Of New York, Thomas J Watson School of Engineering Expected May 2015

Master of Science in Computer Science (GPA-3.5/4)

TECHNICAL

SKILLS

Big Data Skills:Hadoop -Mapreduce,Pig,Hive,HBase, OOzie,Flume,Scoop

Programming Languages : C,Core Java,Python,Shell/Bash scripting (Scripting Language)

Visualization and Virtualization software’s: Tableau Reporting Tool, VitualBox, VMWare,Microsoft Visio

Web Technologies: HTML, CSS, JavaScript

Applications: Visual Studio 2012,Eclipse,HP Quality Center 9.2,Software AG Web Methods Developer 8.2,Share Point

Operating Systems: Windows, UNIX(LINUX-Ubuntu,Centos)

WORK

EXPERIENCE

Hadoop Development Experience (Masters 2014)

• Developed map-reduce jobs to query data stored in Hadoop Distributed File System (HDFS)

• Developed Pig scripts to transform data in HDFS and performed several operations like FILTER,GROUPBY,JOIN etc

• Written Pig UDF's (User Defined Functions using Python) and registered the new UDF’s written into the pig script

• Worked on splitting, joining large data sets using Pig which can consume structured, semi-structured and unstructured data

• Executed Hive (HQL) queries to analyze Big data and created Managed tables, External tables in which data is well organized

• Used scoop to transfer data between HDFS and RDBMS and used authentication methods to have protected data transfers

• Populated HDFS with data using Flume from different data sources

• Built and orchestrated workflows in Oozie and scheduled jobs that are triggered by time (frequency) and data availability

Core Java, Middleware and Testing Experience (Infosys Ltd 2011 Aug -2013 Dec)

• Implemented Design Patterns like Decorator, Template method patterns in Java which makes code maintainability better.

• Worked as part of integration project for Infosys Client DHL (the global market leader in the Logistics industry)..

• Interacted with Business Analysts for clarifications and analyzed mapping specs required for data transformation

• Performed Data Transformations between heterogeneous systems (Oracle Enterprise One, SAP, Mainframes).

• Developed code in Web Methods Flow (Graphical Programming Language) as per business requirements and executed in Web Methods

Integration Server.

• Worked as part of Quality Assurance Team for Infosys Client AVON (an American international manufacturer and distributor of beauty,

household, and personal care Company).

• Created Test Scripts for modules and performed functional and regression testing.

• Performed Test Case Execution and recorded test results in Quality Center.

• Extensive knowledge in understanding and gathering information from Business requirements, Business architecture diagrams (BAR),

Functional Specification Documents, SRS (Software Requirement Specifications).

• Strong knowledge of Software Development Life Cycle (SDLC) process and Software engineering principles.

PROJECTS

1. Music Store Application(Design Patterns):

Designed Music store application in Java using design patterns like SimpleFactory, Builder,Visitor, Observer, State Patterns

2. Data Analysis on Online shopping website data:

• Purpose of this project is to store terabytes of log information generated by ecommerce website and extract meaningful

information from it. The data was stored in HDFS, which includes raw html data from the website. Processed product, pricing

information using Hive and made sentiment analysis on reviews of products

• Exported the sentiment analysis results data to Tableau (a reporting tool) for creating dashboards

3. Data Analysis on Log files:

• Moved all log data from individual servers to HDFS as main log management system and performed data analysis using Pig and

Hive. Used Flume to periodically move the log data into HDFS .Exported data to Tableau reporting tool and created dashboards.

The main purpose of the project is to mine large amount of log files, extract meaningful information from it and visualize the

results using Tableau.

EXTRA CURRICULAR

ACTIVITIES

CCNA

• Completed CCNA (Cisco Certified Network Associate 640-820) Routing and Switching.

• Trained in Networking fundamentals (Sub netting, Network Address Translation,IPV6,DHCP,IP Address structure)

Leadership Volunteer Member

• Assisted the university staff in welcoming the international students for the International student’s orientation day.



Contact this candidate