Sign in

Data Computer Science

Lake Saint Louis, Missouri, United States
April 09, 2018

Contact this candidate



**** ******* ***, *'Fallon, MO *3368 C: 404-***-****


Hadoop Developer adept at application development, testing and optimization. Excels at Java and Python application development, including coordinating ground-up planning, programming and implementation for core modules.


Languages : Big Data Ecosystem :

Java, Python (numpy, pandas, scipy), PySpark, Scala Spark, Spark Streaming, Kafka, ElasticSearch,

(Beginner Level), PL SQL, Hive QL,Pig Latin. HBase,Cassandra, HDFS, MapReduce, Hive, Pig, Flume, Sqoop,Oozie.


Hadoop Developer 07/2016 to Current

MasterCard O'Fallon, MO

Developed mapreduce code to process the redemptions and send back the cashback values,points to customers.

Implemented oozie bundle and coordinator for managing and scheduling workflows. Created generic job launcher for oozie workflows, single place to launch all the workflow, easy to add logging, metrics, etc.

Enabled speedy reviews in first mover advantages by using oozie to automate data loading into hdfs . Developed and tested workflow scheduler jobs scripts in apache oozie. Developed map reduce programs to parse the raw data and populate the tables in post grace database . Created hive quires that helped stalk holders to see how many offers got redeemed and non redeemed based on the transactions .

Part of the Design & Implementation team which identifying opportunities to leverage components of the Hadoop ecosystems which are not yet part of our architecture. Provided Hadoop expertise to various operational groups with varying skillsets, who will share admin responsibilities.

Accounted for end-to-end performance tuning and volume management of Hadoop clusters and MapReduce routines.

Contributed to the evolving architecture of mastercard's Big Data services to meet mastercard's multiple project specific requirements for scaling, reliability, performance, manageability, and price. Big Data Developer 01/2015 to 06/2016

JCPenney Headquarters Plano, TX

Installed and configured Hortonworks Distribution Platform (HDP 2.4) on Amazon EC2 instances with 100 nodes.

Developed automation scripts to import data from S3 to Datameer in a Datameer compressed format. Created FTP jobs (JSCAPE) to import data from COREMETRICS detailed files into datalake (hdfs/S3) through edge node.

Developed MapReduce programs and Hive queries to analyze sales pattern and customer satisfaction index over the data present in various relational database tables. Analyzed the data using the hive queries to find the top selling products in particular region during festive seasons in order to increase its production in the future,and this analyzed is has imported to tableau for visualization graphically.

Implemented Naive Bayes model on supervised machine learning dataset for sentiment analysis. Designed workflows and coordinators in Jenkins to automate and parallelize import jobs (Sqoop, Spark, JScape, etc) on Apache Hadoop environment by Hortonworks (HDP 2.4). Deployed Amazon Elastic Search clusters and deployed log stash on all instances to collect logs and metrics of applications which is used to monitor the application health deployed in the clusters Experience in understanding the security requirements for Hadoop and integrating with Kerberos authentication infrastructure- KDC server setup, creating realm /domain, managing. Expertise in implementing Spark, Python application using higher order functions for both batch and interactive analysis requirement.

Worked on Kafka to rebuild user activity tracking pipeline for publish-subscribe feeds. Spark Streaming collects the data from Kafka in real-time and performs necessary transformations and aggregation on the fly to build the common learner data model and persists the data in NoSQL store Cassandra.

Developed Python scripts, using both DataFrames/SQL and RDD in Spark for data aggregation and queries.

Experience in configuring Zookeeper to provide high availability and Cluster services coordination. Processed the clickstream data in Spark Streaming and stored it in Cassandra for further modification. Utilized market basket analysis to discover and understand customer purchasing behavior. Software Development Engineer In Test (SDET) 05/2014 to 12/2014 Mastercard O'Fallon, MO

Involved in the design and development of test plan from business and Functional requirements which includes test objectives, test strategies, test environments etc. Incorporating automation framework changes based on the new changes in the product and enhancement requests from the manual team.

Part of the team which incorporated masterpass jbehave based automation framework with mastercard ‘s global automation framework as part of organizational requirements. Working with the Release Engineering team to create and maintain an automated build verification test. Running bamboo jobs as part of regression testing on different environments depending on the sprint team requirements .

Part of the team which modified the existing non Page Object Model framework to Page Object Model Automation framework.

Interact with users by conducting User Acceptance Testing (UAT) to ensure that the total Provide test summary documentation and analyze test results, identifying trends and/or root causes of problems. Running, maintaining, troubleshooting the sanity jobs as part of sprint team. Responsible of automating functional stories as part of feature level automation. Manipulated data using CRUD (create, read, update and delete) operations of MongoDB database management system and handled database access and data transmission based on RESTful web service. Education and Training

Computer Science 2017

Certification in Big Data Modeling and Management Systems,University of California San Diego, Coursera. Certification in Big Data Integration and Processing,University of California San Diego, Coursera. Hortonworks Certified Associate (HCA)

Master of Science: Computer Science 2014

University of Louisiana at Lafayette Lafayette, LA, United States 3.75 GPA

Bachelor of Science: Computer Science 2012

Acharya Nagarjuna University Guntur, AP, India

Awards and Achievements

2013 All India 750th Rank (98.50 percentile) in Graduate Aptitude Test in Engineering (GATE). 2014 ULL Fellowship for being one of the best students in Master of Sciences.

Contact this candidate