Harsha Vardhan Galla
Contact k: ***********@*****.*** H: +1-352-***-****
**** ******** ***, *** *, Ho man Estates, IL, 60169 Summary Six years of professional working experience.
Good exposure in designing and Implementation of software involving high data volume.
Experience in projects involving extraction of meaningful information from data leading to new products for business.
Good understanding of Big Data platforms like Hadoop, Pig, Hive, Flume, Sqoop, Zookeeper.
Exposure to design and optimization of database with working experience in Oracle and Mon- goDB.
A quick learner, a team player and can adapt to new technologies and practices in a short period of time.
Technical
Skills
Big Data Technologies : Hadoop, HDFS, HBase, Pig, Hive, MapReduce, Flume, Sqoop, Zoo Keeper.
Java Technologies : Java, Servlets, JSP, J2EE.
Databases and Development Tools : Oracle database, MongoDB, SQL. Development Tools : Crystal Report Desinger, Sonar, YourKit, Eclipse, IntelliJ, Netbeans. Version Control System : SVN, ClearCase, Git.
Frameworks : JSP, Servlets, Maven, Spring, Amazon Web Services (AWS), Ant, SmartGWT, Jasper Reports, Log4J, Selenium, JUnit, EasyMock, Chrome Extensions. Development models : Scrum, Agile.
Education University of Florida, Gainesville, Florida Master of Science (Computer Engineering) GPA: 3.83/4 Birla Institute of Technology and Science, Pilani, India Bachelor of Engineering (Computer Science)
Professional
Experience
Sears Holdings Software Development Engineer II July 2014 - current
Setup Big data cluster using Hadoop which can be used for in house data processing and analysis. Con gured Hadoop components like Mapreduce, HDFS, Pig and Hive.
Responsible for Data import and export into HDFS with Flume and Sqoop. Explore use cases for extraction of meaningful information from imported data.
Experience in developing Pig Latin scripts and using Hive Query Language for data analytics.
Support cluster management and investigation of system failures to assist in restoring the func- tionality.
Document design and procedures to build and manage Hadoop clusters.Coordination with tech- nical teams for installation of Hadoop and third related applications on systems. Sabre Holdings Senior Software Developer Sep 2009 - Dec 2012
Developed Business Intelligence solutions named Sales Analyzer, Sales Essentials and Network Analyzer that help airlines in their decision strategies. The software provides real-time analysis of thousands of markets worldwide and helps identify potential opportunities for
ight operations.
Lead the e ort in designing a BI application named Market Intelligence which provide in browser analytical capabilities, rich user interface and cross browser compatibility. The application used frameworks like SmartGWT, Spring, Maven and Oracle as the database. This new product was a success doubling revenues in less than 6 months.
Designed and developed a dynamic query generator framework to automate query generation for complex databases with very high dynamism in query parameters. This signi cantly reduced implementation and operational costs.
Designed and integrated crystal and jasper reports in a J2EE web application to provide reporting capabilities in a BI product. This provided drill down and graph capabilities to users which helped better analyse reports.
Contributed signi cantly to database (of the size of 15TB) performance by e cient database design and optimizing complex SQL queries. Improved performance in a data processing job by 10x by implementing Trie data structure and using cache management frameworks like JCS. Project
Details
Data Migration and Reporting
The project involved moving of data from various sources to HDFS and vice-versa. This also involved setup of front-end to execute queries and export results to clients. Team Size: 3
Environment: Hadoop, HDFS, Map Reduce, Hbase, Hive, Flume, Sqoop, Ambari, Ranger, Blueprint.
Source Control: SVN
Role: Software Developer Engineer.
1. Resposible for loading structured/semi-structered data from various sources into HDFS. 2. Identify various data sources to ensure data quality and import into HDFS using Flume and Sqoop.
3. Export data to relational tables so it can be consumed by front end dashboards. 4. Analyze imported data using Pig and Hive scripts and provide meaningful results. 5. Developed Pig UDF’s to preprocess data for analyis. 6. Worked on tuning the performance of Pig queries. Setup and Development of Hadoop Fully-Distributed Operation & Map reduce The project involves setup and con guration of hadoop cluster. The setup is used for analysis of logs which helps to generate reports that aid in making business decisions. The project also involved maintenance of the cluster to achieve high availability and scalability. Team Size: 1
Environment: Hadoop, HDFS, Map Reduce, Hbase, Hive, Flume, Sqoop, Ambari, Blueprint, Kerbos, Ranger.
Source Control: SVN
Role: Software Developer Engineer.
1. Setup cluster using Ambari blueprint to rapidly scale as per computing needs. 2. Gather use cases and implement them using Map Reduce. 3. Analyze data output from the Map Reduce jobs and share it across teams. 4. Design and implementation of security for Hadoop cluster with Kerberos secure authentication. 5. Setup Ranger to achieve ne-grained access control over Hadoop data access in the cluster. 6. Manage scheduling of long running tasks and optimize execution. 7. Monitor cluster and scale when load is high
8. Overlook cluster coordination services through Zookeeper. AirVision Market Intelligence
Design and implement a product that is used extensively by airlines to perform network analysis. The product helps business identify potential new routes in network, analyze competitor routes and help understand tra c patterns.
Team Size: 3
Environment: Core Java, Spring, Maven, GWT, SmartGWT, Servlets, JSP, Oracle, SQL.
Source Control: SVN
Other Tools: JIRA, Sonar, Version One, YourKit, Log4J, JUnit, Selenium. Role: Senior Software Developer
1. Identify the right set of technolgies to implement the project by implementing various POC’s. 2. Responsible for adoption of best frameworks and practices in product design and implementa- tion.
3. Own and plan Agile process to deliver the new product in a time bound manner. 4. Ensure data quality and accuracy. Write database scripts to ensure data integrity. 5. Fine tune and optimize database and SQL queries to provide data in user acceptable time limit. 6. Implement data structures, algorithms to improve data retrieval in the application. 7. Design query framework to support adhoc analysis with data from multiple sources to help business make decisions.
8. Provide insights and new ideas to compete with the market. AirVision Sales Analyzer, Network Analyzer & Sales Essesntial Design and develop products that are used extensively used by airlines to perform exploratory data analysis. The products provided real-time analysis of sale point channels and schedules. It also pro- vides insight into potential opportunities to improve revenue and pro t margins. Team Size: 4
Environment: Core Java, Struts, Ant, SQL, Oracle, Crystal Reports, Servlets, JSP, Jasper Reports.
Source Control: SVN, Clear Case.
Other Tools: JIRA, Sonar, Crystal Reports Designer, Log4J, JUnit. Role: Contributor Software Developer
1. Gather requirements from business and evaluate feasibility of implementation. 2. Implement enhancements and own complete Agile process to deliver requirements. 3. Tracking of defects by using Quality Center tool and plan in iteration to take them to closure. 4. Evaluate new frameworks and Business Intelligence tools for the product. 5. Contribution to Sabre enterprise UI framework