Professional Summary
Education
Skills
Work History
Academic Projects
D M
• Home: 518-***-**** • **************@*****.***
Software Engineer with 3+ years of experience in creating scalable web applications using Java. Masters in Computer Science, Proficient in C++, Java,Python. Big Data & Hadoop Enthusiast. Master of Science: Computer Science, 2018
State University of New York, Albany - Albany, NY Programming Languages: Python, C, C++,
Java,R,JavaScript
Database: MySQL, MongoDB
Big Data Ecosystem: Hadoop, Hive, Flume, MapReduce, Apache Spark, Pig, HBase, Cassandra, Oozie, Kafka, AWS,.
Spark Framework:
Spark RDD's, SparkSQL, Spark Streaming, Spark Mlib, Apache Kafka & Architecture
Tools & Technologies: Parallel Programming
--OpenMp,CUDA, OpenCL,Boost Libraries, Spring, JSP, Servlets, Struts, JDBC, Ajax, jQuery.Cloud : Amazon EC2, AWS-AMI. MATLAB
Graduate Teaching Assistant, 08/2017 to 12/2017
State University of New York (SUNY) – Albany, NY
Mentored students through office hours and one-on-one communication. Checked assignments, proctored tests, and provided grades according to university standards. Software Developer, 01/2013 to 11/2016
Softyoug Solutions
Writing Technical specification document Ensure designs follow specifications. Write well designed, testable, efficient code Struts MVC development JSP pages development
Servlets Development
Write JavaScript
Ajax Implementation
Testing of Application
Code Maintenance.
Bug Fixing.
Intern, 03/2011 to 08/2011
Idea Cellular Ltd
Worked at Idea Cellular, a mobile network operator as trainee where developed a network switching subsystem(NSS) portal which helped its users to run Telnet commands to VLR and HLR using PHP scripts. Analysis of General Election
Environment: HDFS (for storage), Spark SQL (for transformation), Spark MLlib (for ML) Analyze factors that led to the eventual outcome based on demographic features to plan subsequent campaigns Overcame challenges of storing & processing structured/semi-structured data via Hadoop Framework & Apache Spark Transferred data into HDFS
Deployed PySpark to analyze voting patterns across multiple sources and channels . Further processing using MapReduce Delivered the output into RDBMS via Sqoop. Stock Market Prediction
Developed a Stock Market Prediction model using different Regression and Classification techniques, developed in R.
We developed a model which was 57% accurate in predicting Stock market movements using Naives baye's Classifier combined with EMA.
We improvised our model by combining random forest and features from Bollinger Bands. A Security analysis on Browser Extensions
Environment : Python, MongoDB.
Conducted Security research on Browser Extension Vulnerabilities. Exposing Web Accessible Resources in Chrome Extensions. Researched on attacks like URI Leakages and Two side Time channel attack for exposing browser extensions. Crawled Google Chrome Web store, fetched around 67822(all) extensions packages and their metadata. We found broadly more that 64% extensions were Over-privileged. We classified permissions used by extensions according to the categories defined on the google chrome store. Passage Retrieval System
Environment : Apache Lucene.
Developed Passage Retrieval system using Apache Lucene . It involved two separate indexing techniques .
Documents were divided into passages after then they were indexed, instead of usual document indexing. Precision and Recall were compared using trec eval using both techniques. Aviation Data Analysis
Environment: Linux, Hive, Hadoop, Spring, Flume, Pig, Spark Deployed Hadoop Framework to analyze public datasets around cities served by airports, timezone, code of airport, etc.
Rendered insights on nation-wide airport listing, airlines sharing airports, airlines with zero stops, most active airline, etc.