RESHMA KARUMATHIL MANI
Phone: 770-***-****
EDUCATION:
M.S. in Computer Science
Georgia State University, College of Arts and Sciences, Atlanta, GA. GPA: 3.91/4
Relevant Coursework: Database systems, Parallel and distributed systems, Advanced parallel algorithms, Modelling and simulation, Machine learning, Databases and web, Computer graphics.
Bachelor of Engineering in Computer Science (June 2006)
Calicut University, Kerala, India
Certified Informatica Advanced Mapping Designer.
Teradata Certified SQL Specialist.
PROFESSIONAL SUMMARY:
5+ years of total IT experience across the complete SDLC, including 4+ years of data warehousing and 2 years of Java development.
Experience in data modeling and data analysis using dimensional and relational data modeling, star join schema/snowflake modeling, fact and dimension tables, and physical and logical data modeling.
Expertise in ETL processes using Informatica Power Center Designer, Workflow Manager, Workflow Monitor, Informatica Server and Repository Manager.
Worked extensively with complex mappings using different transformations like Source Qualifiers, Expressions, Filters, Joiners, Routers, Union, Unconnected / Connected Lookups and Aggregators.
Experience in writing BTEQ scripts in Teradata and in using Teradata load and unload utilities such as TPump.
Interpreted and decoded HIPAA files and loaded them into the data warehouse.
Designed and developed efficient error-handling methods and implemented them throughout the mappings.
Strong knowledge of the Microstrategy reporting tool and knowledge of report creation in the Business Objects and Cognos reporting tools.
Good experience in UNIX and in writing shell scripts for Informatica pre- and post-session operations, FTP, file management and other database administration activities.
Skilled in Unit Test, System Integration Test and UAT.
ACADEMIC PROJECTS: (August 2013-May 2015)
Modelling and Simulation Project
Developed a DEVS Java cellular space model of a wolf, sheep and grass predator-prey system, simulated it under varying sheep and wolf birth rates and grass growth rates, and studied the growth and decline of the sheep and wolf populations.
Machine Learning Project
1. Performed multi-label classification using the one-vs-all algorithm, after applying PCA for feature normalization, on the following datasets:
1) Train data with 4434 features and 33 samples, and test data with 4434 features and 17 samples. Four classes.
2) Train data with 5966 features and 73 samples, and test data with 5966 features and 29 samples. Two classes.
3) Train data with 9182 features and 100 samples, and test data with 9182 features and 72 samples. Eleven classes.
4) Train data with 3312 features and 161 samples, and test data with 3312 features and 42 samples. Five classes.
2. Performed missing value estimation on a dataset of 242 genes and 14 samples with 4% missing values, and on another dataset of 758 genes and 50 samples with 10% missing values.
Parallel and Distributed Systems Project
1. Wrote CUDA programs for matrix multiplication using shared memory and multiple blocks.
2. Implemented bitonic sort and parallel sorting by regular sampling (PSRS) in both the shared memory and message passing paradigms, and performed a parallel time complexity analysis of each algorithm.
Databases and Web
Developed a movie recommendation system in Java and compared the performance of MySQL and Neo4j by implementing the system with each as the backend.
TECHNICAL SKILLS:
Expertise working with Tools
Informatica 9.1, 8.6.1, 8.6, 8.5.1 (Power Center), Microstrategy 8.1
Data Modeling
Erwin 4.0/3.5, Star Schema Modelling, Snowflake Modelling
Databases
Teradata TD14, Oracle 10g/9i/8i, MS SQL Server 2005/2000, MongoDB
Languages
SQL, PL/SQL, Unix Shell Script, Java (JDK 1.7), C, XQuery, XSLT, DEVS Java, CUDA
Other Tools
Business Objects, Cognos 8.0, Toad, SQL*Loader
Operating Systems
UNIX, Windows NT Servers
PROFESSIONAL PROJECT EXPERIENCE:
1. Latest Project (Jan 2014 – May 2015)
Project for Neurology department, GSU - AthenaDB (Data warehousing/MongoDB/Java/XSLT)
The main objective of this project was to build a set of tools to extract, reformat and store open access articles related to cognitive neuroscience that are published electronically by various publishers, and then to build a web interface to query the resulting database. The database is designed as a flexible collection of the raw text of published papers that can be queried to generate sub-collections of text for training text-mining classifier systems. The project was done for the psychology department of the college to enable data mining on their various subjects, and it focused mainly on the Frontiers and PLoS ONE websites. The XML pages of the articles are downloaded by constructing a URL from the article DOIs, which are obtained from the PubMed and PubMed Central databases. These XML files are reformatted into a simpler form using XSLT and uploaded to the MongoDB database. The project was implemented in Java.
Responsibilities:
Involved in the full life cycle design and development of the database on MongoDB.
Worked with the psychology department team to gather detailed requirements for the database and to decide between a relational and a NoSQL database.
Reformatted the articles downloaded from websites into a simplified XML form for ease of storage in MongoDB and ease of querying later on.
Developed the database design and the approach for searching and downloading the needed articles from open access websites.
Implemented the complete project independently, with guidance from my advisor, and delivered the required features in the tool.
Environment: JDK 1.7, EditiX-2008, MongoDB 3.0.1, MongoVUE, PubMed, PubMed Central databases, XML, XSLT, XSD.
2. Infosys
Sep'06 – Jan'11
Technology Analyst (Data Warehousing)
WellPoint (Healthcare) http://www.wellpoint.com/
State Sponsored Business
Jun'10 – Jan'11
Sr. Informatica Developer
Description:
WellPoint is one of the largest healthcare providers in the U.S.A. The state government helps underprivileged people in the USA by paying their claimed healthcare amounts. WellPoint acts as a third party, helping the state keep track of the members, providers and claims of the state sponsored business. In this project, we developed a complete data warehouse for the state sponsored business part of WellPoint by extracting data from various sources such as Oracle databases, HIPAA files and flat files. We also developed a separate layer of the warehouse from which HIPAA files could be extracted.
Responsibilities:
Involved in the full life cycle design and development of the data warehouse on a Teradata database.
Interacted with business analysts, data architects and application developers to develop a data model, and took part in business analysis and technical design sessions with business and technical staff to develop entity relationship/data models, requirements documents and ETL specifications.
Developed standard and reusable mappings and mapplets using various transformations such as expression, aggregator, joiner, source qualifier, router, connected/unconnected lookup and filter.
Environment: Informatica Power Center 8.5.1/9.1, Teradata, Oracle 10g, SQL, TOAD, Erwin, SQL SERVER 2005, XML, Shell Scripts, WLM (scheduling the workflows).
WellPoint (Healthcare) http://www.wellpoint.com/
Encounter Data-Warehouse
Jun'08 – Jan'10
Sr. Informatica Developer
Description:
This project created a single data warehouse for all WellPoint systems by sourcing data from all the existing WellPoint databases, which included Oracle, mainframe and DB2 systems. A reporting data store was also developed for ease of reporting. Historical data was captured from the HIPAA files that had been submitted to WellPoint by the state and archived on WellPoint shared drives.
Responsibilities:
Responsible for creating detailed technical specifications and mapping documents.
Extracted data from sources such as Oracle, XML, and fixed-width and delimited flat files, and transformed the data according to the business requirements using the ETL tool Informatica.
Extracted data from mainframe systems through Informatica PowerExchange and loaded it into the data warehouse.
Modified several existing mappings, created several new mappings based on user requirements, and was involved in job monitoring as part of production support.
Created Mappings using Mapping Designer to load the data from various sources using different transformations like Aggregator, Expression, Stored Procedure, Filter, Joiner, Lookup, Router, Sequence Generator, Source Qualifier, and Update Strategy transformations.
Environment: Informatica Power Center 8.1.1/8.5.1, Teradata, Oracle 10g, SQL, TOAD, Erwin, SQL SERVER 2005, XML, Shell Scripts, WLM (scheduling the workflows), Business Objects.
Intel
Account Reconciliation Tool
May'07 – Mar'08
Microstrategy Technical Specialist
Description:
Previously, Intel's Finance department reconciled data manually, using Excel sheets to compare the summed amounts in the source databases with the final amount in the data warehouse. In this project, a reconciliation tool was developed to compare these amounts and eliminate the manual work. Account Reconciliation Tool (ART) is an enhancement of the Accord tool and is a key control under Sarbanes-Oxley (SOX). We used Microstrategy to develop the reports in the same format in which the reconciliation had been done manually, and to enable users to export the reconciliation reports to Microsoft Excel sheets. Various other reports were also developed, according to the requirements provided, to compare the data and perform further analysis.
Responsibilities:
Involved in the full life cycle design and development of reports in Microstrategy.
Interacted with business analysts, data architects and application developers to develop a logical data model and then the physical data model.
Worked on the design and development of reports according to the requirements provided.
Worked on generating MSTR reports from the .NET platform by passing a URL containing the MSTR server and authentication information.
Worked with Microstrategy Command Manager to create Metrics.
Used Microstrategy Office to export reports into Excel format.
Environment: Microstrategy 8.1, Teradata, Oracle 10g, SQL, TOAD, Erwin.
Time Warner Cable (TWC)
Jan'07 – May'07
Informatica Developer
Description:
Time Warner Cable is one of the largest cable companies in the U.S. TWC provides its services in many regions, including Buffalo, South Carolina and California. The project included three modules: Production Support, Quality Analysis and Report Development. We studied the company's complete data warehouse, performed a detailed data analysis, supported many daily Informatica jobs, fixed the bugs that occurred and developed reports according to the user requirements provided.
Responsibilities:
Provided 24x7 support as an Informatica Developer and worked as a team with the offshore Quality Analysis and Cognos report developers.
Worked closely with the onsite coordinators in writing the functional specifications based on the business needs.
Worked on Change management, Configuration management, Knowledge management and Remedy management.
Developed and tuned complex mappings at the source, target, mapping and session levels.
/dropping of tables and indexes for performance in pre- and post-session management.
Used Cognos Reporting tool for developing the reports as per requirements.
Environment: Informatica Power Center 8.6.1/8.1.1, Cognos 8, Oracle 10g, Netezza, MS SQL Server 2005, Toad.