Name
Rujbir Kaur
Contact Details
Email: acxm2d@r.postjobfree.com
Profile
I have had varied experience in IT industry. I worked as a business analyst for a number of years after being a software developer for a few years. I also had a brief stint as a tester and a System Integration Lead in Telco companies. I had one role as a Technical Writer. Lately, I have been working on transitioning to Data Science area where new opportunities are emerging. I have recently undertaken a project in this area.
Skills Summary
Area
Skills
Text Mining/Natural Language Processing
Text cleansing
Text tokenization
Text stemming
Text vectorization
Topic modelling (LDA), n-gram analysis
Twitter Mining
Machine Learning
Classification, Regression, Clustering, Text Mining
Python
Python (3.5), Sklearn, Pandas, Numpy, Visualizations in Python
Access Web Data
HTML, XML Parsing, scraping, JSON/REST APIs
Other
Matlab, MS Excel, SQL
Recent Data Science Project
Healthcare Twitter Analysis Project (Saama Technologies, USA)
The goal of Saama Technologies healthcare Twitter analysis project is to gain meaningful insights into American healthcare and medicine by analysing data on the social media website Twitter along with government data and live events data. The project aims to achieve the above goals by deploying machine learning methods to analyse data. The analysis for the project can take the form of any of the common machine learning problem domains like classification, regression, clustering, sentiment analysis etc. Moreover, extensive text analysis and natural language processing (NLP) methods are expected to be used for processing the textual data.
The project as described above is an umbrella project and has a very wide scope. Smaller projects with narrower scope are expected to be carried out to fulfil the goal of the umbrella project. All projects are expected to contribute to the goal of the umbrella project, which is to understand the themes and patterns in tweets, government data and the live data.
The objective of this particular project was to explore ‘cancer’ related tweets with a view to discover some ‘cancer’ related themes. The project objective was met by using machine learning algorithms to discover topics in tweets, manually interpreting topics to identify themes and by performing some content analysis on tweets. The project was implemented by using Python 3.5.
Project Problem Statement:
a) Develop topic models from ‘cancer’ related tweets in order to identify some ‘cancer’ related themes.
b) Analyse the content and describe the themes in tweets.
c) Summarise the findings of topic analysis and content analysis.
IT Business Analysis/Requirements Analysis Roles
Role
Year
Organisation
Description
Reporting & Project Officer
2014
NSW Department of Education and Community Services (DECS)
Gather and document reporting requirements
Develop and test reports
(MS Access)
Adhere to IT processes
Business Analyst
2011
ACN, Sydney
(Telco Reseller)
Facilitate requirements workshops
Elicit requirements
Document business requirements
Document functional requirements
Manage stakeholder expectations.
Business Analyst
2010-2011
Tabcorp, Sydney (Star City)
1.Same as above -
Business Analyst
2008-2009
Telstra, Sydney
2.Same as above -
Business Analyst
2008
Sensis, Melbourne
3.Same as above -
Business Analyst, Billing
2005
Hutchison Telecom, Sydney
Gather and document requirements for billing customers for Hutchison products
Business Analyst, Billing
2004
Optus, Sydney
Gather requirements for insourcing a module of Kenan Arbor Billing system into Optus environment.
Testing Role
Role
Year
Organisation
Description
System Integration Test Lead
2006-2007
Satyam Mahindra, India
Project managed System Integration Testing (SIT) for a telecom software deployment.
Other Roles
Role
Year
Organisation
Description
Technical Writer
2011-2012
Retriever Communications, Sydney
Write technical documents - user guide, product specifications guide, language guide.
Software Developer
1999-2003
Various
Develop software on C/Unix platform using databases and SQL.
Qualifications
Bachelor of Commerce (B.Com)
Master of Computer Applications (MCA)
Advanced Diploma of IT Project Management
Certifications & Training
ITIL Foundation Certificate (score 37/40)
MS Excel Level 2 (University of Sydney, 2014)
MS Excel Level 3 (University of Sydney, 2014)
Machine Learning (Stanford University, Coursera online platform)
Regression Analysis (University of Washington, Coursera online platform)
Machine Learning: Clustering & Retrieval (University of Washington, Coursera online platform) [Python]
Using Python to Access Web Data (University of Michigan, Coursera online platform)