MUWANGUZI ESTHER JORDANA
*** **** ****** ***** ******, Birmingham, AL **********@*****.*** https://www.linkedin.com/in/muwajorda/
I am a data scientist with a diverse background in computer science, information technology, and bioinformatics. I have experience handling big data and applying machine learning and statistics, most recently in the bioinformatics field.
RECENT WORK EXPERIENCE
Bioinformatics Analyst, Birmingham, AL, Oct 2017 to date.
University of Alabama at Birmingham: Center for Clinical and Translational Science – Informatics (CCTS).
Presently using Python's Snakemake to build bioinformatics pipelines that collect, process, analyze, and maintain big omics datasets.
Additionally, assist investigators in understanding analytical results through interactive visualizations.
Currently using GitHub to maintain analysis projects, with the end goal of reproducible research.
Intern as a Bioinformatics Professional
Momenta Pharmaceuticals, Cambridge, MA, July-Dec 2016.
Developed a translational biology data analysis pipeline in a Linux cloud environment hosted on Amazon Web Services.
Used RNA-Seq NGS tools and applications to process and analyze data, identifying variants in the cohort relative to known populations.
Analyzed results from SNP biomarker discovery by applying various statistical tests in R to stratify patients by treatment response relative to genotype.
Intern as a Data Analyst
Karen soft Consulting Group Inc., Duluth, GA, Jun-Dec 2014.
Performed data mining and data management on clinical and public health data using in-house data analysis programs.
Completed reporting and packaging of data with Excel spreadsheets.
Created logical design models and mapped them to physical design models to aid in the development of an in-house data warehouse.
PROJECTS
Biologics and Much More
Northeastern University, April 2016
The project involved gathering data from biological databases, journals, and websites to create a data repository. R was the primary language used for data extraction, management, and cleaning. A MongoDB package in R was used to create the repository, which stores meaningful metadata and information for common biologics.
Sentiment Analysis on Trending Hashtags (Northeastern University, May 2016)
This project used Twitter hashtags to analyze the sentiments of the sample population. R was used to gather data through the Twitter API. The data was then cleaned and explored. R sentiment analysis packages captured and classified words as positive, negative, or neutral sentiments, as well as human emotions.
TECHNICAL SKILLS
Programs and Languages: R, Shell, Python, SQL, Markdown, GitHub, Jupyter Notebooks
Exploratory data analysis: R (tidyverse) and Python (NumPy, pandas)
Machine Learning and Deep Learning: classification, regression, clustering (R: rpart, Random Forest; Python: scikit-learn, Keras, and TensorFlow)
Statistical Methods: regression models, hypothesis testing, principal component analysis, and dimensionality reduction
Data Visualization: ggplot2, Plotly, ggmap, and Matplotlib
EDUCATION
Northeastern University, Boston, MA: Master of Science in Bioinformatics, May 2017
Makerere University, Uganda: Bachelor of Science in Computer Science & Information Technology, Jan 2014