Dunia Giniebra Camejo
Office: 713-***-**** • Cell: 832-***-**** • ********@*****.***
Work Experience
MD Anderson Cancer Center
Biostatistician, Department of Cancer Systems Imaging
Houston, TX
Mar 2016-Present
•Used supervised and unsupervised machine learning techniques such as support vector machines, tree-based methods, neural networks, and Bayesian ensembles to predict gene expression, disease progression, and patient prognosis
•Designed anomaly detection method to detect at-risk cases of early onset pneumonitis
•Created a penalized lasso classification system to identify sarcomatoid cells in lung tumors
•Identified autistic brains from normal brains using MRI scans and a custom-built boosted SVM model
•Developed graphical models to study the network structure of cancer-linked genes
Lone Star College
Adjunct Faculty, Department of Mathematics
Cypress, TX
May 2015-Mar 2016
•Taught college algebra and statistics classes to undergraduates
Institute of Cybernetics, Mathematics, and Physics
Statistics Specialist
Havana, Cuba
Sept 2010-June 2014
•Developed simulation algorithms based on Markov Chain Monte Carlo, Data Augmentation, and Expectation Maximization methods to address missing data problems and build a parametric model to estimate risk-factor changes
•Used a fully data-driven model selection procedure to estimate covariance functions, with applications in measurements of risk based on financial time series (VaR, Expected Shortfall)
•Applied techniques from Kriging Spatial Interpolation to map and predict rainfall and soil properties
•Developed variable selection method to extract variables of importance in Big Data datasets
•Built clustering methods, with applications in portfolio diversification and bias avoidance
Education
University of Havana
Master of Science in Probability and Statistics
Havana, Cuba
Sept 2010-July 2012
•Thesis: “Nonparametric Estimation of Covariance Functions by Means of Fully Data-Driven Model Selection Technique”
Bachelors of Science in Mathematics
Sept 2006-July 2010
•Thesis: “Methods of Solving Problems of Missing Data in Categorical Variables”
Publications
•“Radiome Sequencing Reveals Genomic Landscape of Glioblastoma and Predicts Patient Survival” (recently sent).
•“Estimation of Covariance Functions by a Fully Data-Driven Model Selection Procedure and its Application to Kriging Spatial Interpolation of Real Rainfall Data” Statistical Methods and Applications, Vol. 23, p. 149-174 (2014)
•“Applications of New Techniques to Kriging Spatial Interpolation for Predictions of Rainfall and Soil Properties in Pinar del Rio State” Reporte de investigación, ICIMAF 2012-670
•“Non Parametric Estimation of a Covariance Component in an Additive Model” Reporte de investigación, ICIMAF 2013
Skills and Interests
Technical Skills
Interests
R, MATLAB, Python, Octave, SPSS, Excel, SAS, STATISTICA, LaTeX
Stochastic Processes, Machine Learning, Time Series Analysis, Nonparametric Estimation, Predictive Modelling, Model Selection, Regularization, Dimension Reduction
Work Eligibility
Languages
US Permanent Resident
Spanish (native), English (very good), French (very good)