Josh Blumenstock
************@*****.*** *** South Hall, School of Information, University of California at Berkeley, Berkeley CA 94720
Education
University of California at Berkeley, Berkeley, CA Present
Pursuing PhD, School of Information
Wesleyan University, Middletown, CT May 2003
B.A., Computer Science: High Departmental Honors Major GPA: 4.0/4.0
B.A., Physics: High Departmental Honors Cumulative GPA: 3.9/4.0
Honors Thesis: Mining Clusters for Knowledge: Finding Algorithm-Independent Groups in Microarray Data
Honors and Awards
Berkeley Fellowship, University of California at Berkeley 2006-2010
Power Award, University of California at Berkeley 2006
Watson Fellowship, Thomas J. Watson Foundation 2003-2004
Phi Beta Kappa, Wesleyan University 2003
Senior Prize in Computer Science, Wesleyan University 2003
Johnston Prize in Physics, Wesleyan University 2000
Work/Research Experience
Founding Engineer, HOTorNOT.com, Berkeley, CA 2005-2006
Principal engineer and lead product manager for dating website with over 5M users.
Fellow, Thomas J. Watson Foundation, Providence, RI 2003-2004
Received $22k grant to research emerging technologies in China, South Africa, Cape Verde, Argentina, Brazil and Chile.
Intern, Microsoft Research, Bay Area Research Center, CA 2002
Worked in Gordon Bell and Jim Gray s lab on MyLifeBits, a database with annotations, pivoting, clustering, and search.
Research Associate, Harvard Institute of Medicine, Boston, MA 2001
Developed statistical techniques to diagnose and classify subtypes of lung cancer and other human disease.
Co-founder/Researcher, Uncanny Systems, Lake Pleasant, MA 2000
Developed algorithms and an application to analyze financial data using agent-based unsupervised learning.
Publications and Presentations
Blumenstock JE, Size Matters: Word Count as a Measure of Quality on Wikipedia . Proceedings of the 17th
International Conference on the World Wide Web. ACM Press, 2008 (forthcoming).
Blumenstock JE, Automatically Assessing the Quality of Wikipedia Articles. UCB iSchool Tech. Report. 2/2008.
Named Entity Recognition in Biographical Markup. ECAI/PNC Joint Meeting on Area Studies. October 2007.
Blumenstock, JE. Closing the Gap: Solutions to South Africa s Digital Divide The Big Issue. 81:8 (2004): 22-23
Gemmell J, Lueder R, Blumenstock JE, Solomon E, Bell G. Telephone, Television and Radio in the Home of the
Future Internet and Multimedia Systems and Applications. (2003): 263-268
Gordon GJ, Jensen RV, Hsiao LL, Gullans SR, Blumenstock JE,, Sugarbaker DJ, Bueno R. Using Gene Expression
Ratios to Predict Outcome Among Patients With Mesothelioma. Journal of the Natl Cancer Inst. 95:8 (2003): 598-605
Gordon GJ, Jensen RV, Hsiao LL, Gullans SR, Blumenstock JE, Ramaswamy S, Richards WG, Sugarbaker DJ, Bueno R.
Translation of Microarray Data Into Clinically Relevant Cancer Diagnostic Tests Using Gene Expression Ratios in Lung
Cancer and Mesothelioma Cancer Research. 62:17 (2002): 4963-4967
Fryer RM, Randall J, Yoshida T, Hsiao LL, Blumenstock JE, Jensen KE, Dimofte T, Jensen RV, Gullans SR. Global
Analysis of Gene Expression: Methods, Interpretation, and Pitfalls. Experimental Nephrology. 10:2 (2002): 64-74
Hsiao LL, Jensen RV, Yoshida T, Clark KE, Blumenstock JE, Gullans SR. Correcting for Signal Errors in the Analysis
of Microarray Data. Biotechniques. 32:2 (Feb. 2002): 330-337
Skills
Conversationally fluent in Spanish and Portuguese; conversational in Chinese (Mandarin).
Research experience in machine learning, data mining, computational statistics.
Fluent in C#/C++, PHP, VBA, MatLab, SQL. Admin experience on multi-tiered web architecture w/200+ servers.