Michael Hahsler
Department of Engineering Management, Information and Systems
Department of Computer Science and Engineering
Lyle School of Engineering
Southern Methodist University
Dallas, TX 75205, USA
aborkf@r.postjobfree.com
http://lyle.smu.edu/~mhahsler/
Experience
Assistant Professor in Engineering Management, Information and Systems, (by courtesy)
Computer Science and Engineering, and co-director of the Intelligent Data Analysis Lab
(IDA@SMU), Lyle School of Engineering, Southern Methodist University (SMU), Dallas,
TX, USA, 2012-present.
Visiting Assistant Professor in Computer Science and Engineering, and co-director of
IDA@SMU, Lyle School of Engineering, SMU, Dallas, TX, USA, 2009-2012.
E-Business and Marketing Research Adviser, Hall Financial Group, Frisco, TX, USA,
2007-2008.
Associate Professor (Privatdozent), Department of Information Systems and Operations and
core researcher, Research Institute for Computational Methods, Vienna University of
Economics and Business Administration (WU), Austria, 2006-2007.
Adjunct Professor, Department of Computer Science, Webster University, Vienna Campus,
Austria, 2002-2003.
Assistant Professor (Universitätsassistent), Department of Information Systems and
Operations, WU, Austria, 2001-2006.
Research Assistant and Lecturer (Universitätsassistent), Department of Applied Computer
Science, WU, Austria, 1998-2001.
Education
Habilitation (postdoctoral university degree with lecture qualification) in Business
Informatics,
Vienna University of Economics and Business Administration (WU), Austria, 2006.
PhD in Social and Economic Sciences (Business Informatics) with distinction, WU, Austria,
2001. WU is ranked 28 in the 2011 Financial Times European Business School Ranking.
MS in Business Administration (majors: Information Systems and Applied Computer Science),
WU, Austria, 1998.
Associate degree in Communication Engineering with distinction, College of Technology -
HTBLA Wien I, Vienna, Austria, 1992.
Research Interests
Data Mining/Machine Learning/Business Analytics: Data stream mining, recommender
systems, data visualization, association rule mining, market basket analysis.
Information Systems: Digital information management, pricing of information goods.
Software Engineering: Design patterns, open source software development processes.
Awards
Nomination for the H.O.P.E. (Honoring Our Professors Excellence) professor of the year
award, Residence Life and Student Housing, SMU, 2012.
Graduate Student Council Outstanding Faculty Award, Computer Science and Engineering,
Bobby B. Lyle School of Engineering, SMU, 2011.
Top publication 2007 award for "Data Mining and Marketing: Exploratory Market Basket
Analysis" (in German: "Data Mining und Marketing am Beispiel der explorativen
Warenkorbanalyse") in Marketing ZFP, WU, 2007.
Finalist of the Global Bangemann Award 1999 (Stockholm Challenge) with the Virtual
University Project, Stockholm, Sweden, 1999.
Winner of the 1997 WU Innovation Award, WU, 1997.
Project Experience
Lead developer of the extension packages
arules - infrastructure for analyzing transaction data with association rules,
TSP - infrastructure for the traveling salesperson problem,
seriation - seriation/sequencing techniques and
rEMM - temporal modeling for massive data stream clustering
for R, a free software environment for statistical computing and graphics.
Head of engineering, ePub-WU project. Development of an open access digital library for
working papers and Ph.D. theses, WU, 2001-2003.
Designer, Assistant Project Manager and later Project Manager, Virtual University
Project, WU, 1997-2004.
Professional Memberships
ACM, ACM SIGKDD, GfKl (German Classification Society), IEEE Computer Society
Languages
English, German (first language)
Citizenship and Residency
Austria, United States permanent resident
Teaching Experience
IT Internship with Thesis (in German: IT-Praktikum mit Bakkalaureatsarbeit), WU,
Spring 2005, Spring 2006, Spring 2007, Fall 2008, Spring 2009.
COAP 2120: Data Handling on the Web, Webster University (Vienna Campus), Spring II
2002.
COAP 3110: Interactive Web Site Development, Webster University (Vienna Campus),
Fall II 2002.
Introduction to Electronic Data Processing (in German: Elektronische Datenverarbeitung:
Markup-Konzepte), WU, Fall 1998.
Graduate courses
CSE 7337: Information Retrieval and Web Search, Lyle School of Engineering, SMU. Spring
2012.
CSE 8331: Advanced Topics in Data Mining, Lyle School of Engineering, SMU. Spring 2012.
CSE 8091: Advanced Scientific Computing with R, Lyle School of Engineering, SMU, Fall
2011.
CSE 8098: Computer Science Seminar, Lyle School of Engineering, SMU, Fall 2009, Spring
2010, Fall 2010, Spring 2011, Fall 2011, Spring 2012.
Process Oriented Information Management (in German: Prozessorientierte
Informationswirtschaft), WU, Fall 2006, Spring 2007.
Current Topics in Information Management (in German: Seminar aus
Informationswirtschaft), WU, Spring 2000, Fall 2000, Fall 2001, Spring 2002, Fall 2002,
Spring 2003, Spring 2004, Spring 2005, Spring 2006, Spring 2007.
Introduction to Object Oriented Programming (in German: Einführung in das
objektorientierte Programmieren), WU, Spring 1999, Fall 1999, Spring 2000, Fall 2000,
Spring 2001.
Executive programs and professional training
CSE 7343: Operating Systems and System Software, Executive Master's Program in Security
Engineering, Lyle School of Engineering, SMU, Spring 2009.
UML Basics: Introduction to Object Oriented Modeling (in German: UML-Basics: Einführung
in Objekt-Orientierte Modellierung mit der Unified Modeling Language), ADV
(Arbeitsgemeinschaft für Datenverarbeitung), Vienna, 2000 to 2001.
Introduction to Object Oriented Programming with C++ (in German: Einführung in den
Einsatz von Objekt-Orientierung mit C++), ADV (Arbeitsgemeinschaft für
Datenverarbeitung), Vienna, 2000.
University and Department Service
Chair of the Department's Undergraduate Program Committee, CSE, SMU, 2010-2012
Department Colloquium Coordinator, CSE, SMU, 2009-2012
Member of the Department's Teaching Assistant Selection Committee and Teaching Assistant
Coordinator, CSE, SMU, 2009-2012
Member of the PhD Committees for Mallik Kotamarti (SMU, 2010), Charlie Isaksson (SMU,
2009-), Yu Su (SMU, 2011), and Maya El Dayeh (SMU, 2011-)
Committee to implement a new Business Informatics Degree Program, WU, 2004-2006
Member of the Habilitation Committee for Christopher Casey, WU, 2004
Department Research Evaluation Coordinator, WU, 2002
Undergraduate EDP Exam Coordinator, WU, 1999-2002
Current Graduate Students
Xiaodian Xie: Data Stream Clustering for Financial Applications (working title, MS), SMU,
2013 (expected)
Andy Nagar: Rapid Classification and Differentiation of Short Genetic Sequences (working
title, PhD), SMU, 2013 (expected)
Sudheer Chelluboina: Data Mining in Transportation Security (working title, PhD), SMU,
2013 (expected)
Hadil Shaiba: Data Mining Models for Hurricane Intensity Prediction (working title, PhD),
SMU, 2014 (expected)
Graduates
Maya El Dayeh: Biological Pathway Completion using Network Motifs (PhD Thesis), PhD in
CS, SMU, 2012.
Akshaya Aradhya, MS in CS, SMU, 2012.
John Forrest: Stream:
DM 2012 - Data Mining, IADIS Multi Conference on Computer Science and Information
Systems (MCCIS 2012), Scientific Committee, July 2012.
PAKDD 2012 - The 16th Pacific-Asia Conference on Knowledge Discovery and Data Mining,
Program Committee, May 2012.
KDD 2011 - 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining,
Program Committee, August 2011.
QIMIE'11 - Quality Issues, Measures of Interestingness and Evaluation of Data Mining
Models, workshop organized in association with the PAKDD'11 conference, Program
Committee, May 2011.
StreamKDD'10 - Novel Data Stream Pattern Mining Techniques, workshop held in conjunction
with the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data
Mining (KDD-2010), Organizer, July 2010.
QIMIE'09 - Quality Issues, Measures of Interestingness and Evaluation of Data Mining
Models, workshop organized in association with the PAKDD'09 conference, Program
Committee, April 2009.
WebKDD 2008 - Knowledge Discovery on the Web, held in conjunction with the 14th ACM
SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2008),
Program Committee, August 2008.
GfKl 2007 - 31st Annual Conference of the German Classification Society, Session
Organizer, "Tools for Intelligent Data Analysis," March 2007.
GfKl 2006 - 30th Annual Conference of the German Classification Society, Session
Organizer, "Tools for Intelligent Data Analysis," March 2006.
WebKDD 2006 - Workshop on Web Mining and Web Usage Analysis, held in conjunction with
the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data
Mining (KDD-2006), Program Committee, August 2006.
Reviewer for International Journals
Computational Statistics & Data Analysis
Data & Knowledge Engineering (DKE)
Electronic Commerce Research
IEEE Transactions on Knowledge and Data Engineering (TKDE)
Research Funding
Position Sensitive P-Mer Frequency Clustering with Applications to Classification, Co-PI
with Margaret Dunham (PI) and Monnie McGee, NIH R21HG005912, National Human
Genome Research Institute, National Institutes of Health. $385,000, 2011-2013.
Mobile Communication Innovation Lab at SMU, Co-PI with Mark Fontenot (PI), Samsung,
$25,000 (equipment), 2011-2012.
III/EAGER: Temporal Relationships Among Clusters in Data Streams (TRACDS), Co-PI
with Margaret Dunham (PI), NSF-IIS 0948893, National Science Foundation, Division of
Information & Intelligent Systems. $180,000 + $32,000 REU supplements, 2009-2013.
An Experimentation Environment for Generating Top-N Recommendations from Binary
Data, PI, NSF I/UCRC: Net-Centric Software & Systems Consortium, $60,000, 2009.
Infrastructure for interdisciplinary research focusing on machine learning and simulation,
Co-PI with Kurt Hornik (PI), Austrian Federal Ministry of Science and Education,
€179,000 ($230,000), 2005-2008.
Digital Library WU online publications, PI, University Library of the Vienna University
of Economics and Business. €31,000 ($40,000), 2001-2009.
Supplementary funds for the virtual university project, PI, Vienna Chamber of Commerce,
€11,000 ($14,000), 2001.
Publications
Articles in journals
1. Michael Hahsler, Sudheer Chelluboina, Kurt Hornik, and Christian Buchta. The arules
R-package ecosystem: Analyzing interesting patterns from large transaction datasets.
Journal of Machine Learning Research, 12:1977-1981, 2011.
2. Michael Hahsler and Kurt Hornik. Dissimilarity Plots: A Visual Exploration Tool for
Partitional Clustering. Journal of Computational and Graphical Statistics, 20(2):335-354,
2011.
3. Rao M. Kotamarti, Michael Hahsler, Douglas Raiford, Monnie McGee, and Margaret H.
Dunham. Analyzing Taxonomic Classification Using Extensible Markov Models.
Bioinformatics, 26(18):2235-2241, 2010.
4. Margaret H. Dunham, Michael Hahsler, and Myra Spiliopoulou. Novel data stream pattern
mining, Report on the StreamKDD'10 workshop. SIGKDD Explorations, 12(2):54-55, 2010.
5. Michael Hahsler and Margaret H. Dunham. rEMM: Extensible Markov Model for data
stream clustering in R. Journal of Statistical Software, 35(5):1-31, 2010.
6. Michael Hahsler, Christian Buchta, and Kurt Hornik. Selective association rule
generation. Computational Statistics, 12(2):303-315, April 2008.
7. Michael Hahsler, Kurt Hornik, and Christian Buchta. Getting things in order: An
introduction to the R package seriation. Journal of Statistical Software, 25(3):1-34,
March 2008.
8. Michael Hahsler and Kurt Hornik. TSP - Infrastructure for the traveling salesperson
problem. Journal of Statistical Software, 23(2):1-21, December 2007.
9. Michael Hahsler and Kurt Hornik. New probabilistic interest measures for association
rules.
30. Andreas Geyer-Schulz, Michael Hahsler, and Georg Schneider. The virtual university as
a network economy. In Heinrich C. Mayr, Claudia Steinberger, Hans-Jürgen Appelrath, and
Uwe Marquardt, editors, Informatik '99, Unternehmen Hochschule '99,
Workshop-Unterlagen, pages 75-86, Bielefeld, Germany, October 1999.
Presentations and Talks
1. Recommender systems: User-facing decision support systems, February 2012. Invited talk
for EMIS 7357-Decision Support Systems, Southern Methodist University, Dallas, Texas,
February 22, 2012.
2. Recommender systems: From content to latent factor analysis, CSE Colloquium,
Department of Computer Science and Engineering, Southern Methodist University, Dallas,
Texas, September 7, 2011.
3. Dissimilarity plots: A visual exploration tool for partitional clustering, June 2011.
Invited talk, 42nd Symposium on the Interface, Cary, NC, June 1-3, 2011.
4. Visualizing association rules in hierarchical groups, June 2011. 42nd Symposium on the
Interface, Cary, NC, June 1-3, 2011.
5. Analyzing incomplete biological pathways using network motifs, May 2011. Division of
Biomedical Informatics Retreat, UT Southwestern Medical Center, Dallas, TX, May 6 and
12, 2011.
6. Temporal structure learning for clustering massive data streams in real-time, April
2011. SIAM Conference on Data Mining (SDM11), Phoenix, AZ, April 28-30, 2011.
7. Dissimilarity plots: A visual exploration tool for partitional clustering, CSE
Colloquium, Department of Computer Science and Engineering, Southern Methodist
University, Dallas, TX, April 3, 2009.
8. A probabilistic approach to association rule mining. CSE Colloquium, Department of
Computer Science and Engineering, Southern Methodist University, Dallas, Texas, October
10, 2008.
9. Generating top-N recommendations from binary profile data. Berufungsvortrag
Wirtschaftsinformatik, WU Wien, July 16, 2008.
10. Two applications of the TSP for data analysis. 31st Annual Conference of the German
Classification Society (GfKl 2007), Freiburg, March 7-9, 2007.
11. Probabilistische Ansätze in der Assoziationsanalyse. Habilitationsvortrag,
Wirtschaftsuniversität Wien, May 19, 2006.
12. An association rule mining infrastructure for the R data analysis toolbox, 30th Annual
Conference of the German Classification Society (GfKl 2006), Berlin, March 8-10, 2006.
13. Warenkorbanalyse mit Hilfe der Statistiksoftware R. WU Competence Day,
Wirtschaftsuniversität Wien, October 19, 2006.
14. Optimizing web sites for customer retention, 2005 International Workshop on Customer
Relationship Management: Data Mining Meets Marketing, November 18-19, 2005,
New York City, USA.
15. Implications of probabilistic data modeling for rule mining. 29th Annual Conference of
the German Classification Society (GfKl 2005), March 9-11, 2005, Magdeburg, Germany.
16. Discussion of a large-scale open source data collection methodology. 38th Hawaii
International Conference on System Sciences (HICSS-38), January 3-6, 2005, Hilton
Waikoloa Village, Big Island, Hawaii.
17. ePubWU - Erfahrungen mit einer Volltextplattform an der Wirtschaftsuniversität
Wien, 28. Österreichischer Bibliothekartag 2004, Linz, Austria.
18. Generating synthetic transaction data for tuning usage mining algorithms, March 2003.
27th Annual GfKl-Conference, Cottbus, Germany.
19. Software reuse with analysis patterns. AMCIS 2002, August 9-11, 2002, Dallas, Texas.
20. Evaluation of recommender algorithms for an internet information broker based on
simple association rules and on the repeat-buying theory, July 2002. WEBKDD 2002,
Edmonton, Alberta, Canada.
21. Patterns im Softwareentwicklungsprozeß, September 2001. ADV Arbeitsgemeinschaft für
Datenverarbeitung, Wien.
22. A customer purchase incidence model applied to recommender services. WEBKDD 2001,
August 2001, San Francisco, CA.
23. User-centered navigation re-design for web-based information systems. AMCIS 2000,
August 2000, Long Beach, CA.
24. Living Lectures - WU Virtual Library: Ein Lernportal, March 2000. In Vortragsreihe
"Lernen per Internet", Technische Universität Wien.
25. Living Lectures - Virtual University Projekt: Informationstechnologie im universitären
Bildungsbereich, June 1999. Global Village 99.
26. Automatic labelling of references for Internet information systems, March 1999. 23rd
Annual GfKl-Conference, Bielefeld, Germany.
Last update: 05/10/12