Gengbin Zheng
****E NCSA
Urbana, IL 61801
1]0.8
217-***-****(o)
******@********.***
http://charm.cs.uiuc.edu/people/gzheng
Research Interests
Parallel programming languages and paradigms
Adaptive parallel runtime systems
Performance prediction of parallel applications
Education
Ph.D., Computer Science, December 2005, University of Illinois at
Urbana-Champaign, Urbana, IL ; GPA 4.0
M.S., Computer Science, September 1998, Beijing University, China
(with honors) ; GPA 3.8
B.S., Computer Science, September 1995, Beijing University, China;
GPA 3.6
Awards
HPC Challenge class 2 Award, SC 2011, Seattle, WA
Gordon Bell Award for special accomplishment in NAMD paper,
SC 2002, Baltimore, MD.
GuangHua scholarship, Study scholarship, Beijing University, 1997
LianXiang scholarship, Beijing University, 1996
LianXiang scholarship, Beijing University, 1995
Study scholarship, Beijing University, 1993, 1994
Professional Experience
Senior Research Programmer, NCSA, University of Illinois at Urbana-Champaign,
7/2012-now
Research Scientist, Computer Science Department, University of Illinois
at Urbana-Champaign, 2010-7/2012
Research Scientist, Center for Simulation of Advanced Rockets, University
of Illinois at Urbana-Champaign, 2008-2010
Postdoctoral Research Associate, Center for Simulation of Advanced
Rockets, University of Illinois at Urbana-Champaign, 2005-2008
Graduate Research Assistant, Computer Science Department, University
of Illinois at Urbana-Champaign, 1999-2005
Intern, IBM T.J. Watson Research Center, Yorktown Heights, NY, summer
2001 - involved in BlueGene/L (World's fastest machine) project
Intern, Silicon Graphics, Inc (SGI), in the compiler group, Mountain
View, CA, summer 2000
Teaching Assistant, Computer Science Department, University of Arizona,
1998
Research assistant, Computer Science Department, Beijing University,
1995-1998
Publications
Papers in Journals and Book Chapter
Yanhua Sun, Gengbin Zheng, Pritish Jetley and Laxmikant V.
Kal\'e,``ParSSSE: An Adaptive Parallel State Space
Search Engine'', Parallel Processing Letters, 2011
Aaron Becker, Gengbin Zheng, and Laxmikant Kale, ``Distributed
Memory Load Balancing'', book chapter in Encyclopedia of Parallel
Computing, David Padua, Ed., 2011
Gengbin Zheng, Abhinav Bhatele, Esteban Meneses and Laxmikant
V. Kale; ``Periodic Hierarchical Load Balancing for Large Supercomputers'';in
International Journal of High Performance Computing, 2010
Laxmikant V. Kale and Gengbin Zheng, ``Charm++ and AMPI:
Adaptive Runtime Strategies via Migratable Objects''; book
chapter, in Advanced Computational Infrastructures for Parallel
and Distributed Applications (Wiley-Interscience), 2009
Gengbin Zheng, Hari Govind, Michael S. Breitenfeld, Hari
Govind, Philippe Geubelle, Laxmikant V. Kale, ``Automatic Dynamic
Load Balancing for a Crack Propagation Application'', submitted to
the International Journal of High Performance Computing Applications
Gengbin Zheng, Chao Huang, Laxmikant V. Kale, ``Performance
Evaluation of Automatic Checkpoint-based Fault Tolerance for AMPI
and Charm SIGOPS Operating System Review Special Issue
on HEC OS/Runtimes, 2006
Xiangmin Jiao, Gengbin Zheng, Phillip A. Alexander, John
Norris, Michael T. Campbell, Andreas Haselbacher, Michael T. Heath,
``A system integration framework for coupled multiphysics simulations'',
special issue of Engineering with Computers
on frameworks/integrated software infrastructures for scalable scientific
and engineering applications, 2006
Orion S. Lawlor, Sayantan Chakravorty, Terry L. Wilmarth, Nilesh Choudhury,
Isaac Dooley, Gengbin Zheng and Laxmikant V. Kalé, ``ParFUM:
A Parallel Framework for Unstructured Meshes for Scalable Dynamic
Physics Applications'', special issue of Engineering
with Computers on frameworks/integrated software infrastructures
for scalable scientific and engineering applications, 2006
Laxmikant V. Kale, Klaus Schulten, Robert D. Skeel, Glenn Martyna,
Mark Tuckerman, James C. Phillips, Sameer Kumar, and Gengbin
Zheng, ``Biomolecular modeling using parallel supercomputers'', book
chapter, In S. Aluru, editor, Handbook of computational molecular
biology, pp. 34.1-34.43. Taylor and Francis, 2005
Gengbin Zheng, Terry Wilmarth, Praveen Jagadishprasad, Laxmikant
V. Kalé, ``Simulation-Based Performance Prediction for Large Parallel
Machines'', International Journal of Parallel Processing, 2005
Laxmikant V. Kale, Eric Bohm, Celso L. Mendes, Terry Wilmarth, Gengbin
Zheng; ``Programming Petascale Applications with Charm++ and AMPI'';
in Petascale Computing: Algorithms and Applications, CRC Press,
2004
Laxmikant V. Kalé, Gengbin Zheng, Chee Wai Lee, Sameer
Kumar, ``Scaling Applications to Massively Parallel Machines Using
Projections Performance Analysis Tool'', Future Generation
Computer Systems, Journal, 2004
Papers in Conferences and Workshops
Yanhua Sun, Gengbin Zheng, Chao Mei, Eric Bohm, James Phillips,
Terry Jones and Laxmikant Kale, ``Optimizing Fine-grained Communication
in a Biomolecular Simulation Application on Cray XK6'', in Proceedings
of the 2012 ACM/IEEE conference on Supercomputing, 2012
Harshitha Menon, Nikhil Jain, Gengbin Zheng and Laxmikant
Kale, ``Automated Load Balancing Invocation based on Application Characteristics'',
in Proceedings IEEE International Conference on Cluster Computing
2012, Beijing China
Gengbin Zheng, Xiang Ni and L. V. Kale,``A Scalable Double
In-memory Checkpoint and Restart Scheme towards Exascale'',in Proceedings
of the 2nd Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS),
Boston, 2012
Yanhua Sun, Gengbin Zheng, Ryan Olson, Terry Jones, Laxmikant
Kale, ``A uGNI-Based Asynchronous Message-driven Runtime System for
Cray Supercomputers with Gemini Interconnect'', in IEEE International
Parallel and Distributed Processing Symposium (IPDPS), Shanghai,
China, 2012
Ehsan Totoni, Abhinav Bhatele, Eric Bohm, Nikhil Jain, Celso Mendes,
Ryan Mokos, Gengbin Zheng and Laxmikant Kale, ``Simulation-based
Performance Analysis and Tuning for the Planned Blue Waters System'',
Proceedings of the 16th International Conference on Parallel
and Distributed Systems (ICPADS), 2011
Gengbin Zheng, Stas Negara, Celso L. Mendes, Eduardo R. Rodrigues
and Laxmikant V. Kale, ``Automatic Handling of Global Variables for
Multi-threaded MPI Programs'', Proceedings of the 16th International
Conference on Parallel and Distributed Systems (ICPADS), 2011
Chao Mei and Yanhua Sun and Gengbin Zheng and Eric J. Bohm
and Laxmikant V.~Kal{\'e} and James
C.Phillips and Chris Harrison,``Enabling and Scaling Biomolecular
Simulations of 100~Million Atoms on Petascale Machines
with a Multicore-optimized Message-driven Runtime'', Proceedings
of the 2011 ACM/IEEE conference on Supercomputing, November, 2011
Yanhua Sun, Gengbin Zheng, Pritish Jetley and Laxmikant V.
Kale, An Adaptive Framework for Large-scale State Space
Search, Proceedings of Workshop on Large-Scale
Parallel Processing (LSPP) in IEEE International Parallel and Distributed
Processing Symposium (IPDPS), Anchorage (Alaska), May, 2011
Abhishek Gupta, Gengbin Zheng and Laxmikant V. Kale, ``A
Multi-level Scalable Startup for Parallel Applications'', Proceedings
of International Workshop on Runtime and Operating Systems for Supercomputers,
May, 2011
Gengbin Zheng, Gagan Gupta, Eric Bohm, Isaac Dooley, and
Laxmikant V. Kale, Simulating Large Scale Parallel
Applications using Statistical Models for Sequential Execution Blocks,
in the Proceedings of the 16th International Conference on Parallel
and Distributed Systems (ICPADS 2010), Shanghai, China, 2010
Filippo Gioachin, Gengbin Zheng, and Laxmikant V. Kale; ``Debugging
Large Scale Applications in a Virtualized Environment''; in the
Proceedings of the 23rd International Workshop on Languages and Compilers
for Parallel Computing (LCPC2010), Houston, TX, USA, October, 2010
Chao Mei, Gengbin Zheng, Filippo Gioachin and Laxmikant V.
Kale; ``Optimizing a Parallel Runtime System for Multicore Clusters:
A Case Study''; in Proceedings of TeraGrid'10, Pittsburgh,
PA, USA, August, 2010
Filippo Gioachin, Gengbin Zheng and Laxmikant V. Kale; ``Robust
Record-Replay with Processor Extraction''; in Proceedings of
the Workshop on Parallel and Distributed Systems: Testing, Analysis,
and Debugging (PADTAD - VIII), 2010
Gengbin Zheng, Esteban Meneses, Abhinav Bhatele and Laxmikant
V. Kale, ``Hierarchical Load Balancing for Large Scale Supercomputers'';
in Proceedings of the Third International Workshop on Parallel
Programming Models and Systems Software for High-End Computing (P2S2),
2010
Abhinav Bhatele, Sameer Kumar, Chao Mei, James C. Phillips, Gengbin
Zheng, Laxmikant V. Kale; ``Overcoming Scaling Challenges in Biomolecular
Simulations across Multiple Platforms''; to appear in Proceedings
of IEEE International Parallel and Distributed Processing Symposium
2008
Sameer Kumar, Chao Huang, Gengbin Zheng, Eric Bohm, Abhinav
Bhatele, James C. Phillips, Hao Yu, Laxmikant V. Kale; ``Scalable
Molecular Dynamics with NAMD on Blue Gene/L''; to appear in IBM
Journal of Research and Development 2007
David Kunzman, Gengbin Zheng, Eric Bohm, Laxmikant V. Kale;
``Charm++, Offload API, and the Cell Processor''; in PMUP Workshop
at PACT'06, September 2006
Gengbin Zheng, Orion Sky Lawlor, Laxmikant V. Kale, ``Multiple
Flows of Control in Migratable Parallel Programs'', to appear in The
8th Workshop on High Performance Scientific and Engineering Computing
(HPSEC), 2006
Laxmikant Kale, Issaac Dooley, and Gengbin Zheng, ``Handling
OS Interference Via Migratable Message-Driven Objects'', minisymposium,
SIAM Conference on Parallel Processing for Scientific Computing,
San Francisco, CA, 2006
Chao Huang, Gengbin Zheng, Sameer Kumar, Laxmikant V. Kale,
``Performance Evaluation of Adaptive MPI'', ACM SIGPLAN Symposium
on Principles and Practice of Parallel Programming, 2006
Xiangmin Jiao, Gengbin Zheng, Orion Lawlor, Phil Alexander, Mike Campbell, Michael Heath, Robert Fiedler, ``An Integration
Framework for Simulations of Solid Rocket Motors'', 41st AIAA/ASME/SAE/ASEE
Joint Propulsion Conference, July 10-13, 2005, Tucson, Arizona
Terry L. Wilmarth, Gengbin Zheng, Eric J. Bohm, Yogesh Mehta,
Praveen Jagadishprasad, Laxmikant V. Kalé, ``Performance Prediction
using Simulation of Large-scale Interconnection Networks in POSE'',
19th ACM/IEEE/SCS Workshop on Principles of Advanced and Distributed
Simulation (PADS 2005)
Gengbin Zheng, Lixia Shi, Laxmikant V. Kalé, ``FTC-Charm++:
A Parallel In-Memory Checkpoint-Based Fault Tolerant Runtime for Parallel
Systems'', Cluster Computing 2004, San Diego, California.
Gengbin Zheng, Gunavardhan Kakulapati, Laxmikant V. Kalé,
``BigSim: A Parallel Simulator for Performance Prediction of Extremely
Large Parallel Machines'', in 18th International Parallel and
Distributed Processing Symposium(IPDPS), 2004
Gengbin Zheng, Terry Wilmarth, Orion Sky Lawlor, Laxmikant
V. Kalé, Sarita Adve, David Padua, Philippe Guebelle, ``Performance
Modeling and Programming Environments for Petaflops Computers and
the Blue Gene Machine'', Next Generation Systems Program Workshop,
18th International Parallel and Distributed Processing Symposium(IPDPS),
2004
Laxmikant V. Kalé, Sameer Kumar, Gengbin Zheng, Chee Wai
Lee, ``Scaling Molecular Dynamics to 3000 Processors with Projections:
A Performance Analysis Case Study'', Terascale Performance Analysis
Workshop, International Conference on Computational Science(ICCS),
2003
James Phillips, Gengbin Zheng, Sameer Kumar, Laxmikant V.
Kalé, ``NAMD: Biomolecular Simulation on Thousands of Processors''.
SC2002, Baltimore, MD, Gordon
Bell Award winner paper.
James Phillips, Gengbin Zheng, Laxmikant V. Kalé, ``NAMD:
Biomolecular Simulation on Thousands of Processors'' in Workshop:
Scaling to New Heights, 2002, Pittsburgh Supercomputing Center.
Gengbin Zheng, Arun Singla, Joshua Unger, Laxmikant V. Kalé,
``A Parallel-Object Programming Model for PetaFLOPS Machines and
Blue Gene/Cyclops'' in Next Generation Systems Program Workshop,
16th International Parallel and Distributed Processing Symposium(IPDPS),
2002.
Zhihui Du, Wenkui Ding, Gengbin Zheng, Xiaoming Li, Zhuoqun
Xu, ``Research and Implementation of an HPF Compilation System'',
Ruan Jian Xue Bao/Journal of Software, 10(1), pp. 60-67, 1999.
Hua Xiang, Gengbin Zheng, Lixia Shi, Jianping Wang, Zhuoqun
Xu, ``Performance Analysis on DAWN with p_HPF Compiler System'',
Proceedings of DAWN User's Conference, October, 1998.
Thesis
Gengbin Zheng, ``Achieving High Performance on Extremely
Large Parallel Machines: Performance Prediction and Load Balancing'',
Ph.D. Thesis, Department of Computer Science, University of
Illinois at Urbana-Champaign, 2005
Gengbin Zheng, ``The Key Technologies and Optimizations
in an Implementation of Data Parallel Language: HPF''. M.S.
Thesis, Dept. of Computer Science, Beijing University, 1998.
Talks and Posters
Gengbin Zheng,``Parallel Runtimes for Achieving High Performance
on Large Parallel Machines'', seminar,Brookhaven National Laboratory,Feb
2010
David M. Kunzman, Gengbin Zheng, Eric Bohm, James C. Phillips,
Laxmikant V. Kale,``Charm++ Simplifies Coding for the Cell Processor'',
poster,SC 2006,Tampa,FL
Hari Govind, Gengbin Zheng, Laxmikant Kale, Michael Breitenfeld,
Philippe Geubelle, ``Speeding up Parallel Simulation with Automatic
Load Balancing poster, SC 2005, Seattle, WA
Gengbin Zheng, ``Basic Charm++ and Load Balancing'', LACSI
Symposium 2005, Santa Fe, NM
Gengbin Zheng, ``Fault Tolerance in Charm LACSI
Symposium 2005, Santa Fe, NM
Gengbin Zheng, ``Advanced Charm++ Tutorial'', Charm++
workshop, University of Illinois, 2005, Urbana, IL
Gengbin Zheng, ``An Integration Framework for Simulations
of Solid Rocket'', AIAA, 2005 Tucson, AZ
Gengbin Zheng, ``FTC-Charm++: An In-Memory Checkpoint-Based
Fault Tolerant Runtime for Charm++ and MPI'', Cluster, 2004,
San Diego, CA
Gengbin Zheng, ``BigSim Tutorial'', Charm++ workshop,
University of Illinois, 2004, Urbana, IL
Gengbin Zheng, BigSim: A Parallel Simulator for Performance
Prediction of Extremely Large Parallel Machines, IPDPS 2004,
Santa Fe, NM
Gengbin Zheng, ``Charm++ Load Balancing Framework'', Charm++
workshop, University of Illinois, 2003, Urbana, IL
L. V. Kale, Gengbin Zheng, Terry Wilmarth, ``BigSim Simulator'',
Poster, BlueGene Workshop, 2003, Reno, NV
L. V. Kale, Sayantan Chakravorty, Gengbin Zheng, ``Runtime
Support for BlueGene'', Poster, BlueGene Workshop, 2003, Reno,
NV
Gengbin Zheng, ``Parallelizing FP-growth Frequent Patterns
Mining Algorithm Using OpenMP'', Intel Corporation, 2002, Urbana,
IL
Gengbin Zheng, ``A Parallel-Object Programming Model for
PetaFLOPS Machines and BlueGene/Cyclops'', IPDPS, 2002 Fort
Lauderdale, FL
Gengbin Zheng, Arun Singla, Joshua Unger, Laxmikant Kal,
``Blue Gene Simulator and Charm poster, SC 2002, Baltimore,
MD
Gengbin Zheng, ``Charm++ on Blue Gene/C'', Charm++
workshop, University of Illinois, 2002, Urbana, IL
Gengbin Zheng, ``Exploiting the I/O processors in Bluelight'',
Poster, IBM T.J.Watson Research Center, 2001, NY
Participated Projects
Blue Waters Project -- Many Charm++-related projects, including porting
and scaling Charm++ and NAMD application on Cray supercomputer with
Gemini interconnect
NIH -- NAMD parallel molecular dynamics simulation code
NSF Next Generation Software (NGS) -- BigSim performance prediction
for petaflops scale parallel machines
DOE -- parallel rocket simulation code developed at the Center for
Simulation of Advanced Rockets (CSAR) funded by the DOE as part of
its Advanced Simulation and Computing (ASCI) program
Research and Industrial Projects
Blue Waters Project, 07 - present
The Blue Waters project aims at delivering a Cray supercomputer
capable of sustained performance of 1 petaflop on a range of real-world
science and engineering applications. It is expected to be one of
the most powerful supercomputers in the world. I have been working
on this project since the project started. My work includes using
BigSim performance simulator to predict the performance of parallel
applications (including NAMD) on future Blue Waters machine, porting
and optimizing Charm++/AMPI runtime system on Cray Gemini interconnect
using low level Cray uGNI communication library, and scaling NAMD
molecular dynamics simulation program on this machine. I have been
using supercomputers including Hopper (NERCS), Titan (ORNL), JYC/ESS
(NCSA), etc
Parallel Programming Laboratory, with Laxmikant Kalé,
UIUC, 1/99 - present
I am the lead developer of Charm++ - a parallel object-oriented
language and run-time system. My work in general involves improving
the performance and productivity in high performance computing on
supercomputers and workstation clusters with Charm++ run-time system.
My projects involves most aspects in Charm++ system and its applications,
including Adaptive MPI (AMPI), automatic dynamic load balancing techniques
to improve the scalability of parallel applications especially those
challenging applications to scale on very large parallel machines.
I also work on performance tracing and analysis tool associated with
Charm++.
My Ph.D. thesis focuses on large scale parallel simulation for predicting
performance of parallel applications on extremely large parallel machines.
With the simulation infrastructure, I explore the optimization techniques
needed in automatic load balancing to improve the parallel performance
on these machines.
For many years, I am in charge of the Charm++/AMPI software development
and maintanance work, including Charm++ regression nightly tests,
and email support for external users, etc.
I have been collaborating with several external groups on applications
including molecular dynamics simulation such as NAMD and LeanMD (with
IBM), climate simulation application (ISAM), and FEM applications
such as Fractography3D (crack propagation simulation) and Rocstar
(Rocket Simulation).
Center for Simulation of Advanced Rockets, UIUC, 5/05
- 2010
I joined the center as a postdoc research associate, my
current research topics in the center is to exploit Charm++ in the
advanced rocket simulation application to improve its parallel performance
as well as the portability. I am one of the main developers on designing
a software integration framework for multi-physics interaction and
flexible high-level orchestration modules to ease quick prototyping
of coupling schemes in the rocket simulation.
Theoretical Biophysics Group, Beckman Institute for
Advanced Science and Technology, UIUC, 1/99 - 8/04
I was one of the main developers in parallelizing NAMD
application developed in the group. NAMD is a parallel, object-oriented
molecular dynamics code designed for high-performance simulation of
large biomolecular systems. Based on Charm++ parallel objects and
the load balancing framework (which I was working on), NAMD scales
to hundreds of processors on high-end parallel platforms and tens
of processors on commodity clusters using gigabit Ethernet at that
time. Our work in NAMD won the prestigious Gordon Bell Award
in SC2002 for unprecedented speedup on a 3000 processor machine with
peak performance of a Teraflop. I was also actively involved in supporting
NAMD user community via NAMD mailing list. I interacted with a great
number of external NAMD users to help them with various portability
and performance issues with NAMD and Charm++. After I left the group,
I have been involved in NAMD project in various aspects as I continue
to work on Charm++.
High Performance Computing Technology, development of
High Performance Fortran Compiler, with Zhuoqun Xu, Beijing University,
China, 9/95 - 7/98
HPF is a data parallel programming language. The aim of
this project was to design and implement a practical HPF compiler
and runtime system. My work included compiler front-end design and
implementation, SPMD source to source code translation and implementation
of the communication runtime system. This compiler was installed on
DAWN1000, one of the fastest supercomputers built in China at that
time.
Parallel Large Scale Image Processing, collaborated
with Chinese Academy of Science, China, 9/97 - 7/98
In this project, we explored techniques to solve large scale
image processing problems using HPF and the compiler we developed.
We developed applications in HPF for image processing and improved
the HPF compiler to achieve high performance. In order to handle the
super large scale data, which can not be completely held in main memory,
we designed and implemented parallel I/O in the HPF compiler runtime
system to perform out-of-core execution for extremely large datasets.
Synergistic Activities
Member, Association for Computing Machinery (ACM)
Program Committee: IEEE International Parallel and
Distributed Processing Symposium (IPDPS), Shanghai, China, 2012
Reviewer, Journal of System and Software, Journal of Parallel
and Distributed Computing, International Journal of High Performance
Computing, Euro-Par
Technical Skills
Programming: C/C++, Fortran77, Pascal, SmallTalk, Prolog, COBOL, SML,
Perl, Unix Shell, Assembly;
Parallel Programming: Fortran90, HPF, MPI, PVM, Charm++, Pthread,
openMP, CORBA;
Web Programming: Java, JavaScript, HTML, CGI, Perl, Ruby, Rails, PHP;
Protocol: TCP/IP, HTTP;
OS: UNIX(many flavors), DOS/WindowsNT;
Database: Informix(4GL, ESQL), mysql;
GUI: X Window System(Xlib/Xt/Motif);
Software Tools: Lex, Yacc, CVS, Homepage design, Visual C++ ...
About this document ...
Gengbin Zheng
This document was generated using the
LaTeX2HTML translator Version 2008 (1.71)
Copyright © 1993, 1994, 1995, 1996,
Nikos Drakos,
Computer Based Learning Unit, University of Leeds.
Copyright © 1997, 1998, 1999,,
Mathematics Department, Macquarie University, Sydney.
The command line arguments were:
latex2html-no_subdir -ascii_mode -split 0 -no_navigation resume.tex
The translation was initiated by Gengbin Zheng on 2012-07-17
Gengbin Zheng
2012-07-17
Copyright © 1997, 1998, 1999,
Ross Moore