Jin Wang
* ****** ***** ***: 732-***-****
Piscataway, NJ 08854 E-mail: *****@****.*******.***
SUMMARY: Experienced software and data analyst in biomedical area
EDUCATION: 2006 present Ph.D program in Computational Biology, Rutgers University, New Brunswick NJ
2005 – 2006 Ph.D program in Bioinformatics, Boston University, Boston, MA
1997 – 1999 MS in Computer Science, Rutgers University, New Brunswick NJ
1991 – 1995 BS in Computer Science, Southeast University, Nanjing, China
SKILLS
• Operating Systems: Unix(SunOS/Solaris), Linux, Windows 2000/XP
• Programming Languages: JAVA (Swing, JNDI, JDBC, RMI, Threading), Java Beans, HTML, JavaScript,
CSS, JSP, StyleReport4.0, XML, SOAP, WSDL, CORBA, Shell Script, C/C++, Basic, Perl, Codefusion
• J2EE/Web/Internet: Servlet, JSP/TagLib, JDBC, EJB, JDNI, JAXP, JMS, RMI, JTA, XLST, Swing/AWT,
Struts, SOAP, HTTP, Apache/Tomcat
• Database Systems: Oracle, Sybase, PL/SQL
EXPERIENCE:
Biology Courses Molecular Biology I, Molecular Biology II, Protein and DNA Sequence analysis,
•
Biochemistry(audit), Topics in System Biology, Seminar in Biophysics
Computer Science courses: Pattern Recognition, Artificial Intelligence, Database, Algorithm, Computer
•
graphics
Statistical Courses Linear Regression, Multivariate Analysis, Survival Analysis, Clinical Trial,
•
Interpretation of Data I, II, Probability Theory, Methods of Statistical Inference.
Sept,2006 – present Computational Biology Program, Rutgers University, New Brunswick, NJ
Ph.D Student
• To understand mechanism of different breast cancer subtypes, I work on machine learning and data mining
algorithm on microarray data analysis using Perl and R
• Working on hazard ratio and power estimation of case cohort design with continuous predictor using R in an
epidemiological problem
• Protein NMR data analysis
Sept, 2005 – Aug, 2006 Bioinformatics Program, Boston University. Boston, MA
Ph.D Student
• Working on cis-regulatory motif detection using Gibbs Sampling based multiple local alignment.
• Flux balance analysis in metabolism of Liver cell model based on experimental data. Elucidate metabolism
pathway and mass production in different nutrition intake.
• Design algorithm for transcription factor (Estrogen Receptor) binding site identification based on CpG island
chip-chip data in whole genome. Given a set of DNA sequences that share a common function, the program
will compare them to a library of sequence motifs (e.g. transcription factor binding patterns), and identify
which motifs are statistically overrepresented in the sequence set.
• Undertake a comparative genomics approach that subtracted the nonflagellated proteome of Arabidopsis
from the shared proteome of the ciliated/flagellated organisms Chlamydomonas and human to identify highly
conserved 683 protein sequences that are exclusively in organisms with flagella and basal bodies and also
includes the BBS5 Human Disease Gene. The study of model organism elucidated many critically important
conserved processes and the importance of cilia in diverse human disease. I find there is abundant
evolutionary info and dense relationships in this protein data set. I cluster and predict function of these 683
sequences to further understand the relationship between cilia disease and the function of cilia and basal
body,.
July, 2003 – Aug, 2005 AT&T Corp., Middletown, NJ
Sr. Software Engineer
An enterprise web-based J2EE Application for AT&T Business service-specific order infrastructures using Struts
hosted in clustered iPlanet application servers provides end users with account search, account management;
order composing, shopping cart, order distribution, order automation, order checkout and API to legacy
inventory in the enterprise service delivery process. .
July, 2002 - June. 2003 Johnson & Johnson, NJ
Sr. Software Engineer
Design and implement a data mining software system applying pattern recognition, machine learning and
statistics to assist drug screening automation in large gene profile data
• Working on gene data model design and schema implementation in Oracle using web services in Java and
developing Oracle applications
• Working on architecture of J&J information infrastructure, enterprise wide data warehousing, system test and
integration with different application software
• A web application with 3 tiers, where user can input search criteria, data mining is done in backend and well
organized knowledge is retrieved
• Build initially simplified models of biological systems that allow researchers to extract meaningful
information from otherwise hopelessly noisy data. I propose a systematic computational framework for
predictive modeling on biological data that relies on pattern classification
• Working on model selection problems for analysis of toxic genomics gene expression data in microarray
which held very limited data sample size but high dimensional feature space defined by size of genome
Oct. 1999 – June, 2002 Panasonic Multimedia & Networking Research Lab, Princeton, NJ
Sr. Software Engineer
Worked on integrated web telephony product which combined the voice telephone capabilities of a PBX with the
latest IP based LAN switching to deliver a completely integrated voice & data solution.
• Implement a web-centric data management and configuration scheme to provide an unified interface to
retrieve data in cache server that manages system and application memory objects using optimized
algorithm, under which different data sources are allowed including self-implemented XML file system
management and commercial database system so that media server or web browser can retrieve data in
common API and store their data persistently and safely
• Implement a remote XML based IMA system scheduling agent specific for telephony system platform which
provide task scheduling service to start and stop a XML system task like conferencing, making call between
a customer and the agent timely using multithreaded programming
• Implement Client GUI Layout Manager and GUI display policy which can optimize the layout of all client
windows and remember the last display layout in screen according to window’s location and size (GUI and
Logic) using Java JFC/Swing/JDBC, SQL, XML
• Implement a real time event driven summary calculation framework, which can record hourly agent and call
activity by call and Agent state transition machine in server using Sybase. Implement historical report Client
GUI, which can generate and print report from supervisor selected columns, agents and time period by hour,
day, and month with JCLASS, styleReport4.0, extensive Sybase SQL developing
• Developed and maintained 3-tier web based Billing and accounting software based on CDR data in a
telephony system platform using Tomcat, JSP, servlet, shared object, CDR data is stored into Sybase
database from ACD server in real time. Business Logic like producing complex SQL statement and bill
calculation is done in Tomcat server
• Implement Real Time Monitor (RTM) for ACD call center server in both server and client sides. RTM
client update call and agent activity from events multicast by RTM server in real time. It combines all the
features into one interface, making it more user-friendly and efficient for the supervision process. RTM
server receives events from ACD Server whenever an agent’s status changes, or a call status changes. The
RTM will collect all activities and process them to form real time data and summary data using Java swing,
shared object, multithreading, JDBC, SQL
May 98 – Sep. 99 Multimedia Information System Lab, CAIP, Rutgers University
Research Assistant/Software Developer
1. Project DISCIPLE (Distributed System for Collaborative Information Processing and Learning)
A project to develop an advanced groupware design that enables interactive collaboration in the context of the task
at hand
• Groupware services provided to subscribe to and publish information, exchange messages, post small notes,
and annotate regions of the shared application window
• GUI implemented to provide several types of group awareness widgets to all the imported Java Beans to make
the user aware of other users actions
• Information abstraction utilities for intelligently reducing information content while maintaining semantics
HTTP Web Proxy Server with caching using BSD socket and POSIX thread using C in Unix Environment.
•
Proxy can handle multiple HTTP requests concurrently and filter useful requests and send cache data on local
disk to client if the request is repeated
An object oriented database implementation using Java JDBC, RMI, Perl (map E-R model to object-oriented
•
model) Multi-users can access the database from remote machines concurrently. Background thread can keep
data consistency between two Sybase servers
Jan 95 – May 97 High Speed Network Lab & CERNET Research Center, Southeast University, China
Software Engineer
Participate in the following projects as a key developer.
• Data format conversion and data transmission in electronic financial data exchange system.
• National high speed network protocol implementation and testing
• Enterprise network performance evaluation, flow simulation and traffic monitoring
• First Chinese network router design and implementation.