Computer Science Signal Processing

Location:

Columbus, OH

Posted:

November 29, 2012

Contact this candidate

Resume:

Xiaojia Zhao

*** ****** **********, **** **** Avenue

Columbus, OH, 43210

Cell Phone: 614-***-****

Email: ***********@*****.***

Homepage: http://www.cse.ohio-state.edu/~zhaox

Objective and Research Interests

Ph.D. candidate at Department of Computer Science and Engineering at the Ohio State University.

Looking for summer internship of 2013. Research interests include Artificial Intelligence, Robust

Speaker/Speech Recognition, Speech Separation, Speech/Signal Processing, Machine Learning and

Computational Auditory Scene Analysis (CASA).

Summary of Skills

Programming Languages: Matlab, C/C++, Java, C#

Operating Systems: Linux and Windows

Academic Background: Artificial Intelligence, Machine Learning, Speaker/Speech recognition,

Digital Signal Processing, Computer Architecture, Operating Systems, Algorithms Analysis and

Design, Probability Theory, Linear Regression, Applied Multivariate Analysis

Work Experience

Graduate Teaching Associate

March, 2012 - current

The Ohio State University Columbus, OH

Help students major in business school develop problem-solving skills using Microsoft Excel, Access, etc.

Responsible for coordinating all the TAs of the same class

Graduate Research Associate

September, 2008 to March, 2012

The Ohio State University Columbus, OH

Work with Professor DeLiang Wang in the research area of single microphone source separation and its

application on speaker recognition using machine learning techniques.

Projects

Noise-robust Speaker Recognition

Designed a robust speaker identification system that works reasonably well in a variety of noisy conditions.

We first propose a novel speaker feature, gammatone frequency cepstral coefficients (GFCC), based on an

auditory periphery model and show that this feature captures speaker characteristics and performs

substantially better than conventional speaker features under noisy conditions. To deal with noisy speech,

we apply CASA separation and then either reconstruct or marginalize corrupted components indicated by a

CASA mask. The complementary advantages of the two methods are utilized by a combination to achieve

further performance improvement.

Page 1 of 2

Analyzing Noise Robustness of GFCC and MFCC Features in Speaker Identification

In the previous project, the proposed GFCC feature exhibits superior noise robustness to the well-known

mel-frequency cepstral coefficients (MFCC). To gain a deep understanding of this, we conduct this project.

We first analyze all of their differences, which helps us to generate a number of hypotheses. For each

hypothesis, we design a corresponding set of experiments to test it. In this way, we are able to narrow

down possible explanations, which eventually reveal the desired answer.

Speaker Recognition in Noisy and Reverberant Conditions

In real world acoustic environments, speech often occurs simultaneously with noise and reverberation. Our

previous projects mainly focus on noise robustness of speaker recognition systems. We conduct this project

to explore the joint effects of noise and reverberation. We first train speaker models in selected noise-free

reverberant conditions. We then tackle noise from two different perspectives, bounded marginalization and

direct masking, based on source separation results from a deep neural network. The proposed system that

combines the two substantially improves speaker identification performance compared to related systems

in a wide range of reverberation time and signal-to-noise ratios.

Education

The Ohio State University Columbus, OH, USA

09/2008 06/2014

Ph.D. in Computer Science and Engineering (GPA: 3.94 / 4)

09/2008 06/2012

M.S. in Computer Science and Engineering (GPA: 3.94 / 4)

Major: Artificial Intelligence

Minor: Signal Processing and Statistics

Nankai University Tianjin, China

Bachelor in Engineering (GPA: 3.85 / 4. Rank 1st / 152) 09/2004 06/2008

Major: Software Engineering (Graduated with Distinction)

Honors and Awards

First-Class Student Scholarship of Nankai University

Excellent Student Leader Award of Nankai University

Outstanding High School Student Award of Henan Province of China

Publications

X. Zhao, Y. Shao, and D.L. Wang: Robust speaker identification using a CASA

front-end. Proceedings of ICASSP-11, pp. 5468-5471, 2011.

A. Narayanan, X. Zhao, D.L. Wang, and E. Fosler-Lussier: Robust speech recognition using

multiple prior models for speech reconstruction. Proceedings of ICASSP-11, pp. 4800-4803, 2011.

(The first two authors have equal contribution).

X. Zhao, Y. Shao and D.L.Wang, "CASA-Based Robust Speaker Identification," IEEE Trans.

Audio, Speech and Language Processing, vol.20, no.5, pp.1608-1616, 2012.

Resume of Xiaojia Zhao Page 2 of 2

Contact this candidate