Search Results

Model Selection Based Speaker Adaptation and Its Application to Nonnative Speech Recognition

Model Selection Based Speaker Adaptation and Its Application to Nonnative Speech Recognition was written by Xiaodong He and released in 2003; it runs to 222 pages. Available in PDF, EPUB and Kindle.
Author : Xiaodong He
Total Pages : 222
Release : 2003
OCLC : 55646957

Book Synopsis Model Selection Based Speaker Adaptation and Its Application to Nonnative Speech Recognition by : Xiaodong He

Book excerpt: Rapid globalization requires speech recognition systems to handle speech from foreign speakers as well as native speakers. Currently, most American English speech recognition systems are built from speech data of native speakers of American English. Although these systems work very well for native speakers, their performance degrades dramatically on foreign-accented speech. Moreover, because of the wide variety of foreign accents, differing levels of English proficiency, and limited data, it is generally difficult to train a specific acoustic model for each foreign accent. A practically feasible way to improve nonnative speech recognition is therefore fast model adaptation. In this dissertation, the problem of adapting acoustic models of native English speech to nonnative speakers is addressed from the perspective of adaptive model selection. The goal is to dynamically select the optimal model for each nonnative talker so as to balance model robustness to pronunciation variations against the model detail needed to discriminate speech sounds. A maximum expected likelihood (MEL) based technique is proposed for reliable model selection when adaptation data is sparse: the expectation of the log-likelihood (EL) of the adaptation data is computed from the distributions of mismatch biases between model and data, and the model that maximizes EL is selected. Moreover, to obtain reliable results when the available data is very limited, an improved prior-knowledge-guided MEL (P-MEL) approach is proposed, which uses maximum a posteriori (MAP) estimation of the bias distributions. These model selection methods are further combined with maximum likelihood linear regression (MLLR) to enable adaptation of both the structure and the parameters of the acoustic models. Experiments were performed on data from speakers with a wide range of foreign accents.
Results show that MEL-based model selection can dynamically select a proper model according to the available adaptation data, and that the P-MEL approach can achieve good performance even when the amount of data is very small. Compared with standard MLLR, the MEL+MLLR and P-MEL+MLLR methods led to consistent and significant improvements in recognition accuracy on nonnative speakers, without performance degradation on native speakers.


Model Selection Based Speaker Adaptation and Its Application to Nonnative Speech Recognition Related Books

Model Selection Based Speaker Adaptation and Its Application to Nonnative Speech Recognition
Language: en
Pages: 222
Authors: Xiaodong He
Categories: Automatic speech recognition
Type: BOOK - Published: 2003

Robust Adaptation to Non-Native Accents in Automatic Speech Recognition
Language: en
Pages: 135
Authors: Silke Goronzy
Categories: Computers
Type: BOOK - Published: 2003-07-01 - Publisher: Springer

Speech recognition technology is being increasingly employed in human-machine interfaces. A remaining problem however is the robustness of this technology to no
Speaker Adaptation in a Large-vocabulary Speech Recognizer Via VQ Prototype Modification
Language: en
Pages: 16
Authors: Dimitry Rtischev
Categories: Automatic speech recognition
Type: BOOK - Published: 1989

Abstract: "The problem of adapting the parameters of a speaker-dependent speech recognition system to a different speaker is examined with the objective of redu
Dynamic Speech Models
Language: en
Pages: 118
Authors: Li Deng
Categories: Technology & Engineering
Type: BOOK - Published: 2006-12-01 - Publisher: Morgan & Claypool Publishers

Speech dynamics refer to the temporal characteristics in all stages of the human speech communication process. This speech “chain” starts with the formation
Nonlinear Speech Modeling and Applications
Language: en
Pages: 444
Authors: Gerard Chollet
Categories: Computers
Type: BOOK - Published: 2005-07-04 - Publisher: Springer Science & Business Media

This book presents the revised tutorial lectures given at the International Summer School on Nonlinear Speech Processing-Algorithms and Analysis held in Vietri