Ensemble Acoustic Modeling in Automatic Speech Recognition

Download or Read eBook Ensemble Acoustic Modeling in Automatic Speech Recognition PDF written by Xin Chen and published by . This book was released on 2011 with total page 106 pages. Available in PDF, EPUB and Kindle.

Author	: Xin Chen
Publisher	:
Total Pages	: 106
Release	: 2011
ISBN-10	: OCLC:872561309
ISBN-13	:
Rating	: 4/5 (09 Downloads)

DOWNLOAD EBOOK

Book Synopsis Ensemble Acoustic Modeling in Automatic Speech Recognition by : Xin Chen

Book excerpt: In this dissertation, several new approaches of using data sampling to construct an Ensemble of Acoustic Models (EAM) for speech recognition are proposed. A straightforward method of data sampling is Cross Validation (CV) data partition. In the direction of improving inter-model diversity within an EAM for speaker independent speech recognition, we propose Speaker Clustering (SC) based data sampling. In the direction of improving base model quality as well as inter-model diversity, we further investigate the effects of several successful techniques of single model training in speech recognition on the proposed ensemble acoustic models, including Cross Validation Expectation Maximization (CVEM), Discriminative Training (DT), and Multiple Layer Perceptron (MLP) features. We have evaluated the proposed methods on TIMIT phoneme recognition task as well as on a telemedicine automatic captioning task. The proposed EAMs have led to significant improvements in recognition accuracy over conventional Hidden Markov Model (HMM) baseline systems, and the integration of EAM with CVEM, DT and MLP has also significantly improved the accuracy performances of CVEM, DT, and MLP based single model systems. We further investigated the largely unstudied factor of inter-model diversity, and proposed several methods to explicit measure inter-model diversity. We demonstrate a positive relation between enlarging inter-model diversity and increasing EAM quality. Compacting the acoustic model to a reasonable size for practical applications while maintaining a reasonable performance is needed for EAM. Toward this goal, in this dissertation, we discuss and investigate several distance measures and proposed global optimization algorithms for clustering methods. We also proposed an explicit PDT (EPDT) state tying approach that allows Phoneme data Sharing (PS) for its potential capability in accommodating pronunciation variations.

Ensemble Acoustic Modeling in Automatic Speech Recognition Related Books

Language: en
Pages: 106

Ensemble Acoustic Modeling in Automatic Speech Recognition

Authors: Xin Chen

Categories: Electronic Dissertations

Type: BOOK - Published: 2011 - Publisher:

DOWNLOAD EBOOK

In this dissertation, several new approaches of using data sampling to construct an Ensemble of Acoustic Models (EAM) for speech recognition are proposed. A str

Language: en
Pages: 200

Discriminative Training and Acoustic Modeling for Automatic Speech Recognition

Authors: Wolfgang Macherey

Categories:

Type: BOOK - Published: 2010 - Publisher:

DOWNLOAD EBOOK

Language: en
Pages: 362

Discriminant Training of Front-end and Acoustic Modeling Stages to Heterogeneous Acoustic Environments for Multi-stream Automatic Speech Recognition

Authors: Michael Lee Shire

Categories:

Type: BOOK - Published: 2000 - Publisher:

DOWNLOAD EBOOK

Language: en
Pages: 347

Speech and Audio Processing for Coding, Enhancement and Recognition

Authors: Tokunbo Ogunfunmi

Categories: Technology & Engineering

Type: BOOK - Published: 2014-10-14 - Publisher: Springer

DOWNLOAD EBOOK

This book describes the basic principles underlying the generation, coding, transmission and enhancement of speech and audio signals, including advanced statist

Language: en
Pages: 197

Acoustical and Environmental Robustness in Automatic Speech Recognition

Authors: A. Acero

Categories: Technology & Engineering

Type: BOOK - Published: 2012-12-06 - Publisher: Springer Science & Business Media

DOWNLOAD EBOOK

The need for automatic speech recognition systems to be robust with respect to changes in their acoustical environment has become more widely appreciated in rec