Search Results

Ensemble Acoustic Modeling in Automatic Speech Recognition

Download or Read eBook Ensemble Acoustic Modeling in Automatic Speech Recognition PDF written by Xin Chen and published by . This book was released on 2011 with total page 106 pages. Available in PDF, EPUB and Kindle.
Ensemble Acoustic Modeling in Automatic Speech Recognition
Author :
Publisher :
Total Pages : 106
Release :
ISBN-10 : OCLC:872561309
ISBN-13 :
Rating : 4/5 (09 Downloads)

Book Synopsis Ensemble Acoustic Modeling in Automatic Speech Recognition by : Xin Chen

Book excerpt: In this dissertation, several new approaches of using data sampling to construct an Ensemble of Acoustic Models (EAM) for speech recognition are proposed. A straightforward method of data sampling is Cross Validation (CV) data partition. In the direction of improving inter-model diversity within an EAM for speaker independent speech recognition, we propose Speaker Clustering (SC) based data sampling. In the direction of improving base model quality as well as inter-model diversity, we further investigate the effects of several successful techniques of single model training in speech recognition on the proposed ensemble acoustic models, including Cross Validation Expectation Maximization (CVEM), Discriminative Training (DT), and Multiple Layer Perceptron (MLP) features. We have evaluated the proposed methods on TIMIT phoneme recognition task as well as on a telemedicine automatic captioning task. The proposed EAMs have led to significant improvements in recognition accuracy over conventional Hidden Markov Model (HMM) baseline systems, and the integration of EAM with CVEM, DT and MLP has also significantly improved the accuracy performances of CVEM, DT, and MLP based single model systems. We further investigated the largely unstudied factor of inter-model diversity, and proposed several methods to explicit measure inter-model diversity. We demonstrate a positive relation between enlarging inter-model diversity and increasing EAM quality. Compacting the acoustic model to a reasonable size for practical applications while maintaining a reasonable performance is needed for EAM. Toward this goal, in this dissertation, we discuss and investigate several distance measures and proposed global optimization algorithms for clustering methods. We also proposed an explicit PDT (EPDT) state tying approach that allows Phoneme data Sharing (PS) for its potential capability in accommodating pronunciation variations.


Ensemble Acoustic Modeling in Automatic Speech Recognition Related Books

Ensemble Acoustic Modeling in Automatic Speech Recognition
Language: en
Pages: 106
Authors: Xin Chen
Categories: Electronic Dissertations
Type: BOOK - Published: 2011 - Publisher:

DOWNLOAD EBOOK

In this dissertation, several new approaches of using data sampling to construct an Ensemble of Acoustic Models (EAM) for speech recognition are proposed. A str
Discriminative Training and Acoustic Modeling for Automatic Speech Recognition
Language: en
Pages: 200
Authors: Wolfgang Macherey
Categories:
Type: BOOK - Published: 2010 - Publisher:

DOWNLOAD EBOOK

Speech and Audio Processing for Coding, Enhancement and Recognition
Language: en
Pages: 347
Authors: Tokunbo Ogunfunmi
Categories: Technology & Engineering
Type: BOOK - Published: 2014-10-14 - Publisher: Springer

DOWNLOAD EBOOK

This book describes the basic principles underlying the generation, coding, transmission and enhancement of speech and audio signals, including advanced statist
Acoustical and Environmental Robustness in Automatic Speech Recognition
Language: en
Pages: 197
Authors: A. Acero
Categories: Technology & Engineering
Type: BOOK - Published: 2012-12-06 - Publisher: Springer Science & Business Media

DOWNLOAD EBOOK

The need for automatic speech recognition systems to be robust with respect to changes in their acoustical environment has become more widely appreciated in rec
Scroll to top