Search Results

Neighborhood Analysis Methods in Acoustic Modeling for Automatic Speech Recognition

Neighborhood Analysis Methods in Acoustic Modeling for Automatic Speech Recognition was written by Natasha Singh-Miller and released in 2010. It runs to 134 pages and is available in PDF, EPUB and Kindle.
Neighborhood Analysis Methods in Acoustic Modeling for Automatic Speech Recognition
Author : Natasha Singh-Miller
Publisher :
Total Pages : 134
Release : 2010
OCLC : 711101743
ISBN-13 :

Book Synopsis Neighborhood Analysis Methods in Acoustic Modeling for Automatic Speech Recognition by : Natasha Singh-Miller

Book excerpt: This thesis investigates the problem of using nearest-neighbor based non-parametric methods for performing multi-class class-conditional probability estimation. The methods developed are applied to the problem of acoustic modeling for speech recognition. Neighborhood components analysis (NCA) (Goldberger et al. [2005]) serves as the departure point for this study. NCA is a non-parametric method that can be seen as providing two things: (1) low-dimensional linear projections of the feature space that allow nearest-neighbor algorithms to perform well, and (2) nearest-neighbor based class-conditional probability estimates. First, NCA is used to perform dimensionality reduction on acoustic vectors, a commonly addressed problem in speech recognition. NCA is shown to perform competitively with another dimensionality reduction technique commonly employed in speech, heteroscedastic linear discriminant analysis (HLDA) (Kumar [1997]). Second, a nearest-neighbor based model related to NCA is created to provide a class-conditional estimate that is sensitive to the possible underlying relationship between the acoustic-phonetic labels. An embedding of the labels is learned that can be used to estimate the similarity or confusability between labels. This embedding is related to the concept of error-correcting output codes (ECOC), and the proposed model is therefore referred to as NCA-ECOC. The estimates provided by this method, together with nearest-neighbor information, are shown to improve speech recognition performance (2.5% relative reduction in word error rate). Third, a model for calculating class-conditional probability estimates is proposed that generalizes GMM, NCA, and kernel density approaches. This model, called locally-adaptive neighborhood components analysis (LA-NCA), learns different low-dimensional projections for different parts of the space. The model exploits the fact that in different parts of the space different directions may be important for discrimination between the classes. This model is computationally intensive and prone to over-fitting, so methods for sub-selecting the neighbors used to provide the class-conditional estimates are explored. The estimates provided by LA-NCA are shown to give significant gains in speech recognition performance (7-8% relative reduction in word error rate) as well as in phonetic classification.
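
As a rough illustration of the two roles the excerpt attributes to NCA (a linear projection plus nearest-neighbor based probability estimates), the sketch below computes soft nearest-neighbor class estimates in the style of Goldberger et al. [2005]: each training point is weighted by exp(-squared distance) in the projected space, and the weights are summed per label. It is not the thesis's implementation. The function names, the fixed projection matrix A, the toy vectors, and the labels "aa" and "iy" are all assumptions made for the example; in NCA proper, A would be learned from data rather than chosen by hand.

```python
import math


def project(A, x):
    """Apply a linear projection A (a list of rows) to the vector x."""
    return [sum(a * v for a, v in zip(row, x)) for row in A]


def nca_class_probs(A, train_x, train_y, query):
    """Soft nearest-neighbor class estimates for `query` under projection A.

    Each training point gets weight exp(-||A q - A x||^2); the weights are
    summed per label and normalized so the estimates sum to one.
    """
    q = project(A, query)
    weights = {}
    for x, y in zip(train_x, train_y):
        z = project(A, x)
        sq_dist = sum((a - b) ** 2 for a, b in zip(q, z))
        weights[y] = weights.get(y, 0.0) + math.exp(-sq_dist)
    total = sum(weights.values())  # assumed non-zero for this toy data
    return {label: w / total for label, w in weights.items()}


# Hypothetical toy data: 2-D "acoustic" vectors whose second dimension is noise.
A = [[1.0, 0.0]]  # hand-picked 2-D -> 1-D projection that drops the noisy axis
train_x = [[0.0, 5.0], [0.1, -3.0], [2.0, 0.0], [2.1, 1.0]]
train_y = ["aa", "aa", "iy", "iy"]

probs = nca_class_probs(A, train_x, train_y, [0.05, 9.0])
print(probs)
```

Because the projection discards the second coordinate, the query is judged close to the "aa" points even though its raw second coordinate is far from every training vector, which is the kind of effect a learned low-dimensional projection is meant to achieve.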


Neighborhood Analysis Methods in Acoustic Modeling for Automatic Speech Recognition Related Books

Acoustic Modeling for Emotion Recognition
Language: en
Pages: 72
Authors: Koteswara Rao Anne
Categories: Technology & Engineering
Type: BOOK - Published: 2015-03-14 - Publisher: Springer
This book presents state of art research in speech emotion recognition. Readers are first presented with basic research and applications – gradually more adva
Ensemble Acoustic Modeling in Automatic Speech Recognition
Language: en
Pages: 106
Authors: Xin Chen
Categories: Electronic Dissertations
Type: BOOK - Published: 2011 - Publisher:
In this dissertation, several new approaches of using data sampling to construct an Ensemble of Acoustic Models (EAM) for speech recognition are proposed. A str
Intelligent Speech Signal Processing
Language: en
Pages: 210
Authors: Nilanjan Dey
Categories: Technology & Engineering
Type: BOOK - Published: 2019-06-15 - Publisher: Academic Press
Intelligent Speech Signal Processing investigates the utilization of speech analytics across several systems and real-world activities, including sharing data a
Dynamic Speech Models
Language: en
Pages: 118
Authors: Li Deng
Categories: Technology & Engineering
Type: BOOK - Published: 2006-12-01 - Publisher: Morgan & Claypool Publishers
Speech dynamics refer to the temporal characteristics in all stages of the human speech communication process. This speech “chain” starts with the formation