Search Results

Hidden Conditional Random Fields for Speech Recognition

Download or Read eBook Hidden Conditional Random Fields for Speech Recognition PDF written by Yun-Hsuan Sung and published by Stanford University. This book was released on 2010 with total page 161 pages. Available in PDF, EPUB and Kindle.
Hidden Conditional Random Fields for Speech Recognition
Author :
Publisher : Stanford University
Total Pages : 161
Release :
ISBN-10 : STANFORD:zn927hy7753
ISBN-13 :
Rating : 4/5 (53 Downloads)

Book Synopsis Hidden Conditional Random Fields for Speech Recognition by : Yun-Hsuan Sung

Book excerpt: This thesis investigates using a new graphical model, hidden conditional random fields (HCRFs), for speech recognition. Conditional random fields (CRFs) are discriminative sequence models that have been successfully applied to several tasks in text processing, such as named entity recognition. Recently, there has been increasing interest in applying CRFs to speech recognition due to the similarity between speech and text processing. HCRFs are CRFs augmented with hidden variables that are capable of representing the dynamic changes and variations in speech signals. HCRFs also have the ability to incorporate correlated features from both speech signals and text without making strong independence assumptions among them. This thesis presents my current research on applying HCRFs to speech recognition and HCRFs' potential to replace the current hidden Markov model (HMM) for acoustic modeling. Experimental results of phone classification, phone recognition, and speaker adaptation are presented and discussed. Our monophone HCRFs outperform both maximum mutual information estimation (MMIE) and minimum phone error (MPE) trained HMMs and achieve the-start-of-the-art performance in TIMIT phone classification and recognition tasks. We also show how to jointly train acoustic models and language models in HCRFs, which shows improvement in the results. Maximum a posterior (MAP) and maximum conditional likelihood linear regression (MCLLR) successfully adapt speaker-independent models to speaker-dependent models with a small amount of adaptation data for HCRF speaker adaptation. Finally, we explore adding gender and dialect features for phone recognition, and experimental results are presented.


Hidden Conditional Random Fields for Speech Recognition Related Books

Hidden Conditional Random Fields for Speech Recognition
Language: en
Pages: 161
Authors: Yun-Hsuan Sung
Categories:
Type: BOOK - Published: 2010 - Publisher: Stanford University

DOWNLOAD EBOOK

This thesis investigates using a new graphical model, hidden conditional random fields (HCRFs), for speech recognition. Conditional random fields (CRFs) are d
An Introduction to Conditional Random Fields
Language: en
Pages: 120
Authors: Charles Sutton
Categories: Computers
Type: BOOK - Published: 2012 - Publisher: Now Pub

DOWNLOAD EBOOK

An Introduction to Conditional Random Fields provides a comprehensive tutorial aimed at application-oriented practitioners seeking to apply CRFs. The monograph
The Application of Hidden Markov Models in Speech Recognition
Language: en
Pages: 125
Authors: Mark Gales
Categories: Automatic speech recognition
Type: BOOK - Published: 2008 - Publisher: Now Publishers Inc

DOWNLOAD EBOOK

The Application of Hidden Markov Models in Speech Recognition presents the core architecture of a HMM-based LVCSR system and proceeds to describe the various re
Spoken Language Understanding
Language: en
Pages: 443
Authors: Gokhan Tur
Categories: Language Arts & Disciplines
Type: BOOK - Published: 2011-05-03 - Publisher: John Wiley & Sons

DOWNLOAD EBOOK

Spoken language understanding (SLU) is an emerging field in between speech and language processing, investigating human/ machine and human/ human communication
Computational Linguistics and Intelligent Text Processing
Language: en
Pages: 683
Authors: Alexander Gelbukh
Categories: Language Arts & Disciplines
Type: BOOK - Published: 2023-02-25 - Publisher: Springer Nature

DOWNLOAD EBOOK

The two-volume set LNCS 13451 and 13452 constitutes revised selected papers from the CICLing 2019 conference which took place in La Rochelle, France, April 2019
Scroll to top