Search Results

Robust Acoustic Modeling and Front-end Design for Distant Speech Recognition

Download or Read eBook Robust Acoustic Modeling and Front-end Design for Distant Speech Recognition PDF written by Seyedmahdad Mirsamadi and published by . This book was released on 2017 with total page pages. Available in PDF, EPUB and Kindle.
Robust Acoustic Modeling and Front-end Design for Distant Speech Recognition
Author :
Publisher :
Total Pages :
Release :
ISBN-10 : OCLC:1029740539
ISBN-13 :
Rating : 4/5 (39 Downloads)

Book Synopsis Robust Acoustic Modeling and Front-end Design for Distant Speech Recognition by : Seyedmahdad Mirsamadi

Book excerpt: In recent years, there has been a significant increase in the popularity of voice-enabled technologies which use human speech as the primary interface with machines. Recent advancements in acoustic modeling and feature design have increased the accuracy of Automatic Speech Recognition (ASR) to levels that enable voice interfaces to be used in many applications. However, much of the current performance is dependent on the use of close-talking microphones, (i.e., scenarios in which the user speaks directly into a hand-held or body-worn microphone). There is still a rather large performance gap experienced in distant-talking scenarios in which speech is recorded by far-field microphones that are placed at a distance from the speaker. In such scenarios, the distorting effects of distance (such as room reverberation and environment noise) make the recognition task significantly more challenging. In this dissertation, we propose novel approaches for designing a distant-talking ASR front-end as well as training robust acoustic models to reduce the existing gap between far-field and close-talking ASR performance. Specifically, we i) propose a novel multi-channel front-end enhancement algorithm for improved ASR in reverberant rooms using distributed non-uniform microphone arrays with random unknown locations; ii) propose a novel neural network model training approach using adversarial training to improve the robustness of multi-condition acoustic models that are trained directly on far-field data; iii) study alternate neural network adaptation strategies for far-field adaptation to the acoustic properties of specific target environments. Experimental results are provided based on far-field benchmark tasks and datasets which demonstrate the effectiveness of the proposed approaches for increasing far-field robustness in ASR. Based on experiments using reverberated TIMIT sentences, the proposed multi-channel front-end provides WER improvements of +21.5% and +37.7% in two-channel and four-channel scenarios over a single-channel scenario in which the channel with best signal quality is selected. On the acoustic modeling side and based on results of experiments on AMI corpus, the proposed multi-domain training approach provides a relative character error rate reduction of +3.3% with respect to a conventional multi-condition trained baseline, and +25.4% with respect to a clean-trained baseline.


Robust Acoustic Modeling and Front-end Design for Distant Speech Recognition Related Books

Robust Acoustic Modeling and Front-end Design for Distant Speech Recognition
Language: en
Pages:
Authors: Seyedmahdad Mirsamadi
Categories: Acoustical engineering
Type: BOOK - Published: 2017 - Publisher:

DOWNLOAD EBOOK

In recent years, there has been a significant increase in the popularity of voice-enabled technologies which use human speech as the primary interface with mach
Robust Speech Recognition in Embedded Systems and PC Applications
Language: en
Pages: 193
Authors: Jean-Claude Junqua
Categories: Technology & Engineering
Type: BOOK - Published: 2006-04-18 - Publisher: Springer Science & Business Media

DOWNLOAD EBOOK

Robust Speech Recognition in Embedded Systems and PC Applications provides a link between the technology and the application worlds. As speech recognition techn
New Era for Robust Speech Recognition
Language: en
Pages: 433
Authors: Shinji Watanabe
Categories: Computers
Type: BOOK - Published: 2017-10-30 - Publisher: Springer

DOWNLOAD EBOOK

This book covers the state-of-the-art in deep neural-network-based methods for noise robustness in distant speech recognition applications. It provides insights
Acoustic Modeling Methods for Robust Speech Recognition in Teleservice Conditions
Language: en
Pages: 168
Authors: Robert S. van Kommer
Categories:
Type: BOOK - Published: 2005 - Publisher:

DOWNLOAD EBOOK

Robust Automatic Speech Recognition and Moduling of Auditory Discrimination with Auditory Experiments Spectro-temporal Features
Language: en
Pages:
Authors: Marc René Schädler
Categories:
Type: BOOK - Published: 2016 - Publisher:

DOWNLOAD EBOOK

Automatic speech recognition (ASR) systems still do not perform as well as human listeners under realistic conditions. The unmatched ability of humans to unders
Scroll to top