ITEM METADATA RECORD
Title: Speech recognition with segmental conditional random fields: a summary of the JHU CLSP 2010 summer workshop
Authors: Zweig, G.
Nguyen, P.
Van Compernolle, Dirk
Demuynck, Kris
Atlas, L.
Clark, P.
Sell, G.
Wang, M.
Sha, F.
Hermansky, H.
Karakos, D.
Jansen, A.
Thomas, S.
Sivaram, G.S.V.S.
Bowman, S.
Kao, J.
Issue Date: 2011
Publisher: IEEE
Host Document: Proceedings 36th international conference on acoustics, speech and signal processing - ICASSP’2011 pages:5044-5047
Series Title: International Conference on Acoustics Speech and Signal Processing ICASSP
Conference: International conference on acoustics, speech and signal processing - ICASSP’2011 edition:36 location:Prague, Czech Republic date:22-27 May 2011
Abstract: This paper summarizes the 2010 CLSP Summer Workshop on speech recognition at Johns Hopkins University. The key theme of the workshop was to improve on state-of-the-art speech recognition systems by using Segmental Conditional Random Fields (SCRFs) to integrate multiple types of information. This approach uses a state-of-the-art baseline as a springboard from which to add a suite of novel features including ones derived from acoustic templates, deep neural net phoneme detections, duration models, modulation features, and whole word point-process models. The SCRF framework is able to appropriately weight these different information sources to produce significant gains on both the Broadcast News and Wall Street Journal tasks.
Description: Zweig G., Nguyen P., Van Compernolle D., Demuynck K., Atlas L., Clark P., Sell G., Wang M., Sha F., Hermansky H., Karakos D., Jansen A., Thomas S., Sivaram G.S.V.S., Bowman S., Kao J., ''Speech recognition with segmental conditional random fields: a summary of the JHU CLSP 2010 summer workshop'', 36th international conference on acoustics, speech and signal processing - ICASSP’2011, pp. 5044-5047, May 22-27, 2011, Prague, Czech Republic.
ISBN: 978-1-4577-0539-7
ISSN: 1520-6149
Publication status: published
KU Leuven publication type: IC
Appears in Collections: ESAT - PSI, Processing Speech and Images

Files in This Item:
File: 3378.pdf
Status: Published
Size: 156 Kb
Format: Adobe PDF

These files are only available to some KU Leuven Association staff members

 



