International conference on language resources and evaluation - LREC 2014, Date: 2014/05/26 - 2014/05/31, Location: Reykjavik, Iceland

Publication date: 2014-05-01
Pages: 3041 - 3044
ISBN: 9782951740884
Publisher: EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA

Proceedings LREC 2014

Author:

Pelemans, Joris
Demuynck, Kris ; Van hamme, Hugo ; Wambacq, Patrick ; Calzolari, N ; Choukri, K ; Declerck, T ; Loftsson, H ; Maegaard, B ; Mariani, J ; Moreno, A ; Odijk, J ; Piperidis, S

Keywords:

PSI_SPEECH, Social Sciences, Linguistics, Language & Linguistics, speech recognition, web services, Dutch

Abstract:

In this paper we present three applications in the domain of Automatic Speech Recognition for Dutch, all of which were developed using our in-house speech recognition toolkit SPRAAK. The speech-to-text transcriber is a large-vocabulary continuous speech recognizer, optimized for Southern Dutch. It can select components and adjust parameters on the fly, based on the conditions observed in the audio, and was recently extended with the ability to add new words to the lexicon. The grapheme-to-phoneme converter generates possible pronunciations for Dutch words, based on lexicon lookup and linguistic rules. The speech-text alignment system takes audio and text as input and constructs a time-aligned output in which every word receives exact begin and end times. All three applications (and others) are freely available, after registration, as a web application at http://www.spraak.org/webservice/ and can also be accessed as a web service from automated tools.