Download PDF

IEEE workshop on spoken language technology - SLT 2016, Date: 2016/12/13 - 2016/12/16, Location: San Diego, California, USA

Publication date: 2016-01-01
Pages: 144 - 150
ISSN: 9781509049035
Publisher: IEEE

Proceedings SLT 2016

Author:

Renkens, Vincent
Tomar, Vikrant ; Van hamme, Hugo

Keywords:

PSI_SPEECH, Science & Technology, Technology, Computer Science, Artificial Intelligence, Computer Science, Vocabulary learning, Spoken language acquisition, Non-negative Matrix Factorisation, Machine learning, Bayesian methods, NONNEGATIVE MATRIX FACTORIZATION, PSI_4154

Abstract:

© 2016 IEEE. This paper discusses a spoken language acquisition system for a command-and-control interface. The proposed system learns a set of words through coupled commands and demonstrations. The user can teach the system a new command by demonstrating the uttered command through an alternative interface. With these coupled commands and demonstrations, the system can learn the acoustic representations of the used words coupled with the meaning or semantics. In previous work the focus was mainly on a batch learning scheme to train the model. All the commands and demonstrations had to be stored and the model had to be retrained from scratch every time a new demonstration was given by the user. This work presents a Bayesian learning scheme where the dictionary of learned words can be updated when new data is presented. The dictionary can automatically expand to add new words or shrink to forget old words. The proposed system is tested on a language acquisition task where the user suddenly starts using new words. The results show that the proposed system can learn the new words quicker than a baseline where the size of the dictionary cannot be adjusted.