Skip to content

Detail of publication

Citation

Trmal Jan and Zelinka Jan and Luděk Müller : Adaptation of a Feedforward Artificial Neural Network Using a Linear Transform . Text, Speech and Dialogue, Lecture Notes in Computer Science, vol. 6231, p. 423-430, Springer Berlin / Heidelberg, 2010.

Download PDF

PDF

Abstract

In this paper we present a novel method for adaptation of a multi-layer perceptron neural network (MLP ANN). Nowadays, the adaptation of the ANN is usually done as an incremental retraining either of a subset or the complete set of the ANN parameters. However, since sometimes the amount of the adaptation data is quite small, there is a fundamental drawback of such approach – during retraining, the network parameters can be easily overfitted to the new data. There certainly are techniques that can help overcome this problem (early-stopping, cross-validation), however application of such techniques leads to more complex and possibly more data hungry training procedure. The proposed method approaches the problem from a different perspective. We use the fact that in many cases we have an additional knowledge about the problem. Such additional knowledge can be used to limit the dimensionality of the adaptation problem. We applied the proposed method on speaker adaptation of a phoneme recognizer based on traps (Temporal Patterns) parameters. We exploited the fact that the employed traps parameters are constructed using log-outputs of mel-filter bank and by virtue of reformulating the first layer weight matrix adaptation problem as a mel-filter bank output adaptation problem, we were able to significantly limit the number of free variables. Adaptation using the proposed method resulted in a substantial improvement of phoneme recognizer accuracy.

Detail of publication

Title: Adaptation of a Feedforward Artificial Neural Network Using a Linear Transform
Author: Trmal Jan ; Zelinka Jan ; Luděk Müller
Language: English
Date of publication: 1 Sep 2010
Year: 2010
Type of publication: Papers in journals
Title of journal or book: Text, Speech and Dialogue
Series: Lecture Notes in Computer Science
Číslo vydání: 6231
Page: 423 - 430
ISBN: 3-642-15759-9
ISSN: 0302-9743
Address: Springer Berlin / Heidelberg
Date: 6 Sep 2010 - 10 Sep 2010
/ 2011-03-15 17:36:49 /

BibTeX

@ARTICLE{TrmalJan_2010_Adaptationof,
 author = {Trmal Jan and Zelinka Jan and Lud\v{e}k M\"{u}ller},
 title = {Adaptation of a Feedforward Artificial Neural Network Using a Linear Transform},
 year = {2010},
 journal = {Text, Speech and Dialogue},
 address = {Springer Berlin / Heidelberg},
 volume = {6231},
 pages = {423-430},
 series = {Lecture Notes in Computer Science},
 ISBN = {3-642-15759-9},
 ISSN = {0302-9743},
 url = {http://www.kky.zcu.cz/en/publications/TrmalJan_2010_Adaptationof},
}