Publication detail

Citation

Zbyněk Zajíc, Lukáš Machlica, and Luděk Müller: Initialization of Adaptation by Sufficient Statistics Using Phonetic Tree. IEEE 11th International Conference on Signal Processing, pp. 503-506, Beijing, 2012.

Abstract

In this work we address the problem of having only a small amount of data when estimating a feature transformation for speaker adaptation of an acoustic model. Our goal is to compensate for the lack of adaptation data through proper initialization of the transformation matrices. We describe methods used in such situations, which are based on collecting additional accumulated statistics from the nearest speakers. The proposed initialization approach is also based on accumulated statistics, but it additionally incorporates phonetic information when selecting the “nearest” statistics. Initialization methods that compensate for the absence of the actual speaker’s data are tested on telephone recordings with different amounts of adaptation data. In the worst case, with an extremely small amount of adaptation data, a relative improvement of 5% is obtained.

Abstract (translated from Czech)

The article describes a solution to the problem of a small amount of data for fMLLR adaptation. It proposes initializing the statistics with data from the nearest speakers, using information about phonetic membership.

Publication details

Title: Initialization of Adaptation by Sufficient Statistics Using Phonetic Tree
Authors: Zbyněk Zajíc; Lukáš Machlica; Luděk Müller
Publication language: English
Year of publication: 2012
Publication type: Paper in proceedings
Journal / book title: IEEE 11th International Conference on Signal Processing
Pages: 503-506
Place of publication: Beijing
/ 2013-02-18 16:34:32 /

Keywords

speech recognition, adaptation, initialization, phonetic tree

Keywords in Czech

rozpoznávání řeči (speech recognition), adaptace (adaptation), inicializace (initialization), fonetický strom (phonetic tree)

BibTeX

@INPROCEEDINGS{ZbynekZajic_2012_Initializationof,
 author = {Zbyn\v{e}k Zaj\'{i}c and Luk\'{a}\v{s} Machlica and Lud\v{e}k M\"{u}ller},
 title = {Initialization of Adaptation by Sufficient Statistics Using Phonetic Tree},
 year = {2012},
 booktitle = {IEEE 11th International Conference on Signal Processing},
 address = {Beijing},
 pages = {503--506},
 url = {http://www.kky.zcu.cz/en/publications/ZbynekZajic_2012_Initializationof},
}