Publication details

Citation

Lukáš Machlica: High Dimensional Spaces and Modelling in the task of Speaker Recognition. University of West Bohemia, Faculty of Applied Sciences, Department of Cybernetics, Univerzitni 8, Pilsen, Czech Republic, 2012.

Abstract

Automatic speaker recognition has made significant progress in the last two decades. Huge speech corpora containing thousands of speakers recorded on several channels are at hand, and methods utilizing as much information as possible have been developed. Current state-of-the-art methods are based on Gaussian mixture models, which are used to estimate relevant statistics from feature vectors extracted from a speaker's speech; these statistics are then concatenated into a high-dimensional vector, the supervector. Methods for extracting high-dimensional supervectors, along with techniques capable of building a speaker model in such a high-dimensional space, are described in depth, and links between these methods are identified. The main emphasis is placed on the analysis of these methods and on an efficient implementation able to process the huge amounts of development data needed to train the speaker recognition system. The influence of the development corpora on recognition performance is also tested experimentally.

Publication details

Title: High Dimensional Spaces and Modelling in the task of Speaker Recognition
Author: Lukáš Machlica
Title (Czech): Vysokodimenzionální Prostory a Modelování v úloze Rozpoznávání Řečníka
Publication language: English
Publication date: 31.8.2012
Publication year: 2012
Publication type: University qualification thesis (doctoral, habilitation)
Place of publication: Univerzitni 8, Pilsen, Czech Republic
University, school: University of West Bohemia, Faculty of Applied Sciences, Department of Cybernetics
/ 2014-02-26 13:45:42 /

Keywords

Gaussian mixture models, support vector machine, supervector, factor analysis, dimensionality reduction, speaker recognition

BibTeX

@PHDTHESIS{LukasMachlica_2012_HighDimensional,
 author = {Luk\'{a}\v{s} Machlica},
 title = {High Dimensional Spaces and Modelling in the task of Speaker Recognition},
 year = {2012},
 address = {Univerzitni 8, Pilsen, Czech Republic},
 school = {University of West Bohemia, Faculty of Applied Sciences, Department of Cybernetics},
 url = {http://www.kky.zcu.cz/en/publications/LukasMachlica_2012_HighDimensional},
}