Publikace
Detail publikace
Citace
p. 552-566, Elsevier, 2011. : On the detection of pitch marks using a robust multi-phase algorithm . Speech Communication,
Další informace
Abstrakt
A large number of methods for identifying glottal closure instants (GCIs) in voiced speech have been proposed in recent years. In this paper, we propose to take advantage of both glottal and speech signals in order to increase the accuracy of detection of GCIs. All aspects of this particular issue, from determining speech polarity to handling a delay between glottal and corresponding speech signal, are addressed. A robust multi-phase algorithm (MPA), which combines different methods applied on both signals in a unique way, is presented. Within the process, a special attention is paid to determination of speech waveform polarity, as it was found to be considerably influencing the performance of the detection algorithms. Another feature of the proposed method is that every detected GCI is given a confidence score, which allows to locate potentially inaccurate GCI subsequences. The performance of the proposed algorithm was tested and compared with other freely available GCI detection algorithms. The MPA algorithm was found to be more robust in terms of detection accuracy over various sets of sentences, languages and phone classes. Finally, some pitfalls of the GCI detection are discussed.
Detail publikace
Název: | On the detection of pitch marks using a robust multi-phase algorithm |
---|---|
Autor: | Legát, M. ; Matoušek, J. ; Tihelka, D. |
Jazyk publikace: | anglicky |
Datum vydání: | 21.1.2011 |
Rok vydání: | 2011 |
Typ publikace: | Článek z časopisu |
Název časopisu / knihy: | Speech Communication |
Strana: | 552 - 566 |
ISSN: | 0167-6393 |
Nakladatel: | Elsevier |
Klíčová slova
glottal closure instant, pitch mark, speech signal polarity, fundamental frequency
Klíčová slova v češtině
okamžik uzavření hlasivek, hlasivkový puls, polarita řečového signálu, základní hlasivková frekvence
BibTeX
@ARTICLE{LegatM_2011_Onthedetectionof, author = {Leg\'{a}t, M. and Matou\v{s}ek, J. and Tihelka, D.}, title = {On the detection of pitch marks using a robust multi-phase algorithm}, year = {2011}, publisher = {Elsevier}, journal = {Speech Communication}, pages = {552-566}, ISSN = {0167-6393}, url = {http://www.kky.zcu.cz/en/publications/LegatM_2011_Onthedetectionof}, }