Skip to content

Detail of publication

Citation

Císař, P. and Železný, M. and Krňoul, Z. : 3D lip-tracking for audio-visual speech recognition in real applications . Prodeedings of the INTERSPEECH 2004 - ICSLP, p. 2521-2524, Sunjin Printing Co., Jeju, 2004.

Abstract

In this paper, we present a solution to the problem of tracking 3D information about the shape of lips from 2D picture of a speaker. We focus on lip-tracking of audio-visual speech recordings from the Czech in-vehicle audio-visual speech corpus (CIVAVC). The corpus consists of 4 h 40 min records of audiovisual speech of driver recorded in a car during driving in an usual traffic. In real conditions a head of a speaker (a car driver) can move and turn in various directions. To cope with this movements and to avoid recognition errors caused by changing 3D position of lips, our algorithm utilizes a 3Dmodel- based approach to the lip-tracking process.

Detail of publication

Title: 3D lip-tracking for audio-visual speech recognition in real applications
Author: Císař, P. ; Železný, M. ; Krňoul, Z.
Language: English
Date of publication: 4 Oct 2004
Year: 2004
Type of publication: Papers in proceedings of reviewed conferences
Title of journal or book: Prodeedings of the INTERSPEECH 2004 - ICSLP
Page: 2521 - 2524
Publisher: Sunjin Printing Co.
Address: Jeju
Date: 4 Oct 2004 - 8 Oct 2004
/ /

Keywords

liptracking, 3D, speech recognition, templates, 3D tracking

BibTeX

@INPROCEEDINGS{CisarP_2004_3Dlip-trackingfor,
 author = {C\'{i}sa\v{r}, P. and \v{Z}elezn\'{y}, M. and Kr\v{n}oul, Z.},
 title = {3D lip-tracking for audio-visual speech recognition in real applications},
 year = {2004},
 publisher = {Sunjin Printing Co.},
 journal = {Prodeedings of the INTERSPEECH 2004 - ICSLP},
 address = {Jeju},
 pages = {2521-2524},
 url = {http://www.kky.zcu.cz/en/publications/CisarP_2004_3Dlip-trackingfor},
}