On automatic annotation of meeting databases

Gatica-Perez, Daniel; McCowan, Iain A.; Barnard, Mark; Bengio, Samy; Bourlard, Hervé

report

Gatica-Perez, Daniel

•

McCowan, Iain A.

•

Barnard, Mark

2003

In this paper, we discuss meetings as an application domain for multimedia content analysis. Meeting databases are a rich data source suitable for a variety of audio, visual and multi-modal tasks, including speech recognition, people and action recognition, and information retrieval. We specifically focus on the task of semantic annotation of audio-visual (AV) events, where annotation consists of assigning labels (event names) to the data. In order to develop an automatic annotation system in a principled manner, it is essential to have a well-defined task, a standard corpus and an objective performance measure. In this work we address each of these issues to automatically annotate events based on participant interactions.

Name

rr03-06.pdf

Access type

openaccess

Size

227.98 KB

Format

Adobe PDF

Checksum (MD5)

6e1a92b449ffa416bc4a1e312d52b799