Abstract

We propose a fully automated, unsupervised, and non-intrusive method of identifying the current speaker audio-visually in a group conversation. This is achieved without specialized hardware, user interaction, or prior assignment of microphones to participants. Speakers are identified acoustically using a novel on-line speaker diarization approach. The output is then used to find the corresponding person in a four-camera video stream by approximating individual activity with computationally efficient features. We present results showing the robustness of the association on over 4.5 hours of non-scripted audio-visual meeting data.
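The abstract does not specify how the diarization output is matched to the video streams, but a common way to frame such an audio-visual association is to correlate each speaker's speech-activity pattern with each participant's visual activity over time. The sketch below illustrates that idea under purely hypothetical assumptions: a binary per-speaker speech-activity series from diarization and a per-participant motion-energy series from video, both sampled on the same time grid; the representations, names, and correlation criterion are illustrative, not the paper's method.

```python
# Minimal sketch of correlation-based audio-visual speaker association,
# assuming (hypothetically) one binary "is speaking" series per diarized
# speaker and one motion-energy series per video participant, aligned on
# a common frame grid. Illustrative only; not the paper's actual features.
import numpy as np

def associate_speakers(speaking, motion):
    """Match each diarized speaker to the participant whose visual
    activity correlates best with that speaker's speech pattern.

    speaking: (n_speakers, n_frames) binary speech-activity matrix
    motion:   (n_people, n_frames) visual motion-energy matrix
    Returns a dict mapping speaker index -> participant index.
    """
    assignment = {}
    for s in range(speaking.shape[0]):
        # Pearson correlation between this speaker's activity and
        # every participant's motion energy; keep the best match.
        scores = [np.corrcoef(speaking[s], motion[p])[0, 1]
                  for p in range(motion.shape[0])]
        assignment[s] = int(np.argmax(scores))
    return assignment

# Toy usage: speaker 0 talks in the first half, speaker 1 in the second,
# and each person moves more while speaking.
rng = np.random.default_rng(0)
speaking = np.zeros((2, 200))
speaking[0, :100] = 1
speaking[1, 100:] = 1
motion = 0.1 * rng.random((2, 200)) + speaking
print(associate_speakers(speaking, motion))  # {0: 0, 1: 1}
```

In an on-line setting such a correlation would be computed incrementally over a sliding window rather than over the full recording, which keeps the association cheap enough to run alongside the diarization.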
