000192555 001__ 192555
000192555 005__ 20190316235802.0
000192555 037__ $$aCONF
000192555 245__ $$aA Probabilistic Framework for Multiple Speaker Localization
000192555 269__ $$a2013
000192555 260__ $$c2013
000192555 336__ $$aConference Papers
000192555 520__ $$aThis paper presents a novel probabilistic framework for localizing multiple speakers with a microphone array. In this framework, the generalized cross correlation function (GCC) of each microphone pair is interpreted as a probability distribution of the time difference of arrival (TDOA) and subsequently approximated as a Gaussian mixture. The distribution parameters are estimated with a weighted expectation maximization algorithm. Then, the joint distribution of the TDOA Gaussian mixtures is mapped to a multimodal distribution in the location space, where each mode represents a potential source location. The approach taken here performs the localization by 1) reducing the search space to some regions that are likely to contain a source and then 2) extracting the actual speaker locations with a numerical optimization algorithm. The effectiveness of the proposed approach is shown using the AV16.3 corpus.
000192555 700__ $$aOualil, Youssef
000192555 700__ $$0243959$$aMagimai.-Doss, Mathew$$g127186
000192555 700__ $$aFaubel, Friedrich
000192555 700__ $$aKlakow, Dietrich
000192555 7112_ $$aProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
000192555 8564_ $$uhttp://publications.idiap.ch/index.php/publications/showcite/Oualil_Idiap-RR-37-2012$$zRelated documents
000192555 909C0 $$0252189$$pLIDIAP$$xU10381
000192555 909CO $$ooai:infoscience.tind.io:192555$$pconf$$pSTI$$qGLOBAL_SET
000192555 937__ $$aEPFL-CONF-192555
000192555 970__ $$aOualil_ICASSP2013_2013/LIDIAP
000192555 973__ $$aEPFL
000192555 980__ $$aCONF