Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Two-level bimodal association for audio-visual speech recognition
 
conference paper

Two-level bimodal association for audio-visual speech recognition

Lee, Jong-Seok  
•
Ebrahimi, Touradj  
Blanc-Talon, J.
2009
Proceedings of the International Conference on Advanced Concepts for Intelligent Vision Systems (ACIVS’09)
International Conference on Advanced Concepts for Intelligent Vision Systems (ACIVS’09)

This paper proposes a new method for bimodal information fusion in audio-visual speech recognition, where cross-modal association is considered in two levels. First, the acoustic and the visual data streams are combined at the feature level by using the canonical correlation analysis, which deals with the problems of audio-visual synchronization and utilizing the cross-modal correlation. Second, information streams are integrated at the decision level for adaptive fusion of the streams according to the noise condition of the given speech datum. Experimental results demonstrate that the proposed method is effective for producing noise-robust recognition performance without a priori knowledge about the noise conditions of the speech data.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

acivs_avsr#.pdf

Access type

openaccess

Size

178.61 KB

Format

Adobe PDF

Checksum (MD5)

7f36709256773f359d0717e024da0db6

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés