conference paper

Automatic Temporal Alignment of AV Data with Confidence Estimation

Korchagin, Danil; Garner, Philip N.; Dines, John
2010
2010 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper, we propose a new approach to automatic audio-based temporal alignment, with confidence estimation, of audio-visual data recorded by different cameras, camcorders or mobile phones during social events. All recorded data is temporally aligned, using ASR-related features, with a common master track recorded by a reference camera, and the confidence of each alignment is estimated. The core of the algorithm is a perceptual time-frequency analysis with a precision of 10 ms. The results show correct alignment in 99% of cases on a real-life dataset and surpass the performance of cross-correlation, with lower system requirements.
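The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea: locating a clip inside a master track on a 10 ms frame grid and attaching a confidence value to the result. Everything in it is an assumption made for illustration. Per-frame log energy stands in for the ASR-related features, the matching is a plain normalized correlation over those features (not the perceptual time-frequency method of the paper), the confidence is simply the margin between the best score and the best rival score, and the names `frame_log_energy`, `align_to_master` and `guard` are hypothetical.

```python
import math

FRAME_HOP_S = 0.010  # 10 ms frame step, the precision quoted in the abstract


def frame_log_energy(signal, sample_rate, hop_s=FRAME_HOP_S):
    """Stand-in feature (assumption): log energy of each non-overlapping frame."""
    hop = int(round(sample_rate * hop_s))
    n_frames = len(signal) // hop
    feats = []
    for i in range(n_frames):
        frame = signal[i * hop:(i + 1) * hop]
        feats.append(math.log(sum(x * x for x in frame) + 1e-10))
    return feats


def _centered(values):
    """Subtract the mean so that correlation ignores constant offsets."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def _norm(values):
    """Euclidean norm, with a small floor to avoid division by zero."""
    return math.sqrt(sum(v * v for v in values)) + 1e-10


def align_to_master(master_feat, clip_feat, guard=5):
    """Return (offset_in_frames, confidence) for the clip within the master.

    Each candidate offset is scored by normalized correlation between the
    clip features and the master features under it. The confidence is the
    best score minus the best score found more than `guard` frames away
    (an illustrative measure, not the one used in the paper).
    """
    n, m = len(master_feat), len(clip_feat)
    if m == 0 or m > n:
        raise ValueError("clip must be non-empty and no longer than the master")
    clip = _centered(clip_feat)
    clip_norm = _norm(clip)
    scores = []
    for k in range(n - m + 1):
        window = _centered(master_feat[k:k + m])
        dot = sum(w * c for w, c in zip(window, clip))
        scores.append(dot / (_norm(window) * clip_norm))
    best = max(range(len(scores)), key=scores.__getitem__)
    rivals = [s for k, s in enumerate(scores) if abs(k - best) > guard]
    runner_up = max(rivals) if rivals else -1.0
    return best, scores[best] - runner_up
```

Multiplying the returned frame offset by `FRAME_HOP_S` gives the clip's start time on the master track in seconds. A confidence close to zero means another offset scored almost as well, so the alignment should not be trusted without further checks.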

Name: Korchagin_ICASSP_2010.pdf
Access type: openaccess
Size: 364.81 KB
Format: Adobe PDF
Checksum (MD5): b20a022430a97d092dcc2e5a789dede7
