Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Tampered Speaker Inconsistency Detection with Phonetically Aware Audio-visual Features
 
conference paper

Tampered Speaker Inconsistency Detection with Phonetically Aware Audio-visual Features

Korshunov, Pavel
•
Halstead, Michael
•
Castan, Diego
Show more
2019
International Conference on Machine Learning

The recent increase in social media based propaganda, i.e., ‘fake news’, calls for automated methods to detect tampered content. In this paper, we focus on detecting tampering in a video with a person speaking to a camera. This form of manipulation is easy to perform, since one can just replace a part of the audio, dramatically chang- ing the meaning of the video. We consider several detection approaches based on phonetic features and recurrent networks. We demonstrate that by replacing standard MFCC features with embeddings from a DNN trained for automatic speech recognition, combined with mouth landmarks (visual features), we can achieve a significant performance improvement on several challenging publicly available databases of speakers (VidTIMIT, AMI, and GRID), for which we generated sets of tampered data. The evaluations demonstrate a relative equal error rate reduction of 55% (to 4.5% from 10.0%) on the large GRID corpus based dataset and a satisfying generalization of the model on other datasets.

  • Details
  • Metrics
Type
conference paper
Author(s)
Korshunov, Pavel
Halstead, Michael
Castan, Diego
Graciarena, Martin
McLaren, Mitchell
Burns, Brian
Lawson, Aaron
Marcel, Sébastien
Date Issued

2019

Series title/Series vol.

Synthetic Realities: Deep Learning for Detecting AudioVisual Fakes

Subjects

inconsistencies detection

•

lip-syncing

•

Video tampering

Note

Best paper award in ICML workshop "Synthetic Realities: Deep Learning for Detecting AudioVisual Fakes"

URL

Related documents

http://publications.idiap.ch/downloads/papers/2019/Korshunov_AVFAKESICML_2019.pdf
Written at

EPFL

EPFL units
LIDIAP  
Event name
International Conference on Machine Learning
Available on Infoscience
September 5, 2019
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/160882
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés