Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Studying Summarization Evaluation Metrics in the Appropriate Scoring Range
 
conference paper

Studying Summarization Evaluation Metrics in the Appropriate Scoring Range

Peyrard, Maxime  
January 1, 2019
57Th Annual Meeting Of The Association For Computational Linguistics (Acl 2019)
57th Annual Meeting of the Association-for-Computational-Linguistics (ACL)

In summarization, automatic evaluation metrics are usually compared based on their ability to correlate with human judgments. Unfortunately, the few existing human judgment datasets have been created as by-products of the manual evaluations performed during the DUC/FAC shared tasks. However, modem systems are typically better than the best systems submitted at the time of these shared tasks. We show that, surprisingly, evaluation metrics which behave similarly on these datasets (average-scoring range) strongly disagree in the higher-scoring range in which current systems now operate. It is problematic because metrics disagree yet we can't decide which one to trust. This is a call for collecting human judgments for high-scoring summaries as this would resolve the debate over which metrics to trust. This would also be greatly beneficial to further improve summarization systems and metrics alike.

  • Details
  • Metrics
Type
conference paper
DOI
10.18653/v1/P19-1502
Web of Science ID

WOS:000493046107060

Author(s)
Peyrard, Maxime  
Date Issued

2019-01-01

Publisher

ASSOC COMPUTATIONAL LINGUISTICS-ACL

Publisher place

Stroudsburg

Published in
57Th Annual Meeting Of The Association For Computational Linguistics (Acl 2019)
ISBN of the book

978-1-950737-48-2

Start page

5093

End page

5100

Subjects

Computer Science, Artificial Intelligence

•

Computer Science, Interdisciplinary Applications

•

Linguistics

•

Computer Science

•

Linguistics

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
DLAB  
Event nameEvent placeEvent date
57th Annual Meeting of the Association-for-Computational-Linguistics (ACL)

Florence, ITALY

Jul 28-Aug 02, 2019

Available on Infoscience
November 16, 2019
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/163159
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés