A semi-automated workflow paradigm for the distributed creation and curation of expert annotations

Hentschel, Johannes; Moss, Fabian Claude; Neuwirth, Markus; Rohrmeier, Martin

doi:10.5281/zenodo.5624417

Hentschel, Johannes; Moss, Fabian Claude; Neuwirth, Markus; Rohrmeier, Martin

2021

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

The creation and curation of labeled datasets can be an arduous, expensive, and time-consuming task. We introduce a workflow paradigm for remote consensus-building between expert annotators, while considerably reducing the associated administrative overhead through automation. Most music annotation tasks rely heavily on human interpretation and therefore defy the concept of an objective and indisputable ground truth. Thus, our paradigm invites and documents inter-annotator controversy based on a transparent set of analytical criteria, and aims at putting forth the consensual solutions emerging from such deliberations. The workflow that we suggest traces the entire genesis of annotation data, including the relevant discussions between annotators, reviewers, and curators. It adopts a well-proven pattern from collaborative software development, namely distributed version control, and allows for the automation of repetitive maintenance tasks, such as validity checks, message dispatch, or updates of meta- and paradata. To demonstrate the workflow's effectiveness, we introduce one possible implementation through GitHub Actions and showcase its success in creating cadence, phrase, and harmony annotations for a corpus of 36 trio sonatas by Arcangelo Corelli. Both code and annotated scores are freely available and the implementation can be readily used in and adapted for other MIR projects.

Details

Title A semi-automated workflow paradigm for the distributed creation and curation of expert annotations

Author(s) Hentschel, Johannes ; Moss, Fabian Claude ; Neuwirth, Markus ; Rohrmeier, Martin

Published in Proceedings of the 22nd International Society for Music Information Retrieval Conference

Pagination 262-269

Conference 22nd International Society for Music Information Retrieval Conference (ISMIR 2021), Online, November 7-12, 2021

Date 2021

DOI https://doi.org/10.5281/zenodo.5624417

Laboratories DCML

Record Appears in Scientific production and competences > CDH - College of Humanities and social sciences > Digital Humanities Institute > DCML - Digital and Cognitive Musicology Lab
Peer-reviewed publications
Conference Papers
Work produced at EPFL

Record creation date 2021-12-03

Actions

Preview

Select file: