conference paper
Mapping the Early Modern News Flow: An Enquiry by Robust Text Reuse Detection
Aiello, Luca Maria
•
Mcfarland, Daniel
2015
Social Informatics
Early modern printed gazettes relied on a system of news exchange and text reuse largely based on handwritten sources. The reconstruction of this information exchange system is possible by detecting reused texts. We present a method to individuate text borrowings within noisy OCRed texts from printed gazettes based on string kernels and local text alignment. We apply our methods on a corpus of Italian gazettes for the year 1648. Beside unveiling substantial overlaps in news sources, we are able to assess the editorial policy of different gazettes and account for a multi-faceted system of text reuse.
Type
conference paper
Author(s)
Editors
Aiello, Luca Maria
•
Mcfarland, Daniel
Date Issued
2015
Publisher
Publisher place
Cham
Published in
Social Informatics
Series title/Series vol.
Lecture Notes in Computer Science
Volume
8852
Start page
244
End page
253
Subjects
Editorial or Peer reviewed
REVIEWED
Written at
EPFL
EPFL units
Event name |
Available on Infoscience
April 7, 2015
Use this identifier to reference this record