Mapping the Early Modern News Flow: An Enquiry by Robust Text Reuse Detection

Early modern printed gazettes relied on a system of news exchange and text reuse largely based on handwritten sources. The reconstruction of this information exchange system is possible by detecting reused texts. We present a method to individuate text borrowings within noisy OCRed texts from printed gazettes based on string kernels and local text alignment. We apply our methods on a corpus of Italian gazettes for the year 1648. Beside unveiling substantial overlaps in news sources, we are able to assess the editorial policy of different gazettes and account for a multi-faceted system of text reuse.


Editor(s):
Aiello, Luca Maria
Mcfarland, Daniel
Published in:
Social Informatics, 8852, 244-253
Presented at:
HistoInformatics 2014
Year:
2015
Publisher:
Cham, Springer International Publishing
Keywords:
Laboratories:




 Record created 2015-04-07, last modified 2018-09-13


Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)