Layout analysis on newspaper archives
Details
Download: 193.pdf (1847.27 [Ko])
State: Public
Version: Final published version
State: Public
Version: Final published version
Serval ID
serval:BIB_1EDF4244A052
Type
Inproceedings: an article in a conference proceedings.
Collection
Publications
Institution
Title
Layout analysis on newspaper archives
Title of the conference
Digital Humanities 2017
Publication state
Published
Issued date
2017
Peer-reviewed
Oui
Pages
409-412
Language
english
Abstract
The study of newspaper layout evolution through historical corpora has been addressed by diverse qualitative and quantitative methods in the past few years. The recent availability of large corpora of newspapers is now making the quantitative analysis of layout evolution ever more popular. This research investigates a method for the automatic detection of layout evolution on scanned images with a factorial analysis approach. The notion of eigenpages is defined by analogy with eigenfaces used in face recognition processes. The corpus of scanned newspapers that was used contains 4 million press articles, covering about 200 years of archives. This method can automatically detect layout changes of a given newspaper over time, rebuilding a part of its past publishing strategy and retracing major changes in its history in terms of layout. Besides these advantages, it also makes it possible to compare several newspapers at the same time and therefore to compare the layout changes of multiple newspapers based only on scans of their issues.
Create date
31/08/2017 15:29
Last modification date
20/08/2019 12:54