2020
DOI: 10.1080/01615440.2020.1803166

The reuse of texts in Finnish newspapers and journals, 1771–1920: A digital humanities perspective

Abstract: The digital collections of newspapers have given rise to a growing interest in studying them with computational methods. This article contributes to this discussion by presenting a method for detecting text reuse in a large corpus of digitized texts. Empirically, the article is based on the corpus of newspapers and journals from the collection of the National Library of Finland. Often, digitized repositories offer only partial views of what actually was published in printed form. The Finnish collection is uniq…

Cited by 18 publications (17 citation statements)
References 14 publications
“…Our approach to this problem consisted in providing users with as many filters as possible, as a powerful way of sifting through the large number of clusters extracted by Passim. One example is the long-term reuse of newspaper contents (Salmi et al, 2019), i.e., articles that are reprinted over and over in a relatively long period of time: Users can refine their query by setting a filter on the time span of the cluster, so that only clusters consisting of articles covering a time span of, e.g., 10 years are retained. This first version mainly supports cluster- and document-level research with a basic set of search options and filters without distant reading perspectives.…”
Section: Integration Of Text Reuse Data In Impresso (mentioning)
confidence: 99%
“…The table above illustrates how the number of stigmatizing newspaper discourses about magical healing was highest in 1880-99, with thirty-nine texts out of fifty-four. Of these fifty-four newspaper texts, eight were published as duplicates (for reuse of Finnish newspaper texts, see Salmi et al 2021).…”
Section: Religious Discourse (mentioning)
confidence: 99%
“…This dataset contains articles from all newspapers and most periodicals that have been published in Finland from 1771 to 1917. Several studies have used parts of this dataset to investigate such issues as the development of the public sphere in Finland, the evolution of ideological terms in nineteenth-century Finland and the changing vocabulary of Finnish newspapers [30,15,14,10,18,19,22,25,11].…”
Section: Data (mentioning)
confidence: 99%