The reuse of texts in Finnish newspapers and journals, 1771–1920: A digital humanities perspective

Salmi, Hannu; Paju, Petri; Rantala, Heli; Nivala, Asko; Vesanto, Aleksi; Ginter, Filip

doi:10.1080/01615440.2020.1803166

Cited by 18 publications

(17 citation statements)

References 14 publications

“…Our approach to this problem consisted in providing users with as many filters as possible, as a powerful way of sifting through the large number of clusters extracted by Passim. One example is the long-term reuse of newspaper contents (Salmi et al, 2019), i.e., articles that are reprinted over and over in a relatively long period of time: Users can refine their query by setting a filter on the time span of the cluster, so that only clusters consisting of articles covering a time span of, e.g., 10 years are retained. This first version mainly supports cluster- and document-level research with a basic set of search options and filters without distant reading perspectives.…”

Section: Integration Of Text Reuse Data In Impresso (mentioning)

confidence: 99%

impresso Text Reuse at Scale. An interface for the exploration of text reuse data in semantically enriched historical newspapers

Düring, Romanello, Ehrmann et al. 2023, Front. Big Data

Text Reuse reveals meaningful reiterations of text in large corpora. Humanities researchers use text reuse to study, e.g., the posterior reception of influential texts or to reveal evolving publication practices of historical media. This research is often supported by interactive visualizations which highlight relations and differences between text segments. In this paper, we build on earlier work in this domain. We present impresso Text Reuse at Scale, the to our knowledge first interface which integrates text reuse data with other forms of semantic enrichment to enable a versatile and scalable exploration of intertextual relations in historical newspaper corpora. The Text Reuse at Scale interface was developed as part of the impresso project and combines powerful search and filter operations with close and distant reading perspectives. We integrate text reuse data with enrichments derived from topic modeling, named entity recognition and classification, language and document type detection as well as a rich set of newspaper metadata. We report on historical research objectives and common user tasks for the analysis of historical text reuse data and present the prototype interface together with the results of a user evaluation.

“…The table above illustrates how the number of stigmatizing newspaper discourses about magical healing was highest in 1880-99, with thirty-nine texts out of fifty-four. Of these fifty-four newspaper texts, eight were published as duplicates (for reuse of Finnish newspaper texts, see Salmi et al 2021).…”

Section: Religious Discourse (mentioning)

confidence: 99%

‘No “wise” men or women but real doctors!’

Kouvola 2022

Magical healers and physicians were among those who provided healing in the medical market of pre-modern Swedish-speaking Ostrobothnia. Using newspaper texts published in the region about local occurrences of magical healing as source material, this article examines through discourse analysis how magical healing was stigmatized in public discourse at the turn of the twentieth century. Two main discourses that stigmatize magical healing are evident from the data: the religious and enlightenment discourses. These show the power relations involved in the condemnation of magical healing as an example of the rural population’s superstition and naivety. This article offers new information about stigmatizing discourses on healing methods and practices that were considered witchcraft in a period when a community was undergoing cultural changes that affected health beliefs and power relations.

“…This dataset contains articles from all newspapers and most periodicals that have been published in Finland from 1771 to 1917. Several studies have used parts of this dataset to investigate such issues as the development of the public sphere in Finland, the evolution of ideological terms in nineteenth-century Finland and the changing vocabulary of Finnish newspapers [30,15,14,10,18,19,22,25,11].…”

Section: Data (mentioning)

confidence: 99%

Topic modelling discourse dynamics in historical newspapers

Marjanen, Zosa, Hengchen et al. 2020, Preprint

This paper addresses methodological issues in diachronic data analysis for historical research. We apply two families of topic models (LDA and DTM) on a relatively large set of historical newspapers, with the aim of capturing and understanding discourse dynamics. Our case study focuses on newspapers and periodicals published in Finland between 1854 and 1917, but our method can easily be transposed to any diachronic data. Our main contributions are a) a combined sampling, training and inference procedure for applying topic models to huge and imbalanced diachronic text collections; b) a discussion on the differences between two topic models for this type of data; c) quantifying topic prominence for a period and thus a generalization of document-wise topic assignment to a discourse level; and d) a discussion of the role of humanistic interpretation with regard to analysing discourse dynamics through topic models.
