2016
DOI: 10.5117/tvt2016.3.pand
|View full text |Cite
|
Sign up to set email alerts
|

Tekstgenres analyseren op lexicale complexiteit met T‑Scan

Abstract: Using T-Scan to analyse the lexical complexity of text genresT-Scan is a tool for the automatic analysis of Dutch text. This paper presents the first large-scale corpus analysis with T-Scan, focusing on lexical complexity. A collection of nearly 1000 text specimens was assembled, containing ten genres: travel blogs, celebrity news features, novels, textbooks for vocational secondary schools, textbooks for general secondary schools, news reports, opinion pieces, political programs, medical advice texts and res… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(8 citation statements)
references
References 23 publications
0
7
0
1
Order By: Relevance
“…Perhaps focusing on the presence of nouns, by counting articles and prepositions (Tausczik and Pennebaker, 2010) is not a very valid way of measuring the concreteness of text. Nouns per se are not concrete; they can have different degrees of concreteness (Pander Maat and Dekker, 2016). For example, words like 'stove', 'pan', and 'meat' are considered to be concrete words, whereas words such as 'additives' and 'cereal products' are seen as more abstract -yet, all are nouns.…”
Section: Discussionmentioning
confidence: 99%
“…Perhaps focusing on the presence of nouns, by counting articles and prepositions (Tausczik and Pennebaker, 2010) is not a very valid way of measuring the concreteness of text. Nouns per se are not concrete; they can have different degrees of concreteness (Pander Maat and Dekker, 2016). For example, words like 'stove', 'pan', and 'meat' are considered to be concrete words, whereas words such as 'additives' and 'cereal products' are seen as more abstract -yet, all are nouns.…”
Section: Discussionmentioning
confidence: 99%
“…For example, word prevalence, entropy, perplexity, word count, particular character count, word probability, morpheme count, word frequencies, named entity recognition, sentence analysing features and many more. Some tooling uses more than 200 features [12], or even more than 400 formulas [5].…”
Section: Readability Toolingmentioning
confidence: 99%
“…T-Scan is a tool which provides 457 different kinds of measurement features for a Dutch document [17]. The 457 features are divided into nine groups.…”
Section: T-scanmentioning
confidence: 99%
See 1 more Smart Citation
“…A straightforward method found the question. The method tokenised the sentences and selected the first sentence with a question mark as the leading question [22]. In the fifth step, the question, answer and documentation were combined in a JSON file, as an invalidated temporary dataset.…”
Section: Datasetmentioning
confidence: 99%