Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics: Student Research W 2009
DOI: 10.3115/1609179.1609182
|View full text |Cite
|
Sign up to set email alerts
|

Combining a statistical language model with logistic regression to predict the lexical and syntactic difficulty of texts for FFL

Abstract: Reading is known to be an essential task in language learning, but finding the appropriate text for every learner is far from easy. In this context, automatic procedures can support the teacher's work. Some tools exist for English, but at present there are none for French as a foreign language (FFL). In this paper, we present an original approach to assessing the readability of FFL texts using NLP techniques and extracts from FFL textbooks as our corpus. Two logistic regression models based on lexical and gram… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2012
2012
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 17 publications
(13 citation statements)
references
References 23 publications
0
13
0
Order By: Relevance
“…Readability research is no exception to this rule. Since readability prediction was initially designed to identify reading material suited to the reading competence of a given individual, most of the existing data sets are drawn from textbooks and other sources intended for di↵erent competence levels (François, 2009;Heilman et al, 2008;Feng et al, 2010). Although recent annotation e↵orts have also tackled other text types (e.g.…”
Section: Etcetera)mentioning
confidence: 99%
See 1 more Smart Citation
“…Readability research is no exception to this rule. Since readability prediction was initially designed to identify reading material suited to the reading competence of a given individual, most of the existing data sets are drawn from textbooks and other sources intended for di↵erent competence levels (François, 2009;Heilman et al, 2008;Feng et al, 2010). Although recent annotation e↵orts have also tackled other text types (e.g.…”
Section: Etcetera)mentioning
confidence: 99%
“…Nevertheless, we can deduct from previous research that features which are lexical in nature, such as language modeling features, have a strong predictive power. Besides various features, more intricate prediction methods such as Naive Bayes classifiers (Collins- Thompson & Callan, 2004), logistic regression (François, 2009) and support vector machines (Schwarm & Ostendorf, 2005;Feng et al, 2010;Tanaka-Ishii et al, 2010) have come into use.…”
Section: Related Workmentioning
confidence: 99%
“…Another possibility is to use extensive language-specific preprocessing or word lists (Feng, Elhadad, & Huenerfauth, 2009;François, 2009;Heilman, Collins-Thompson, & Eskenazi, 2008). A different strategy is to use machine learning methods, such as classification (e.g., Si & Callan, 2001;Collins-Thompson & Callan, 2005;Heilman, Collins-Thompson, Callan, & Eskenazi, 2007;Suominen et al, 2008;Petersen & Ostendorf, 2009).…”
Section: Automatic Approaches For Text Difficulty Assessmentmentioning
confidence: 98%
“…Advancements in these elds have introduced more intricate prediction methods such as Naïve Bayes classi ers (Collins-Thompson and Callan 2004), logistic regression (François 2009) and support vector machines (Schwarm and Ostendorf 2005;Feng et al 2010;Tanaka-Ishii et al 2010) -and especially more complex features. Rather than a sole reliance on super cial text characteristics, the added value of features measuring lexical complexity based on n-gram modelling (Schwarm and Ostendorf 2005;Pitler and Nenkova, 2008;Kate et al 2010) or those relying on deep syntactic parsing (Schwarm and Ostendorf, 2005) have been corroborated repeatedly in the computational approaches to readability prediction that have surfaced in the last decade (Heilman et al 2007;Petersen and Ostendorf, 2009;Nenkova et al 2010).…”
Section: The Readability Interfacementioning
confidence: 99%