We present the KELLY project and its work on developing monolingual and bilingual word lists for language learning, using corpus methods, for nine languages and thirty-six language pairs. We describe the method and discuss the many challenges encountered. We have loaded the data into an online database to make it accessible for anyone to explore and we present our own first explorations of it. The focus of the paper is thus twofold, covering pedagogical and methodological aspects of the lists' construction, and linguistic aspects of the by-product of the project, the KELLY database.We would like to dedicate this paper to our colleague Frieda Charalabopoulou, who died, following a long struggle with cancer, between its writing and its publication.
This report describes the development of a parsing system for written Swedish and is focused on a grammar, the main component of the system, semiautomatically extracted from corpora. A cascaded, finite-state algorithm is applied to the grammar in which the input contains coarse-grained semantic class information, and the output produced reflects not only the syntactic structure of the input, but grammatical functions as well. The grammar has been tested on a variety of random samples of different text genres, achieving precision and recall of 94.62% and 91.92% respectively, and average crossing rate of 0.04, when evaluated against manually disambiguated, annotated texts.
We present T-MASTER, a tool for assessing students' reading skills on a variety of dimensions. T-MASTER uses sophisticated measures for assessing a student's reading comprehension and vocabulary understanding. Texts are selected based on their difficulty using novel readability measures and tests are created based on the texts. The results are analyzed in T-MASTER, and the numerical results are mapped to textual descriptions that describe the student's reading abilities on the dimensions being analysed. These results are presented to the teacher in a form that is easily comprehensible, and lends itself to inspection of each individual student's results.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.