2010
DOI: 10.1007/978-3-642-14770-8_37

Semi-automatic Endogenous Enrichment of Collaboratively Constructed Lexical Resources: Piggybacking onto Wiktionary

Abstract: The lack of large-scale, freely available and durable lexical resources, and the consequences for NLP, is widely acknowledged but the attempts to cope with usual bottlenecks preventing their development often result in dead-ends. This article introduces a language-independent, semi-automatic and endogenous method for enriching lexical resources, based on collaborative editing and random walks through existing lexical relationships, and shows how this approach enables us to overcome recurrent impedime…

citations
Cited by 8 publications
(14 citation statements)
references
References 12 publications
“…Among the well known examples, we mention the Wiktionary [13]. Any user can contribute to the enrichment phase through a previously authentication.…”
Section: Enrichment Approaches Of Electronic Dictionaries (mentioning)
confidence: 99%
“…Therefore, several works was proposed such as [3] [4] [12] and [14]. The well known application of the collaborative approach for filling and updating large resources is the Wiktionary [13] that currently covers several languages. However, the mentioned works deal with a superficially structure (or model) of resources.…”
Section: Introduction (mentioning)
confidence: 99%
“…While the German version of Wiktionary appears to be the one with the most rigorous and systematic encoding, the French version, which is harder to exploit, stands out for its larger headword list and for the near-systematic presence of inflectional and phonemic information. We have released WiktionaryX, a structured XML version of this dictionary for French and English (Sajous et al., 2010). …”
Section: Critiques (unclassified)
“…This loose format makes writing a parser that automatically and exhaustively extracts information from the Wiktionnaire an arduous and never-finished task (Navarro et al., 2009; Sajous et al., 2010, 2011). Figures 1 and 2 show excerpts from the article "affluent" as it can be viewed in the Wiktionnaire, from the articles for two of its inflected forms, and the wikicode corresponding to each excerpt. The table listing the inflected forms of the adjective and the corresponding phonemic transcriptions (top right of Figure 1a) is not explicitly present in the wikicode, but is generated by the template {{fr-accord-cons|a.fly.ɑ̃|t}} (Figure 1b).…”
Section: Information Extraction (unclassified)
“…There is no documentation available regarding scope or intended granularity. A very fine grained extraction was conducted using WISIGOTH [17], but unfortunately there are no sources available and the project is unmaintained since 2010. Two newer approaches are the lexvo.org service and the algorithm presented in [9].…”
Section: Related Work (mentioning)
confidence: 99%