Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations 2014
DOI: 10.3115/v1/p14-5012
|View full text |Cite
|
Sign up to set email alerts
|

WoSIT: A Word Sense Induction Toolkit for Search Result Clustering and Diversification

Abstract: In this demonstration we present WoSIT, an API for Word Sense Induction (WSI) algorithms. The toolkit provides implementations of existing graph-based WSI algorithms, but can also be extended with new algorithms. The main mission of WoSIT is to provide a framework for the extrinsic evaluation of WSI algorithms, also within end-user applications such as Web search result clustering and diversification.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 14 publications
0
1
0
Order By: Relevance
“…There are computational challenges and limitations when exploiting the text mining tools (e.g., Radinsky et al, 2012;Asghar et al, 2014), particularly problems when extracting information from unstructured sources such as websites (Vannella et al, 2014). One challenging problem arises from the use of part of speech taggers, including incorrect tagging of words, vague classification of entities (Asghar et al, 2014;Jurafsky and Martin, 2014), language-related issues (Pinto et al, 2016), extraction of temporal expressions (Mani and Wilson, 2000;Kisilevich et al, 2010;Bögel et al, 2014;Derczynski and Gaizauskas, 2015) and geographical locations (Kisilevich et al, 2010;Piskorski and Atkinson, 2011) as discussed by Gupta (2016).…”
Section: Methodsmentioning
confidence: 99%
“…There are computational challenges and limitations when exploiting the text mining tools (e.g., Radinsky et al, 2012;Asghar et al, 2014), particularly problems when extracting information from unstructured sources such as websites (Vannella et al, 2014). One challenging problem arises from the use of part of speech taggers, including incorrect tagging of words, vague classification of entities (Asghar et al, 2014;Jurafsky and Martin, 2014), language-related issues (Pinto et al, 2016), extraction of temporal expressions (Mani and Wilson, 2000;Kisilevich et al, 2010;Bögel et al, 2014;Derczynski and Gaizauskas, 2015) and geographical locations (Kisilevich et al, 2010;Piskorski and Atkinson, 2011) as discussed by Gupta (2016).…”
Section: Methodsmentioning
confidence: 99%