Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management 2007
DOI: 10.1145/1321440.1321510
|View full text |Cite
|
Sign up to set email alerts
|

The role of documents vs. queries in extracting class attributes from text

Abstract: Challenging the implicit reliance on document collections, this paper discusses the pros and cons of using query logs rather than document collections, as self-contained sources of data in textual information extraction. The differences are quantified as part of a large-scale study on extracting prominent attributes or quantifiable properties of classes (e.g., top speed, price and fuel consumption for CarModel) from unstructured text. In a head-to-head qualitative comparison, a lightweight extraction method pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2008
2008
2012
2012

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 21 publications
(13 citation statements)
references
References 20 publications
0
13
0
Order By: Relevance
“…While our research is related to query log analysis (cf. for example (Broder, 2002;Jansen et al, 2008;Lee et al, 2005;Murray & Teevan, 2007;Pasca et al, 2007)), our goal is to contribute to the problem of commonsense knowledge acquisition.…”
Section: Or (Liumentioning
confidence: 99%
“…While our research is related to query log analysis (cf. for example (Broder, 2002;Jansen et al, 2008;Lee et al, 2005;Murray & Teevan, 2007;Pasca et al, 2007)), our goal is to contribute to the problem of commonsense knowledge acquisition.…”
Section: Or (Liumentioning
confidence: 99%
“…• Frequency-based ranking: One method is to rank the selected candidates for a particular class, by frequency [10] as shown in Equation 1. Each candidate P extracted for a class C is assigned a score based on the number of instances I from C that produced P .…”
Section: Ranking Class Attributesmentioning
confidence: 99%
“…The strategy for assigning correctness labels is similar to the assessment method used in previous work [10]. An attribute is "vital" if it must be present in an ideal list of attributes for the target class; "okay" if it provides useful but non-essential information; and "wrong" if it is incorrect.…”
Section: Attribute Extractionmentioning
confidence: 99%
See 2 more Smart Citations