Proceedings of the 10th International Conference on World Wide Web 2001
DOI: 10.1145/371920.372057
|View full text |Cite
|
Sign up to set email alerts
|

Retrieving and organizing web pages by “information unit”

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
66
0
1

Year Published

2002
2002
2010
2010

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 83 publications
(67 citation statements)
references
References 9 publications
0
66
0
1
Order By: Relevance
“…query q by exactly following Formula 9(b). ρ s (q, a) is computed by a sum of the product of the similarity of each of its child c and the confidence value of c as a search via node (line [11][12][13][14]. Finally, ρ s (q, a) is normalized by a factor W q a (line 15), which is the weight of internal node a w.r.t.…”
Section: B Abstractearch and Rankingmentioning
confidence: 99%
See 1 more Smart Citation
“…query q by exactly following Formula 9(b). ρ s (q, a) is computed by a sum of the product of the similarity of each of its child c and the confidence value of c as a search via node (line [11][12][13][14]. Finally, ρ s (q, a) is normalized by a factor W q a (line 15), which is the weight of internal node a w.r.t.…”
Section: B Abstractearch and Rankingmentioning
confidence: 99%
“…Li et al [11] show the reduction from minimal reduced tree problem to the NP-complete Group Steiner Tree problem on graphs. BANKS [12] uses bidirectional expansion heuristic algorithms to search as small portion of graph as possible.…”
Section: Introductionmentioning
confidence: 99%
“…However, it suffers the same problem as those in tree model, as both of them exploit only the structure of XML data. Even worse, the problem of finding the results by increasing the sizes of reduced subtrees for keyword proximity is NP-hard [14], thus keyword search in digraph data model is heuristics-based and intrinsically expensive.…”
Section: Fig 1 Example Xml Data (With Dewey Ids)mentioning
confidence: 99%
“…each G is the smallest subgraph containing all keywords. However, the cost of finding all such G ranked by size is intrinsically expensive due to its NP-hard nature [14]. Bidirectional expansion is proposed to find ranked reduced subtrees [12], but it requires the entire visited graph in memory, and suffers an inefficiency.…”
Section: Related Workmentioning
confidence: 99%
“…Likewise, a relational database can be viewed as a graph where tuples are modeled as vertices connected via foreign-key relationships [BNH+02], and a XML database can be represented as a graph with XML elements as nodes and containment or ID-IDREF edges as hyperlinks [GSBS03,CMKS03]. Keyword search querying has emerged as one of the most effective paradigms for information discovery, especially over HTML documents in the World Wide Web [PBMW98,Kle99,LCVA01]. One of the key advantages of keyword search querying is its simplicityusers do not have to learn a complex query language, and can issue queries without any prior knowledge about the structure of the underlying data.…”
Section: Introductionmentioning
confidence: 99%