2002
DOI: 10.1007/3-540-47887-6_26
|View full text |Cite
|
Sign up to set email alerts
|

WebFrame: In Pursuit of Computationally and Cognitively Efficient Web Mining

Abstract: Abstract. The goal of web mining is relatively simple: provide both computationally and cognitively efficient methods for improving the value of information to users of the WWW. The need for computational efficiency is well-recognized by the data mining community, which sprung from the database community concern for efficient manipulation of large datasets. The motivation for cognitive efficiency is more elusive but at least as important. In as much as cognitive efficiency can be informally construed as ease o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2007
2007
2015
2015

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 10 publications
(10 citation statements)
references
References 9 publications
0
10
0
Order By: Relevance
“…This entails having a model that can infer semantic relationships from domain knowledge obtained through an ontology or as annotations in the sequence and avoid support counting if a candidate sequence is semantically not likely to occur. A number of Ph.D. dissertations propose a web usage-mining framework which integrates usage, structure and content into the mining process for more directed constrained mining (e.g., Zheng [2004], Tanasa [2005], and Jin [2006]), but do not utilize domain knowledge in the mining process.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…This entails having a model that can infer semantic relationships from domain knowledge obtained through an ontology or as annotations in the sequence and avoid support counting if a candidate sequence is semantically not likely to occur. A number of Ph.D. dissertations propose a web usage-mining framework which integrates usage, structure and content into the mining process for more directed constrained mining (e.g., Zheng [2004], Tanasa [2005], and Jin [2006]), but do not utilize domain knowledge in the mining process.…”
Section: Discussionmentioning
confidence: 99%
“…We think that sampling should be given more attention in pattern-growth and early-pruning algorithms as a way to reduce search space and processing time for mining. An open research area is in investigating ways to sample sequences and partition the search space based on the Fibonacci Sequence [Zill 1998] as a guide for sampling or partitioning of sequences. Another way to reduce the search space while mining sequential patterns is to focus on concise representations of sequences, such as mining maximal or closed sequences.…”
Section: Seven More Features In the Taxonomy For Pattern-growth Algormentioning
confidence: 99%
“…We apply the methods of our previous work [21] to generate the usage data record and user sessions from web access logs. As a default, WebViz uses node size to represent the page visit count, node colour to indicate the average page view time, edge thickness to show the hyperlink usage count, and edge colour to represent the usage count percentage of a hyperlink (out of total count of the hyperlinks that share the same start page).…”
Section: Visualizing Web Usage Data and Patternsmentioning
confidence: 99%
“…The data preparation [21] consists of crawling the website structure, cleaning irrelevant records, and breaking the access log into user sessions. In order to cut usage records into sessions, we identify users by their authentication or cookie value, then exclude all other…”
Section: System Performancementioning
confidence: 99%
See 1 more Smart Citation