Proceedings of the 10th ACM Symposium on Document Engineering 2010
DOI: 10.1145/1860559.1860592
|View full text |Cite
|
Sign up to set email alerts
|

Automatic selection of print-worthy content for enhanced web page printing experience

Abstract: The user experience of printing web pages has not been very good. Web pages typically contain contents that are not printworthy or informative such as side bars, footers, headers, advertisements, and auxiliary information for further browsing. Since the inclusion of such contents degrades the web printing experience, we have developed a tool that first selects the main part of the web page automatically and then allows users to make adjustments. In this paper, we describe the algorithm for selecting the main c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
4
1
1

Relationship

2
4

Authors

Journals

citations
Cited by 8 publications
(9 citation statements)
references
References 7 publications
0
9
0
Order By: Relevance
“…Extracting web content from web pages has received substantial interest by researchers [13,4,6,10,12,18,21,8,17,9,19,16]. Most of existent working has focused on article web pages.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Extracting web content from web pages has received substantial interest by researchers [13,4,6,10,12,18,21,8,17,9,19,16]. Most of existent working has focused on article web pages.…”
Section: Related Workmentioning
confidence: 99%
“…The task of extracting the main content from web pages has attracted many research works [4,10,12,13,18]. However, most of these previous works focus on article web pages, such as news articles, and blog posts.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…To select relevant content from the web page, we leverage the algorithms that are used in the HP Smart Print browser plug-in [6], the details for which are explained in prior publication [7]. This algorithm relies only on the visual appearance of the web page, making no assumptions about the language encoding or the semantics of the page, and thus is usable across all web pages regardless of type or national origin.…”
Section: Our Solutionmentioning
confidence: 99%
“…However, Web pages encoded by the HTML language are designed for viewing on PC screen and may not be suitable for other purposes, such as Web printing and e-reading on small handheld devices. Thus, extracting informative clips (referring to content areas in Web pages) attracts many research works [12,14,3,10,9]. However, most of these previous works focus on extracting article Web pages, such as news stories, encyclopedia entries, and blog posts.…”
Section: Introductionmentioning
confidence: 99%