International Conference on Computer and Communication Engineering (ICCCE'10) 2010
DOI: 10.1109/iccce.2010.5556854
|View full text |Cite
|
Sign up to set email alerts
|

Corpus-based web document summarization using statistical and linguistic approach

Abstract: Single document summarization generates summary by extracting the representative sentences from the document. In this paper, we presented a novel technique for summarization of domain-specific text from a single web document that uses statistical and linguistic analysis on the text in a reference corpus and the web document. The proposed summarizer uses the combinational function of Sentence Weight ( ) and Subject Weight ( ) to determine the rank of a sentence, where is the function of number of terms ( ) and … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2011
2011
2022
2022

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 10 publications
0
5
0
Order By: Relevance
“…Suanmali et al proposed feature based sentence and knowledge extraction technique in [16], where representativeness of sentences and knowledge depended on fused features like normalization, term frequencies and number of proper nouns. We did not fuse these features rather we used them independently as [9] showed that independent features perform better on domain-specific information retrieval. Cao et al [2] weighed adjectives to extract concepts from commonsense knowledge.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Suanmali et al proposed feature based sentence and knowledge extraction technique in [16], where representativeness of sentences and knowledge depended on fused features like normalization, term frequencies and number of proper nouns. We did not fuse these features rather we used them independently as [9] showed that independent features perform better on domain-specific information retrieval. Cao et al [2] weighed adjectives to extract concepts from commonsense knowledge.…”
Section: Related Workmentioning
confidence: 99%
“…For example, a list of Term Frequencies [9] is given in Table II. When considering domain-specific knowledge, it is usual that some commonsense will be present in more than one knowledge.…”
Section: A Development Of Commonsense Knowledge-basementioning
confidence: 99%
See 1 more Smart Citation
“…Since then, a large number of techniques and approaches have been developed. Interestingly, the large volumes of information created on the web have triggered much of this development (Nenkova & McKeown, 2011;Shams, Hashem, Hossain, Akter, & Gope, 2010). Bhargava, Sharma, and Sharma (2016) posit that text summarization tools have now become a necessity to navigate the information on the web because they help eliminate dispensable or superfluous content.…”
Section: Overview Of Automated Text Summarization and Evaluationmentioning
confidence: 99%
“…Text units with a concentration of high-score words are often likely contenders for extraction (Liu & Liu, 2009). Extraction-based summarization, then, is essentially concerned with evaluating the salience or the indicative power of each sentence in a given document (Shams et al, 2010). Figure 1 maps out the process flow for extraction-based systems.…”
Section: Extraction-based Text Summarizationmentioning
confidence: 99%