Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR 1996
DOI: 10.1145/243199.243270
|View full text |Cite
|
Sign up to set email alerts
|

On Chinese text retrieval

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
31
0

Year Published

1999
1999
2015
2015

Publication Types

Select...
5
2
2

Relationship

2
7

Authors

Journals

citations
Cited by 41 publications
(32 citation statements)
references
References 9 publications
1
31
0
Order By: Relevance
“…Sophisticated IR systems do however include features which rely on semantics captured better in words than in bigrams: query expansion through synonyms, negated antonyms and other transformations based on thesaurus lookups; named entity extraction; question answering; spelling correction; semantic inference based on natural language processing (NLP) techniques. It might be possible to base relevance feedback on bigrams but this would have to be verified; it seems more natural to use words [16,22].…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Sophisticated IR systems do however include features which rely on semantics captured better in words than in bigrams: query expansion through synonyms, negated antonyms and other transformations based on thesaurus lookups; named entity extraction; question answering; spelling correction; semantic inference based on natural language processing (NLP) techniques. It might be possible to base relevance feedback on bigrams but this would have to be verified; it seems more natural to use words [16,22].…”
Section: Discussionmentioning
confidence: 99%
“…Tong et al [24] TREC Recall, AP, R-prec., P-R curves Equal Nie et al [16] 1270KB P-R curves Direct relationship Kwok [10] TREC AP, R-prec., P@N No direct relationship Kwok [9] TREC Rel. retrieved@1000, AP, R-prec., P@N Equal, no direct relationship Palmer and Burger [20] TREC AP, R-prec.…”
Section: Workmentioning
confidence: 99%
“…It is found that approaches using either characters (bigrams) or words can lead to comparable retrieval effectiveness [6,11,12]. In [14], it is further found that the retrieval effectiveness using a character-based language model is highly competitive to, and on several collections, is even higher than that using words and bigrams.…”
Section: Related Workmentioning
confidence: 94%
“…In (Wechsler, 2000), there is also investigation on the use of subword phonemes for retrieving spoken documents in German and English. For information retrieval in Chinese, there has been some work on word segmentation for Chinese IR (Nie and Brisebois, 1996) as well as investigation on the use of words or character ngrams as indexing units for Chinese textual IR (Kwok, 1997 andRen, 1999). Subword-based indexing is useful for information retrieval in Chinese because all textual documents are made up of a finite number of characters.…”
Section: Subword-based Retrievalmentioning
confidence: 99%