1990
DOI: 10.1002/(sici)1097-4571(199003)41:2<87::aid-asi2>3.0.co;2-8

INDEX: The statistical basis for an automatic conceptual phrase-indexing system

Abstract: In recent years researchers have become increasingly convinced that the performance of information retrieval systems can be greatly enhanced by the use of key phrases for automatic conceptual document indexing and retrieval. In this article we describe two programs, INDEX and INDEXD, which locate repeated phrases in a document, gather statistical information about them, and rank them according to their value as index phrases. The programs show promise as the basis for a sophisticated conceptual indexing system…
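
The abstract does not say how INDEX and INDEXD score phrases, so the following is only a minimal sketch of the general idea it describes: find word sequences that repeat within a document, count them, and rank them. The function names `repeated_phrases` and `rank_phrases`, the n-gram length limits, and the frequency-times-length score are all assumptions made for illustration, not the programs' actual method.

```python
import re
from collections import Counter


def repeated_phrases(text, min_len=2, max_len=4):
    """Return word n-grams (as tuples) that occur at least twice, with counts.

    Assumption: a "phrase" is any run of min_len..max_len consecutive words.
    Sentence boundaries are ignored in this sketch.
    """
    words = re.findall(r"[a-z0-9]+", text.lower())
    counts = Counter()
    for n in range(min_len, max_len + 1):
        for i in range(len(words) - n + 1):
            counts[tuple(words[i:i + n])] += 1
    return {phrase: c for phrase, c in counts.items() if c >= 2}


def rank_phrases(phrase_counts):
    """Rank phrases by a hypothetical score: frequency * phrase length.

    Ties are broken alphabetically so the ordering is deterministic.
    """
    scored = [(c * len(p), " ".join(p)) for p, c in phrase_counts.items()]
    return sorted(scored, key=lambda s: (-s[0], s[1]))


if __name__ == "__main__":
    sample = (
        "Information retrieval systems index documents. "
        "Information retrieval systems rank documents."
    )
    for score, phrase in rank_phrases(repeated_phrases(sample)):
        print(score, phrase)
```

Weighting by length favours longer repeated phrases over their sub-phrases. A real system would likely also filter stop words and respect sentence boundaries.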

Cited by 26 publications (12 citation statements)
References 11 publications
“…Salton 1988; Salton, Zhao, and Buckley 1990; Cherry 1990), especially for information retrieval (e.g. Hamill and Zamora 1980; Jones, Gassie, and Radhakrishnan 1990) and for natural language database query systems (e.g. Damerau 1993).…”
Section: Related Work
Citation type: mentioning
confidence: 99%
“…Use of the linguistic approaches or statistical approach alone does not provide an effective result (Pazienza et al, 2005). There are only a few works that use only the statistical method without touching any of the linguistic approaches (Jones et al, 1990; Salton et al, 1975). In this research, we therefore chose to use a hybrid approach for the lecture slide materials.…”
Section: Linguistic and Statistic Approaches For Term Extraction
Citation type: mentioning
confidence: 99%
“…We evaluated the APAT terminology using a validation approach (as in [1,7]) for calculating Precision, and a reference list (see [1]) for calculating Recall. As high Precision is a key issue for the overall application, we evaluated the 917 terms extracted by the system with the threshold set to τ=5.…”
Section: Terminology Evaluation and Analysis
Citation type: mentioning
confidence: 99%
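
The last statement above measures Precision by validating extracted terms and Recall against a reference list. As a rough illustration of how the two measures are usually computed, and not the citing authors' code, here is a minimal sketch. The function names and the toy term sets are hypothetical, and the threshold τ mentioned in the quote is not modelled because the excerpt does not say what it is applied to.

```python
def precision(extracted, validated):
    """Fraction of extracted terms that were judged valid."""
    extracted = set(extracted)
    if not extracted:
        return 0.0
    return len(extracted & set(validated)) / len(extracted)


def recall(extracted, reference):
    """Fraction of reference-list terms that the system extracted."""
    reference = set(reference)
    if not reference:
        return 0.0
    return len(set(extracted) & reference) / len(reference)


if __name__ == "__main__":
    # Toy data, invented purely for illustration.
    extracted_terms = ["index phrase", "key phrase", "document", "the system"]
    validated_terms = ["index phrase", "key phrase", "document"]
    reference_terms = ["index phrase", "key phrase", "term extraction"]
    print(precision(extracted_terms, validated_terms))
    print(recall(extracted_terms, reference_terms))
```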