1984
DOI: 10.1145/358027.358048
|View full text |Cite
|
Sign up to set email alerts
|

Automatic spelling correction in scientific and scholarly text

Abstract: An automatic spelling correcting algorithm corrects most of the 50,000 misspellings culled from 25,000,000 words of text from seven scientific and scholarly databases. It uses a similarity key to identify words in a large dictionary that are most similar to a particular misspelling, and'then an error-reversal test to select from these the most plausible correction(s).

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
66
1
4

Year Published

1997
1997
2014
2014

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 133 publications
(72 citation statements)
references
References 12 publications
1
66
1
4
Order By: Relevance
“…Noteworthy among the error-correction systems based on nonphonetic coding is SPEEDCOP (Spelling Error Detection-Correction Project) (Pollock, 1982;Pollock & Zamora, 1984), currently used in the databases of Chemical Abstracts Services. The nucleus of this system is an algorithm designed to correct misspellings that consist of a single error, and a dictionary where the correct forms are stored.…”
Section: Similarity Relationsmentioning
confidence: 99%
See 2 more Smart Citations
“…Noteworthy among the error-correction systems based on nonphonetic coding is SPEEDCOP (Spelling Error Detection-Correction Project) (Pollock, 1982;Pollock & Zamora, 1984), currently used in the databases of Chemical Abstracts Services. The nucleus of this system is an algorithm designed to correct misspellings that consist of a single error, and a dictionary where the correct forms are stored.…”
Section: Similarity Relationsmentioning
confidence: 99%
“…Its method of nonphonetic coding captures the essence of the correct words as well as that of the misspellings, and a word is corrected when the compared codes collate very closely. The codes designated as keys can be of two types: "Skeleton keys" and "Omission keys" (Pollock & Zamora, 1984). The SPEEDCOP correction strategy is based on the creation of a key of similarity for each word of the dictionary and its classification in alphabetic order.…”
Section: Similarity Relationsmentioning
confidence: 99%
See 1 more Smart Citation
“…Other strategies that might be considered by a server implementor are matches based on substring, suffix, regular expressions, soundex [KNUTH73], and Levenshtein [PZ85] algorithms. These last two are especially useful for correcting spelling errors.…”
Section: The Match Commandmentioning
confidence: 99%
“…"), then the word will be matched using a server-dependent default strategy, which should be the best strategy available for interactive spell checking. This is usually a derivative of the Levenshtein algorithm [PZ85].…”
Section: The Match Commandmentioning
confidence: 99%