Proceedings of the 26th Annual Meeting on Association for Computational Linguistics - 1988
DOI: 10.3115/982023.982049
|View full text |Cite
|
Sign up to set email alerts
|

Lexicon and grammar in probabilistic tagging of written English

Abstract: The paper describes the development of software for automatic grammatical ana]ysi$ of unl~'Ui~, unedited English text at the Unit for Compm= Research on the Ev~li~h Language (UCREL) at the Univet~ of Lancaster. The work is ~n'nmtly funded by IBM and carried out in collaboration with colleagues at IBM UK (W'~) and IBM Yorktown Heights. The paper will focus on the lexicon component of the word raging system, the UCREL grammar, the datal~zlks of parsed sentences, and the tools that have been written to support de… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0

Year Published

1991
1991
2012
2012

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 8 publications
0
2
0
Order By: Relevance
“…We use the "treebank" data described in Beale (1988). It contains 42,186 sentences (about one million words) from the Associated Press.…”
Section: Text Datamentioning
confidence: 99%
“…We use the "treebank" data described in Beale (1988). It contains 42,186 sentences (about one million words) from the Associated Press.…”
Section: Text Datamentioning
confidence: 99%
“…The use more diversified and complex examples contained in large corpora was also explored (Klein & Simmons, 1963). Probabilistic methods were also used in assigning grammatical codes to different words in the corpora (Beale, 1988;Bahl & Mercer, 1976).…”
Section: Related Workmentioning
confidence: 99%
“…The tagging is a 76 tag projection of the set of 159 tags originally used in conjunction with a treebanking effort at Lancaster University. For more details, see Beale (1988). tagged sentences.…”
Section: Training Hmm Taggers With Saum-welchmentioning
confidence: 99%