[Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing 1991
DOI: 10.1109/icassp.1991.150460
|View full text |Cite
|
Sign up to set email alerts
|

Tagging text with a probabilistic model

Abstract: In this paper we present some experiments on the use of a probabilistic model to tag English text, i.e. to assign to each word the correct tag (part of speech) in the context of the sentence. The main novelty of these experiments is the use of untagged text in the training of the model. We have used a simple triclass Marlcov model and are looking for the best way to estimate the parameters of this model, depending on the kind and amount of training data provided. Two approaches in particular are compared and c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

8
253
0
1

Year Published

1995
1995
2022
2022

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 192 publications
(262 citation statements)
references
References 12 publications
8
253
0
1
Order By: Relevance
“…We used a smoothing factor of 0.125. See [10] for the application of smoothing in a similar problem. Other methods like Laplace's rule may be used instead [1].…”
Section: Modifying Production Probabilitiesmentioning
confidence: 99%
“…We used a smoothing factor of 0.125. See [10] for the application of smoothing in a similar problem. Other methods like Laplace's rule may be used instead [1].…”
Section: Modifying Production Probabilitiesmentioning
confidence: 99%
“…This is the case of the Xerox tagger (Cutting et al, 1992) and its successors. Those interested in the subject can find an excellent overview by Merialdo (1994).…”
Section: Existing Approaches To Pos Taggingmentioning
confidence: 99%
“…In POS tagging we find the Baum-Welch re-estimation algorithm which has been successfully used to improve tagger accuracies when limited disambiguated material is available (Cutting et al, 1992;Elworthy, 1994;Merialdo, 1994). Brill (1995) presented a weak-supervised version of the transformation-based learning algorithm for tagging.…”
Section: Tagging the Lexesp Spanish Corpusmentioning
confidence: 99%
“…Many proposed solutions are based on Hidden Markov models (HMMs), with various improvements obtainable through: inductive bias in the form of tag dictionaries (Kupiec, 1992;Merialdo, 1994), sparsity constraints (Lee et al, 2010), careful initialization of parameters (Goldberg et al, 2008), feature based representations (Berg-Kirkpatrick et al, 2010;Smith and Eisner, 2005), and priors on model parameters (Johnson, 2007;Goldwater and Griffiths, 2007;Blunsom and Cohn, 2011, inter alia).…”
Section: Introductionmentioning
confidence: 99%