Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing - HLT '05 2005
DOI: 10.3115/1220575.1220662
|View full text |Cite
|
Sign up to set email alerts
|

Maximum expected F-measure training of logistic regression models

Abstract: We consider the problem of training logistic regression models for binary classification in information extraction and information retrieval tasks. Fitting probabilistic models for use with such tasks should take into account the demands of the taskspecific utility function, in this case the well-known F-measure, which combines recall and precision into a global measure of utility. We develop a training procedure based on empirical risk minimization / utility maximization and evaluate it on a simple extraction… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
64
0

Year Published

2006
2006
2019
2019

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 71 publications
(64 citation statements)
references
References 21 publications
0
64
0
Order By: Relevance
“…Firstly, if the subordinate is in SBV or VOB, and also the core word in ATT is function word or a noun, then they're a lot of possible to be a feature, as a result of the options ar sometimes nouns and pronouns. Secondly, the parts, that have coordinating relation (COO) [6] with the words known in previous step, ar known because the options further. Finally, pronouns are replaced by the previous noun that's known as another feature prevalence.…”
Section: International Journal For Research In Applied Science and Engimentioning
confidence: 99%
“…Firstly, if the subordinate is in SBV or VOB, and also the core word in ATT is function word or a noun, then they're a lot of possible to be a feature, as a result of the options ar sometimes nouns and pronouns. Secondly, the parts, that have coordinating relation (COO) [6] with the words known in previous step, ar known because the options further. Finally, pronouns are replaced by the previous noun that's known as another feature prevalence.…”
Section: International Journal For Research In Applied Science and Engimentioning
confidence: 99%
“…After that, the model with determined parameters is used to calculate classifier output for a test instance. Algorithms from this group are commonly based on structured SVMs [14,37], thresholding strategies [38][39][40] or regression [27]. The structured output SVM is a generalization of classical SVM algorithm [48].…”
Section: Related Workmentioning
confidence: 99%
“…We trained both variants with back-propagation (Rumelhart et al, 1986) and gradient ascent. In particular, we optimized F β using a variant of Jansche's (2005) derivation of F β -optimized logistic regression (see suplementary material 5 for full derivation).…”
Section: Weighted Edge Modelmentioning
confidence: 99%