2015
DOI: 10.1080/0952813x.2015.1057239
|View full text |Cite
|
Sign up to set email alerts
|

A bootstrapping method for development of Treebank

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 11 publications
0
3
0
Order By: Relevance
“…In this approach, a pretrained tagger is used to tag the words of the sentences which may then be manually corrected and included in the gold standard training corpus. The tagger is then retrained using this corpus to label next batch of sentences and this process of retraining and correction continues iteratively until some termination criteria is met [17]. Since this work builds upon the work of [3], same 500 preprocessed tweets (13,643 tokens) used in [3] were selected for performing bootstrapping experiments.…”
Section: Methodsmentioning
confidence: 99%
“…In this approach, a pretrained tagger is used to tag the words of the sentences which may then be manually corrected and included in the gold standard training corpus. The tagger is then retrained using this corpus to label next batch of sentences and this process of retraining and correction continues iteratively until some termination criteria is met [17]. Since this work builds upon the work of [3], same 500 preprocessed tweets (13,643 tokens) used in [3] were selected for performing bootstrapping experiments.…”
Section: Methodsmentioning
confidence: 99%
“…To induce classifiers from training data, MaltParser uses two different built-in learning libraries: liblinear and libsvm. Liblinear library supports linear classification [24], and libsvm is for Support Vector Machines [23].…”
Section: Parsermentioning
confidence: 99%
“…Researchers need reliable strategies for speeding up the annotation process without biassing the gold standard [8]. Bootstrapping is a novel approach that attempts to iteratively generating treebanks in an efficient and cost-effective manner [23]. During bootstrapping process, a trained parser is used to pre-parse raw text, which is then manually corrected by human annotators [9].…”
Section: Introductionmentioning
confidence: 99%