2016
DOI: 10.1007/978-3-662-49390-8_58
|View full text |Cite
|
Sign up to set email alerts
|

Improving Efficiency of Sentence Boundary Detection by Feature Selection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 9 publications
0
5
0
Order By: Relevance
“…Therefore, researchers need to understand a language well to apply rule-based approaches. In the past several years, DL models have been successfully applied to many sequential labeling and classification tasks, such as SBD, speech recognition, WS, and POS tagging and chunking [4,5] . DL models can learn a hierarchy of nonlinear feature detectors to capture complex statistical patterns [6] .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Therefore, researchers need to understand a language well to apply rule-based approaches. In the past several years, DL models have been successfully applied to many sequential labeling and classification tasks, such as SBD, speech recognition, WS, and POS tagging and chunking [4,5] . DL models can learn a hierarchy of nonlinear feature detectors to capture complex statistical patterns [6] .…”
Section: Introductionmentioning
confidence: 99%
“…Modern Tibetan is composed of letters, including 30 consonant letters and four vowel letters, and consonant letters and vowel letters are arranged separately in the alphabet. The 30 consonant letters are divided into eight groups, which are (1) [" ", " ", " ", " "], (2) [" ", " ", " ", " "], (3) [" ", " ", " ", " "], (4) [" ", " ", " ", " "], (5) [" ", " ", " ", " "], (6) [" ", " ", " ", " "], (7) [" ", " ", " ", " "], and (8) [" ", " "]. The four vowel letters are " ", " ", " ", and " ".…”
Section: Introductionmentioning
confidence: 99%
“…This thesis studies two approaches to improve the lexical input feature space for SUD systems. The first approach proposes a PMI-based feature selection method to select only informative features for a shallow learning (CRF-based) SUD system [22]. The second approach examines various strategies to train word embedding models which are used to extract feature for a deep learning based SUD system [23].…”
Section: Contributionsmentioning
confidence: 99%
“…The novel PMI-based feature selection method [22]: The approach investigates the effect of contextual features to the SUD task and proposes a feature selection method using PMI to reduce the input feature space. Here, the surrounding words of each label being predicted are used as the contextual features to the SUD system.…”
Section: Contributionsmentioning
confidence: 99%
See 1 more Smart Citation