Proceedings of the 2016 International Conference on Advanced Electronic Science and Technology (AEST 2016) 2016
DOI: 10.2991/aest-16.2016.19
|View full text |Cite
|
Sign up to set email alerts
|

Character information extraction based on CRFsuite

Abstract: Abstract. By applying the Conditional Random Fields based on discriminant undirected graph to character information extraction, this paper proposes an automation character information extraction method based on CRFsuite. Through learning the known domain, this method extracts the feature leading words, position and means from the character information in the Internet to build up a character parameter. By using CRFsuite as a model, the method adopts it to data from the Internet, matches character information an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 4 publications
0
1
0
Order By: Relevance
“…For sentence tokenization, we trained a conditional random field (CRF) model, using pythoncrfsuite (Peng and Korobov, 2014), on translated TED transcripts and Thai sentence boundaries are assumed to be denoted by English sentence boundaries (Lowphansirikul et al, 2021b).…”
mentioning
confidence: 99%
“…For sentence tokenization, we trained a conditional random field (CRF) model, using pythoncrfsuite (Peng and Korobov, 2014), on translated TED transcripts and Thai sentence boundaries are assumed to be denoted by English sentence boundaries (Lowphansirikul et al, 2021b).…”
mentioning
confidence: 99%