2021
DOI: 10.34088/kojose.871873
|View full text |Cite
|
Sign up to set email alerts
|

Named Entity Recognition in Turkish Bank Documents

Abstract: Named Entity Recognition (NER) is the process of automatically recognizing entity names such as person, organization, and date in a document. In this study, we focus on bank documents written in Turkish and propose a Conditional Random Fields (CRF) model to extract named entities. The main contribution of this study is twofold: (i) we propose domain-specific features to extract entity names such as law, regulation, and reference which frequently appear in bank documents; and (ii) we contribute to NER research… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 18 publications
0
2
0
Order By: Relevance
“…Surprisingly, topic-based KE tools like TopicRank and MultipartiteRank, where topics are viewed as nodes in a graph, do well on datasets with both short and long texts. Moreover, KE tools that incorporate word position as a feature, such as PositionRank, TextRank, and YAKE, tend to yield suboptimal results when applied to datasets where keywords are not evenly distributed throughout the text [76].…”
Section: ) Data Typementioning
confidence: 99%
“…Surprisingly, topic-based KE tools like TopicRank and MultipartiteRank, where topics are viewed as nodes in a graph, do well on datasets with both short and long texts. Moreover, KE tools that incorporate word position as a feature, such as PositionRank, TextRank, and YAKE, tend to yield suboptimal results when applied to datasets where keywords are not evenly distributed throughout the text [76].…”
Section: ) Data Typementioning
confidence: 99%
“…In the process of identifying named entities, the language in which the text document is written is very important. In the case of languages with an extensive morphological structure, the processing of text documents and the extraction of information is very difficult [12]. Therefore, Graliński et al [13] presented a formalism for the rule-based NER.…”
Section: Introduction and Related Workmentioning
confidence: 99%