2017
DOI: 10.1007/978-3-319-53640-8_10

Keyword-Based Search on Bilingual Digital Libraries

Cited by 12 publications
(11 citation statements)
References 4 publications
“…For systematic development of raw material terminology, textual resources, namely, bilingual libraries and corpora are also needed. Thus, articles from the scientific journal Underground Mining, published both in Serbian and English, stored in the bilingual digital library Bibliša, as one of the collections of aligned English-Serbian bi-texts [29,30], were also used in our approach.…”
Section: Digital Resources In Raw Materials Domain
Citation type: mentioning
confidence: 99%
“…However, when it comes to terms in the form of terminological phrases, their extraction from corpora becomes much more complicated. Automatic extraction of term candidates for Serbian relies on a procedure presented in [30,34]. Essentially, it is based on detecting words in corpora that follow one of the 23 specific syntactic patterns, most frequent for noun terms (AN adjective-noun, NNg noun-noun in genitive case, AAN, .…”
Section: Adding New Serbian Terms To General Lexica Dictionaries
Citation type: mentioning
confidence: 99%
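
The pattern-based step described in the statement above can be illustrated with a minimal sketch. It assumes the input is already tokenized and tagged with three coarse, hypothetical tags (`A` adjective, `N` noun, `Ng` noun in the genitive case) and covers only the three patterns named in the excerpt (AN, NNg, AAN), not the full set of 23. The function name `extract_candidates`, the tag set, and the hand-tagged sample phrase are illustrative assumptions; this is not the procedure from the cited papers.

```python
from typing import Dict, List, Tuple

# Hypothetical coarse tags: "A" = adjective, "N" = noun, "Ng" = noun in genitive.
# Only the three patterns named in the excerpt are listed; the rest are omitted.
PATTERNS: Dict[str, Tuple[str, ...]] = {
    "AN": ("A", "N"),
    "NNg": ("N", "Ng"),
    "AAN": ("A", "A", "N"),
}


def slot_matches(tag: str, slot: str) -> bool:
    """A genitive noun is still a noun, so "Ng" may fill a plain "N" slot."""
    return tag == slot or (slot == "N" and tag == "Ng")


def extract_candidates(tokens: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Return (pattern_name, phrase) for every window of tokens matching a pattern."""
    candidates = []
    for start in range(len(tokens)):
        for name, slots in PATTERNS.items():
            window = tokens[start:start + len(slots)]
            if len(window) < len(slots):
                continue  # not enough tokens left for this pattern
            if all(slot_matches(tag, slot) for (_, tag), slot in zip(window, slots)):
                candidates.append((name, " ".join(word for word, _ in window)))
    return candidates


# Hand-tagged illustrative input (an assumption, not data from the cited work).
sample = [("podzemna", "A"), ("eksploatacija", "N"), ("uglja", "Ng")]
print(extract_candidates(sample))
```

A real system would take its tags from a morphological dictionary and would also check agreement between adjective and noun, which this sketch leaves out.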
“…For this reason, we have chosen to include multi-word units automatically extracted from the domain corpus using syntax graphs developed within the Unitex program. By using the tool Leximir, term candidates were extracted according to pre-defined syntactic patterns (Stanković et al, 2016b) (Krstev et al, 2015); the lexical units were then lemmatized to unify all the occurrences of multi-word lemmas. Since this kind of lemmatization can lead to ambiguity, we employed different strategies to resolve these issues.…”
Section: Multi-word Term Extraction In Serbian
Citation type: mentioning
confidence: 99%
“…Secondly, several statistical measures were implemented to rank the term candidates. Finally, term candidates were evaluated and selected for a terminology dictionary (Stanković et al, 2016b). The 20 most frequent Serbian multiword terminological units in the corpus (shown in Table 5) are mostly noun phrases with an adjective as a premodifier (grf01, the adjective + noun pattern, marked with AXN, with adjective-noun agreement in gender, number and case), e.g.…”
Section: Multi-word Term Extraction In Serbian
Citation type: mentioning
confidence: 99%