2022
DOI: 10.3389/fphys.2022.820683
|View full text |Cite
|
Sign up to set email alerts
|

NLIMED: Natural Language Interface for Model Entity Discovery in Biosimulation Model Repositories

Abstract: Semantic annotation is a crucial step to assure reusability and reproducibility of biosimulation models in biology and physiology. For this purpose, the COmputational Modeling in BIology NEtwork (COMBINE) community recommends the use of the Resource Description Framework (RDF). This grounding in RDF provides the flexibility to enable searching for entities within models (e.g., variables, equations, or entire models) by utilizing the RDF query language SPARQL. However, the rigidity and complexity of the SPARQL … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
2

Relationship

4
0

Authors

Journals

citations
Cited by 4 publications
(8 citation statements)
references
References 40 publications
0
8
0
Order By: Relevance
“…e c is the average of ontology class feature embeddings; for example, FMA:54537 has a preferred label feature of ‘Astrocyte’ and a synonym feature of ‘Astrocytus’. There are other features, such as parent labels and definitions, but using the selected two features only can give a higher performance ( Munarko et al, 2022 ). For e p calculation, predicate terms in camelCase format are normalised to phrases in lowercase before converting to embeddings.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…e c is the average of ontology class feature embeddings; for example, FMA:54537 has a preferred label feature of ‘Astrocyte’ and a synonym feature of ‘Astrocytus’. There are other features, such as parent labels and definitions, but using the selected two features only can give a higher performance ( Munarko et al, 2022 ). For e p calculation, predicate terms in camelCase format are normalised to phrases in lowercase before converting to embeddings.…”
Section: Methodsmentioning
confidence: 99%
“…Working well with structured and unstructured text-based queries, Natural Language Interface for Model Entity Discovery (NLIMED) provides an interface to retrieve entities annotated compositely ( Munarko et al, 2022 ). It identifies phrases in the query associated with the physiological domain and links them to possible ontology classes and predicates.…”
Section: Introductionmentioning
confidence: 99%
“…We provide an interface to automatically convert user queries in natural language form to SPARQL. All sections have been published in Frontiers of Physiology [69].…”
Section: Repositoriesmentioning
confidence: 99%
“…e c is the average of ontology class feature embeddings, including preferred label embedding and synonym embedding (see Equation (4.1)). We do not use other features such as parent label and definition because using the preferred label and synonym alone can give a higher performance [69]. We apply w p between 0 to 1 in Equation 4.2 as a multiplier to limit the role of e p to the path embedding, so it does not exceed the ontology class embedding.…”
Section: Entity Embeddingmentioning
confidence: 99%
“…However, creating SPARQL is difficult as it requires good knowledge of annotation structure, syntax, and ontology terms. Tools to help generating SPARQL from natural languages, such as NLIMED, 11 are helpful. Still, it is more suitable for experts who already know the target entity, not for those doing exploratory searches like using commercial search engines.…”
Section: Introductionmentioning
confidence: 99%