The English-Xhosa Dictionary for Nurses (EXDN) is a bilingual, unidirectional printed dictionary in the public domain, with English and isiXhosa as the language pair. By extending the digitisation efforts of EXDN from a human-readable digital object to a machine-readable state, using Resource Description Framework (RDF) as the data model, semantically interoperable structured data can be created, thus enabling EXDN's data to be reused, aggregated and integrated with other language resources, where it can serve as a potential aid in the development of future language resources for isiXhosa, an under-resourced language in South Africa. The methodological guidelines for the construction of a Linguistic Linked Data framework (LLDF) for a lexicographic resource, as applied to EXDN, are described, where an LLDF can be defined as a framework: (1) which describes data in RDF, (2) using a model designed for the representation of linguistic information, (3) which adheres to Linked Data principles, and (4) which supports versioning, allowing for change. The result is a bidirectional lexicographic resource, previously bounded and static, now unbounded and evolving, with the ability to extend to multilingualism.Information 2018, 9, 274 2 of 30 into a simple digital resource, typically as images in JPEG format. However, by retrodigitising the artefact, converting it from a simple digital resource to a complex digital resource, it has the potential to become machine-interoperable. The form of complex digital objects can vary, with examples including: a collection of XML files; the same content but as HTML files with semantic markup to describe the content therein; as a dataset stored in a relational database or as Resource Description Framework (RDF). For the latter example, RDF "is a framework for representing information in the Web", published by the World Wide Web Consortium (W3C), where data is described in short statements, called triples, which are of the form subject-predicate-object [9]. A subject can be an internationalised resource identifier (IRI) or a blank node; an object can be an IRI, blank node, or a literal; and a predicate can be an IRI only [9].The lexical entry Abdomen can be expressed in the following short statements:Abdomen is a lexical entry.Abdomen is a word.Abdomen is a noun.Abdomen is an English term.Using Turtle syntax, a human-readable serialisation of RDF, the same short statements can be described in RDF: SubjectPredicate Object :abdomen :isA :lexicalEntry ; :isA :word ; :isA :noun ; :isLanguage :English .These short statements are visualised in Figure 1, using node-edge-node structure. Information 2018, 9, x FOR PEER REVIEW 2 of 29Despite EXDN being published more than seventy-five years ago, as a language resource for an under-resourced indigenous African language, its content is still of value [1] (p. 1). EXDN is in print form, bounded and static, and by digitising the dictionary, it is converted from an analogue resource into a simple digital resource, typically as images in JPEG format. Howev...
The Multilingual Semantic Web has been in focus for over a decade. Multilingualism in Linked Data and RDF has shown substantial adoption, but this is unclear for ontologies since the last review 15 years ago. One of the design goals for OWL was internationalisation, with the aim that an ontology is usable across languages and cultures. Much research to improve on multilingual ontologies has taken place in the meantime, and presumably multilingual linked data could use multilingual ontologies. Therefore, this review seeks to (i) elucidate and compare the modelling options for multilingual ontologies, (ii) examine extant ontologies for their multilingualism, and (iii) evaluate ontology editors for their ability to manage a multilingual ontology.Nine different principal approaches for modelling multilinguality in ontologies were identified, which fall into either of the following approaches: using multilingual labels, linguistic models, or a mapping-based approach. They are compared on design by means of an ad hoc visualisation mode of modelling multilingual information for ontologies, shortcomings, and what issues they aim to solve. For the ontologies, we extracted production-level and accessible ontologies from BioPortal and the LOV repositories, which had, at best, 6.77% and 15.74% multilingual ontologies, respectively, where most of them have only partial translations and they all use a labels-based approach only. Based on a set of nine tool requirements for managing multilingual ontologies, the assessment of seven relevant ontology editors showed that there are significant gaps in tooling support, with VocBench 3 nearest of meeting them all. This stock-taking may function as a new baseline and motivate new research directions for multilingual ontologies.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.