Knowledge graphs are being increasingly adopted in industry in order to add meaning to data and improve the intelligence of data analytics methods. Simple Knowledge Management System (SKOS) is a W3C standard for representation of knowledge graphs in a web-native and machine-understandable format. This paper introduces SKOS Tool; a web-based application developed at the Engineering Informatics Lab at Texas State University. It can be used for creating knowledge graphs and concept schemes based on the SKOS standard. The main feature and functions of SKOS Tool are described in this paper. Beyond creating knowledge graphs, SKOS Tool has additional features that can be used to support semantic document classification based on the Bag of Concepts technique. To demonstrate the utilities of SKOS Tool, a use case related to classifications of manufacturing suppliers with Medical Grade Polymer Tubing capabilities is presented.
The unstructured data available on the websites of manufacturing suppliers and contractors can provide valuable insights into their technological and organizational capabilities. However, since the capability data are often represented in an unstructured and informal fashion using natural language text, it is not easy to efficiently search and analyze the capability data and learn from it. The objective of this work is to propose framework to enable automated classification and ranking of suppliers based on their online capability descriptions in the context of a supplier search and discovery use case. The proposed text analytics methods used in this work are supported by a formal thesaurus that uses SKOS (Simple Knowledge Organization System) that provides lexical and structural semantics. Normalized Google Distance (NGD) is used as the metric for measuring the relatedness of terms when ranking suppliers based on their similarities with the queried capabilities. The proposed framework is validated experimentally using a hypothetical supplier search scenario. The results indicate that the generated ranked list is highly correlated with human judgment, especially when the search space is partitioned into multiple classes of suppliers with distinct capabilities. However, the correlation decreases when multiple overlapping classes of suppliers are merged together to form a heterogenous search space. The proposed framework can support supplier screening and discovery solutions by improving the precision, reliability, and intelligence of their underlying search engines.
Objective: This study aimed to develop a natural language processing algorithm (NLP) using machine learning (ML) and Deep Learning (DL) techniques to identify and classify documentation of suicidal behaviors in patients with Alzheimer disease and related dementia (ADRD). Materials and Methods: We utilized MIMIC-III and MIMIC-IV datasets and identified ADRD patients and subsequently those with suicide ideation using relevant International Classification of Diseases (ICD) codes. We used cosine similarity with ScAN (Suicide Attempt and Ideation Events Dataset) to calculate semantic similarity scores of ScAn with extracted notes from MIMIC for the clinical notes. The notes were sorted based on these scores, and manual review and categorization into eight suicidal behavior categories were performed. The data were further analyzed using conventional ML and DL models, with manual annotation as a reference. Results: The tested classifiers achieved classification results close to human performance with up to 98% precision and 98% recall of suicidal ideation in the ADRD patient population. Discussion: Our NLP model effectively reproduced human annotation of suicidal ideation within the MIMIC dataset. These results establish a foundation for identifying and categorizing documentation related to suicidal ideation within the ADRD population, contributing to the advancement of NLP techniques in healthcare for extracting and classifying clinical concepts, particularly focusing on suicidal ideation among patients with ADRD. Conclusion: Our study showcased the capability of a robust NLP algorithm to accurately identify and classify documentation of suicidal behaviors in ADRD patients.
The unstructured data available on the websites of manufacturing suppliers can provide useful insights into the technological and organizational capabilities of manufacturers. However, since the data is often represented in an unstructured form using natural language text, it is difficult to efficiently search and analyze the capability data and learn from it. The objective of this work is to propose a set of text analytics techniques to enable automated classification and ranking of suppliers based on their capability narratives. The supervised classification and semantic similarity measurement methods used in this research are supported by a formal thesaurus that uses SKOS (Simple Knowledge Organization System) for its syntax and semantics. Normalized Google Distance (NGD) was used as a metric for measuring the relatedness of terms. The proposed framework was validated experimentally using a hypothetical search scenario. The results indicate that the generated ranked list shows a high correlation with human judgment specially if the query concept vector and supplier concept vector belong to the same class. However, the correlation decreases when multiple overlapping classes of suppliers are mixed together. The findings of this research can be used to improve the precision and reliability of Capability Language Processing (CLP) tools and methods.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.