We introduce a new framework to evaluate and improve first-order (FO)
ontologies using automated theorem provers (ATPs) on the basis of competency
questions (CQs). Our framework includes both the adaptation of a methodology
for evaluating ontologies to the framework of first-order logic and a new set
of non-trivial CQs designed to evaluate FO versions of SUMO, which
significantly extends the very small set of CQs proposed in the literature.
Most of these new CQs have been automatically generated from a small set of
patterns and the mapping of WordNet to SUMO. Applying our framework, we
demonstrate that Adimen-SUMO v2.2 outperforms TPTP-SUMO. In addition, using the
feedback provided by ATPs we have set an improved version of Adimen-SUMO
(v2.4). This new version outperforms the previous ones in terms of competency.
For instance, "Humans can reason" is automatically inferred from Adimen-SUMO
v2.4, while it is neither deducible from TPTP-SUMO nor Adimen-SUMO v2.2.Comment: 8 pages, 2 table
Formal ontologies are axiomatizations in a logic-based formalism. The development of formal ontologies is generating considerable research on the use of automated reasoning techniques and tools that help in ontology engineering. One of the main aims is to refine and to improve axiomatizations for enabling automated reasoning tools to efficiently infer reliable information. Defects in the axiomatization cannot only cause wrong inferences, but can also hinder the inference of expected information, either by increasing the computational cost of or even preventing the inference. In this paper, we introduce a novel, fully automatic white-box testing framework for first-order logic (FOL) ontologies. Our methodology is based on the detection of inference-based redundancies in the given axiomatization. The application of the proposed testing method is fully automatic since (i) the automated generation of tests is guided only by the syntax of axioms and (ii) the evaluation of tests is performed by automated theorem provers (ATPs). Our proposal enables the detection of defects and serves to certify the grade of suitability—for reasoning purposes—of every axiom. We formally define the set of tests that are (automatically) generated from any axiom and prove that every test is logically related to redundancies in the axiom from which the test has been generated. We have implemented our method and used this implementation to automatically detect several non-trivial defects that were hidden in various FOL ontologies. Throughout the paper we provide illustrative examples of these defects, explain how they were found and how each proof—given by an ATP—provides useful hints on the nature of each defect. Additionally, by correcting all the detected defects, we have obtained an improved version of one of the tested ontologies: Adimen-SUMO.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.