2009
DOI: 10.1142/s0218213009000354
|View full text |Cite
|
Sign up to set email alerts
|

Ontology-Based Information Extraction From PDF Documents With Xonto

Abstract: Information extraction is of paramount importance in several real world applications in the areas of business, competitive and military intelligence because it enables to acquire information contained in unstructured documents and store them in structured forms. Unstructured documents have different internal encodings, one of the most diffused encoding is the visualization-oriented Adobe portable document format (PDF). Although several sophisticated and indeed complex approaches were proposed, they are still l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 23 publications
0
4
0
Order By: Relevance
“…The syntax is based on the intuitive logic programming. In particular, ontological constructs are partially derived from OntoDLP (Calimeri et al, 2003) (Ricca and Leone, 2007), whereas acquisition formalism are based on the XOnto language (Oro and Ruffolo, 2008) (Oro et Al, 2009). OntoDLP introduces many interesting features, including complex types, e.g.…”
Section: Mantra Languagementioning
confidence: 99%
See 1 more Smart Citation
“…The syntax is based on the intuitive logic programming. In particular, ontological constructs are partially derived from OntoDLP (Calimeri et al, 2003) (Ricca and Leone, 2007), whereas acquisition formalism are based on the XOnto language (Oro and Ruffolo, 2008) (Oro et Al, 2009). OntoDLP introduces many interesting features, including complex types, e.g.…”
Section: Mantra Languagementioning
confidence: 99%
“…Database and linguistic descriptors, which enable to extract and integrate data available in heterogeneous data sources, are presented in the following. Descriptors are founded on the basic idea described in (Oro and Ruffolo, 2008) and (Oro et Al, 2009) which describe a system for Information Extraction (IE) from PDF documents. It represents an approach founded on the idea that objects and classes of ontologies can be equipped by a set of rules that describe how recognize and extract objects contained into an external source.…”
Section: Data Integration Constructsmentioning
confidence: 99%
“…The information extraction task is performed by exploiting the semantic information extraction approach described in Section 5. Extracted information are stored as ontology instances Oro et al (2009a).…”
Section: An Example: Representing Ontologies and A Process Schemas Inmentioning
confidence: 99%
“…A prototype of the SD-KRF has been implemented by combining the JBPM engine JPDL (2011) and the XONTO system Oro et al (2009a). It is designed to follow a clinical processes life-cycle model based on 3 phases: processes and ontologies design and implementations, execution and monitoring, analysis.…”
Section: Implementation Issuesmentioning
confidence: 99%