Proceedings of the 2018 Conference of the North American Chapter Of the Association for Computational Linguistics: Hu 2018
DOI: 10.18653/v1/n18-3010
|View full text |Cite
|
Sign up to set email alerts
|

SystemT: Declarative Text Understanding for Enterprise

Abstract: The rise of enterprise applications over unstructured and semi-structured documents poses new challenges to text understanding systems across multiple dimensions. We present SystemT, a declarative text understanding system that addresses these challenges and has been deployed in a wide range of enterprise applications. We highlight the design considerations and decisions behind SystemT in addressing the needs of the enterprise setting. We also summarize the impact of SystemT on business and education.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 15 publications
0
5
0
Order By: Relevance
“…The core of the NLP extraction pipeline transforms textual patterns between ontological annotations into sets of RDF triples, which constitute a partial rule knowledge graph; it uses two domain-agnostic approaches: semantic role labeling 25 and semantic reasoning 24 . Semantic role labeling is based on a declarative information extraction system 58 that can identify actions and roles in a sentence (who is the agent, what is the theme or context of the action, if there are any conditionals, the polarity of the action, temporal information, etc.). We use the ontology definitions for the domain and range of properties to reason over the semantic roles, and to identify semantically compatible relation and entity/value pairs in a sentence.…”
Section: Extraction Of Rules From Textmentioning
confidence: 99%
“…The core of the NLP extraction pipeline transforms textual patterns between ontological annotations into sets of RDF triples, which constitute a partial rule knowledge graph; it uses two domain-agnostic approaches: semantic role labeling 25 and semantic reasoning 24 . Semantic role labeling is based on a declarative information extraction system 58 that can identify actions and roles in a sentence (who is the agent, what is the theme or context of the action, if there are any conditionals, the polarity of the action, temporal information, etc.). We use the ontology definitions for the domain and range of properties to reason over the semantic roles, and to identify semantically compatible relation and entity/value pairs in a sentence.…”
Section: Extraction Of Rules From Textmentioning
confidence: 99%
“…Braghin S et al (2019) [ 15 ], proposed a hybrid framework for the de-identification of health information for privacy protection. Their framework combines different annotators such as PRIMA Annotator [ 6 ], Apache OpenNLP, Stanford CoreNLP [ 65 ], SystemT [ 18 ], Advanced Care Insights, Extensibility, and Integration to maximize performance and increase generalization. Additionally, Jian Z et al (2017) [ 47 ] proposed a model for the de-identification of Chinese clinical text with less annotation effort using a cascaded method that combines rule-based and ML methods to identify Patient Health Information (PHI) and replace them with a corresponding safe surrogate in sentences with PHI.…”
Section: Challenges Identified and Scope Of Work In Healthcare Using ...mentioning
confidence: 99%
“…Information extraction has been addressed with different methods and techniques, including (i) only Conditional Random Fields (CRFs) models (FINKEL; GRENAGER; MANNING, 2005); (ii) character-level embeddings instead of whole words (LAMPLE et al, 2016), and (iii) rule-based approaches (CHITICARIU et al, 2018). Yet, recent studies (YADAV; BETHAR, 2018) show the benefits of ML-based approaches such as recurrent neural networks.…”
Section: Natural Language Processingmentioning
confidence: 99%