2004
DOI: 10.1017/s1351324904003493
|View full text |Cite
|
Sign up to set email alerts
|

The Talent system: TEXTRACT architecture and data model

Abstract: We present the architecture and data model for TEXTRACT, a document analysis framework for text analysis components. The framework and components have been deployed in research and industrial environments for text analysis and text mining tasks.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2006
2006
2015
2015

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(15 citation statements)
references
References 29 publications
(30 reference statements)
0
15
0
Order By: Relevance
“…In particular, Text2Policy first applies shallow parsing [32] that annotates sentences with phrases, clauses, and grammatical functions of phrases, such as subject, main verb, and object. For example, the shallow-parsing component parses ACP-1 in Figure 1 as [subject: An HCP] [main verb group: should not change] [object: a patient's account.].…”
Section: Example Of Acp Extractionmentioning
confidence: 99%
See 1 more Smart Citation
“…In particular, Text2Policy first applies shallow parsing [32] that annotates sentences with phrases, clauses, and grammatical functions of phrases, such as subject, main verb, and object. For example, the shallow-parsing component parses ACP-1 in Figure 1 as [subject: An HCP] [main verb group: should not change] [object: a patient's account.].…”
Section: Example Of Acp Extractionmentioning
confidence: 99%
“…The SPARCLE Policy Workbench [9,10,25,26] employs the shallow-parsing technique [32] to parse privacy rules and extract the elements of privacy rules based on a pre-defined syntax. These elements are then used to form policies in a structured form, so that policy authors can review it and then produce policies in a machine-readable form, such as EPAL [7] and XACML [3,33] with a privacy-policy profile.…”
Section: Related Workmentioning
confidence: 99%
“…In order to identify the elements in a policy rule, SPARCLE uses a shallow parser [18] that is based on the IBM Unstructured Information Management Architecture [11]. A shallow parser processes text in a number of stages, beginning with operations that use limited linguistic knowledge to identify syntactic structures such as nouns, noun phrases, verbs, verb groups, and modifying phrases.…”
Section: Natural Language Parsing In Sparclementioning
confidence: 99%
“…Thus we believed that restricting users to writing rules using these common formats would be easily learned and followed. SPARCLE uses a set of grammars that operate in a cascade [18] to add meta-data tags to the rules indicating where privacy elements start and stop. In the cascade the grammars execute in a particular order where the meta-tags inserted by earlier grammars can be referenced by later grammars.…”
Section: Grammar Designmentioning
confidence: 99%
“…The privacy policy is written initially in natural language. Then the workbench employs natural language shallow parsing technology [23] to parse the rules and extract the elements of the privacy rules using a defined syntax for privacy policy rules, grammars written specifically for privacy rules that reflect the defined syntax, and data dictionaries for policy domains (e.g., healthcare, banking, government). The policy rules and identified policy elements are then stored in a database.…”
Section: Research Purpose and Backgroundmentioning
confidence: 99%