Proceedings of the Workshop on Human Language Technology - HLT '93 1993
DOI: 10.3115/1075671.1075707
|View full text |Cite
|
Sign up to set email alerts
|

Development, implementation and testing of a discourse model for newspaper texts

Abstract: Texts of a particular type evidence a discernible, predictable schema. These schemata can be delineated, and as such provide models of their respective text-types which are of use in automatically structuring texts. We have developed a Text Structurer module which recognizes text-level structure for use within a larger information retrieval system to delineate the discourse-level organization of each document's contents. This allows those document components which are more likely to contain the type of informa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2000
2000
2007
2007

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 16 publications
(15 citation statements)
references
References 4 publications
0
15
0
Order By: Relevance
“…Enumerative systems enumerate all the subjects of interest (which may be organized hierarchically) whereas faceted systems define the A more common approach is to develop models based on elements being communicated by various document types, also referred to by Paice [88] and by Liddy and colleagues [89] (both citing van Dijk), as the document's superstructure. We could also consider this approach to be genre-based.…”
Section: Facets and Faceted Browsingmentioning
confidence: 99%
“…Enumerative systems enumerate all the subjects of interest (which may be organized hierarchically) whereas faceted systems define the A more common approach is to develop models based on elements being communicated by various document types, also referred to by Paice [88] and by Liddy and colleagues [89] (both citing van Dijk), as the document's superstructure. We could also consider this approach to be genre-based.…”
Section: Facets and Faceted Browsingmentioning
confidence: 99%
“…Moreover, there is a distinctive discourse or style [21]. Further, the collection of historical newspapers is distinctive because we have the complete history of news events rather than processing them as they are streamed.…”
Section: Layered Content and Community Modelsmentioning
confidence: 99%
“…Each filled template is a legal Cyc query. TextWise Corporation has been developing natural language information retrieval software primarily for news articles (Liddy, 1995). Teknowledge intends to use the TextWise KNOWledge base Information Tools (BGSfOW-IT) to supply many instances to Cyc of facts discovered from news stories.…”
Section: Crisis Management Integrationmentioning
confidence: 99%