2019
DOI: 10.1162/tacl_a_00261
|View full text |Cite
|
Sign up to set email alerts
|

SECTOR: A Neural Model for Coherent Topic Segmentation and Classification

Abstract: When searching for information, a human reader first glances over a document, spots relevant sections and then focuses on a few sentences for resolving her intention. However, the high variance of document structure complicates to identify the salient topic of a given section at a glance. To tackle this challenge, we present SECTOR, a model to support machine reading systems by segmenting documents into coherent sections and assigning topic labels to each section. Our deep neural network architecture learns a … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
94
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 67 publications
(94 citation statements)
references
References 41 publications
(59 reference statements)
0
94
0
Order By: Relevance
“…This approach places words with similar semantics nearby in vector space and allows queries with morphologic variations using subword representations. To find all possible aspects, we adopt prior work [4] and collect all section headings from the medical Wikipedia articles. These headings typically consist of 1-3 words and describe the main topic of a section.…”
Section: Aspect Spacementioning
confidence: 99%
See 1 more Smart Citation
“…This approach places words with similar semantics nearby in vector space and allows queries with morphologic variations using subword representations. To find all possible aspects, we adopt prior work [4] and collect all section headings from the medical Wikipedia articles. These headings typically consist of 1-3 words and describe the main topic of a section.…”
Section: Aspect Spacementioning
confidence: 99%
“…We adopt the architecture of SECTOR [4] and use bidirectional LSTMs to read the document sentence-by-sentence. We use a final dense layer (matrix W he and bias b e ) to produce the local discourse vectors δ 1...T for every sentence in D.…”
Section: Document Encodermentioning
confidence: 99%
“…Fetahu et al [9] extend Wikipedia articles on news events with up-to-date information from the web. Arnold et al [1,2] use sections on Wikipedia articles to (1) learn how to segment articles into different topics and (2) identify answer passages to biomedical questions.…”
Section: Related Work and Datasetsmentioning
confidence: 99%
“…We use a Wikipedia dump that is offered by the TREC Complex Answer Retrieval track, which exposes section and hyperlink information in a machine-readable format [8]. 1 If necessary, entity links for a given text passage can be readily created with an entity linking tool.…”
Section: Introductionmentioning
confidence: 99%
“…To complement the experimental evaluation in the original paper, we showcase three demonstrators to prove the usefulness of our approach in a real world scenario. The demos are available online using CORD-19 5 [18], WikiSection 6 [3], and Orphanet 7 [9] as their resource accordingly. Due to space constraints, we focus on the first system here.…”
Section: Introductionmentioning
confidence: 99%