Malte Ostendorff scite author profile

Malte Ostendorff

5Publications

103Citation Statements Received

71Citation Statements Given

How they've been cited

102

How they cite others

Affiliations

German Research Centre for Artificial Intelligence, University of Konstanz

Publications

Order By: Most citations

Aspect-based Document Similarity for Research Papers

Ostendorff¹,

Ruas²,

Blume³

et al. 2020

View full text Add to dashboard Cite

Traditional document similarity measures provide a coarse-grained distinction between similar and dissimilar documents. Typically, they do not consider in what aspects two documents are similar. This limits the granularity of applications like recommender systems that rely on document similarity. In this paper, we extend similarity with aspect information by performing a pairwise document classification task. We evaluate our aspect-based document similarity approach for research papers. Paper citations indicate the aspect-based similarity, i. e., the title of a section in which a citation occurs acts as a label for the pair of citing and cited paper. We apply a series of Transformer models such as RoBERTa, ELECTRA, XLNet, and BERT variations and compare them to an LSTM baseline. We perform our experiments on two newly constructed datasets of 172,073 research paper pairs from the ACL Anthology and CORD-19 corpus. According to our results, SciBERT is the best performing system with F1-scores of up to 0.83. A qualitative analysis validates our quantitative results and indicates that aspect-based document similarity indeed leads to more fine-grained recommendations.

show abstract

Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles

Ostendorff

Ruas

Schubotz

et al. 2020

View full text Add to dashboard Cite

Enriching BERT with Knowledge Graph Embeddings for Document Classification

Ostendorff¹,

Bourgonje²,

Berger³

et al. 2019

Preprint

View full text Add to dashboard Cite

In this paper we focus on the classification of books using short descriptive texts (cover blurbs) and additional metadata. Building upon BERT, a deep neural language model, we demonstrate how to combine text representations with metadata and knowledge graph embeddings, which encode author information. Compared to the standard BERT approach we achieve considerably better results for the classification task. For a more coarse-grained classification using eight labels we achieve an F1score of 87.20, while a detailed classification using 343 labels yields an F1-score of 64.70. We make the source code and trained models of our experiments publicly available.

show abstract

Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings

Ostendorff¹,

Rethmeier²,

Augenstein³

et al. 2022

Preprint

View full text Add to dashboard Cite

Towards an Open Platform for Legal Information

Ostendorff¹,

Blume²,

Ostendorff³

2020

View full text Add to dashboard Cite

Recent advances in the area of legal information systems have led to a variety of applications that promise support in processing and accessing legal documents. Unfortunately, these applications have various limitations, e. g., regarding scope or extensibility. Furthermore, we do not observe a trend towards open access in digital libraries in the legal domain as we observe in other domains, e. g., economics of computer science. To improve open access in the legal domain, we present our approach for an open source platform to transparently process and access Legal Open Data. This enables the sustainable development of legal applications by offering a single technology stack. Moreover, the approach facilitates the development and deployment of new technologies. As proof of concept, we implemented six technologies and generated metadata for more than 250,000 German laws and court decisions. Thus, we can provide users of our platform not only access to legal documents, but also the contained information.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.