Thomas Ruprecht scite author profile

Thomas Ruprecht

4Publications

10Citation Statements Received

76Citation Statements Given

How they've been cited

How they cite others

Affiliations

TU Dresden

Publications

Order By: Most citations

Lexicalization of Probabilistic Linear Context-free Rewriting Systems

Mörbitz

Ruprecht

2020

View full text Add to dashboard Cite

In the field of constituent parsing, probabilistic grammar formalisms have been studied to model the syntactic structure of natural language. More recently, approaches utilizing neural models gained lots of traction in this field, as they achieved accurate results at high speed. We aim for a symbiosis between probabilistic linear context-free rewriting systems (PLCFRS) as a probabilistic grammar formalism and neural models to get the best of both worlds: the interpretability of grammars, and the speed and accuracy of neural models. A combination of these two could be achieved by applying supertagging to PLCFRS. This approach requires lexical grammar formalisms. Here, we present a procedure which turns any PLCFRS G into an equivalent lexical PLCFRS G . Moreover, we show how the derivations in G can be transformed to obtain their corresponding original derivations in G. Our construction for G preserves the probability assignment and does not increase parsing complexity compared to G.

show abstract

Implementation of a

Ruprecht¹,

Denkinger²

2019

View full text Add to dashboard Cite

Constituent parsing has been studied extensively in the last decades. Chomsky-Schützenberger parsing as an approach to constituent parsing has only been investigated theoretically, yet. It uses the decomposition of a language into a regular language, a homomorphism, and a bracket language to divide the parsing problem into simpler subproblems. We provide the first implementation of Chomsky-Schützenberger parsing. It employs multiple context-free grammars and incorporates many refinements to achieve feasibility. We compare its performance to state-of-the-art grammar-based parsers.

show abstract

Supertagging-based Parsing with Linear Context-free Rewriting Systems

Ruprecht¹,

Mörbitz²

2021

View full text Add to dashboard Cite

We present the first supertagging-based parser for linear context-free rewriting systems (LCFRS). It utilizes neural classifiers and outperforms previous LCFRS-based parsers in both accuracy and parsing speed by a wide margin. Our results keep up with the best (general) discontinuous parsers, particularly the scores for discontinuous constituents establish a new state of the art. The heart of our approach is an efficient lexicalization procedure which induces a lexical LCFRS from any discontinuous treebank. We describe a modification to usual chart-based LCFRS parsing that accounts for supertagging and introduce a procedure that transforms lexical LCFRS derivations into equivalent parse trees of the original treebank. Our approach is evaluated on the English Discontinuous Penn Treebank and the German treebanks Negra and Tiger.

show abstract

Improving the Extraction of Supertags for Constituency Parsing with Linear Context-Free Rewriting Systems

Ruprecht¹

2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Thomas Ruprecht

Lexicalization of Probabilistic Linear Context-free Rewriting Systems

Implementation of a

Supertagging-based Parsing with Linear Context-free Rewriting Systems

Improving the Extraction of Supertags for Constituency Parsing with Linear Context-Free Rewriting Systems

Contact Info

Product

Resources

About