Although many works in the database community use open data in their experimental evaluation, repeating the empirical results of previous works remains a challenge. This holds true even if the source code or binaries of the tested algorithms are available. In this paper, we argue that providing access to the raw, original datasets is not enough. Real-world datasets are rarely processed without modification. Instead, the data is adapted to the needs of the experimental evaluation during the data preparation process. We show that the details of the data preparation process matter and that subtle differences during data conversion can have a large impact on runtime results. We introduce a data reproducibility model, identify three levels of data reproducibility, report on our own experience, and exemplify our best practices.
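To illustrate the kind of effect the abstract refers to, the following minimal sketch contrasts two hypothetical conversion pipelines (not the paper's actual preparation steps); applied to the same raw records, they produce datasets with different sizes and label distributions, which would in turn alter downstream runtime measurements.

```python
# Two seemingly equivalent ways to convert the same raw records can yield
# different datasets, and therefore different experimental results downstream.
# Both pipelines are illustrative assumptions, not the authors' method.

raw_records = [" Article ", "article", "ARTICLE\n", "book", " Book "]

def prepare_naive(records):
    # Conversion A: keep values as-is, only drop empty entries.
    return [r for r in records if r.strip()]

def prepare_normalized(records):
    # Conversion B: trim whitespace, case-fold labels, then deduplicate.
    return sorted({r.strip().lower() for r in records})

a = prepare_naive(raw_records)       # 5 values, mixed case, stray whitespace
b = prepare_normalized(raw_records)  # 2 distinct labels: ['article', 'book']

print(len(a), len(b))  # 5 2 -- same "raw data", very different label distribution
```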
Given a query tree Q, the top-k subtree similarity query retrieves the k subtrees in a large document tree T that are closest to Q in terms of tree edit distance. The classical solution scans the entire document, which is slow. The state-of-the-art approach precomputes an index to reduce the query time. However, the index is large (quadratic in the document size), building the index is expensive, updates are not supported, and data-specific tuning is required. We present a scalable solution for the top-k subtree similarity problem that does not assume specific data types, nor does it require any tuning. The key idea is to process promising subtrees first. A subtree is promising if it shares many labels with the query. We develop a new technique based on inverted lists that efficiently retrieves subtrees in the required order and supports incremental updates of the document. To achieve linear space, we avoid full list materialization but build relevant parts of a list on the fly. In an extensive empirical evaluation on synthetic and real-world data, our technique consistently outperforms the state-of-the-art index w.r.t. memory usage, indexing time, and the number of candidates that must be verified. In terms of query time, we clearly outperform the state of the art and achieve runtime improvements of up to four orders of magnitude.
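The sketch below conveys the "promising subtrees first" idea in its simplest form: rank candidate subtrees by the number of labels they share with the query, using an inverted list from label to subtree id. It is a simplified assumption-laden illustration; unlike the paper's technique, it materializes the full lists and ignores incremental updates, and all names are hypothetical.

```python
# Rank candidate subtrees by label overlap with the query (simplified sketch).
from collections import Counter, defaultdict

def build_inverted_lists(subtrees):
    # subtrees: dict mapping subtree id -> list of node labels
    index = defaultdict(set)
    for sid, labels in subtrees.items():
        for label in labels:
            index[label].add(sid)
    return index

def candidates_by_label_overlap(query_labels, index):
    # Count shared labels per subtree; return candidates, most promising first.
    overlap = Counter()
    for label in query_labels:
        for sid in index.get(label, ()):
            overlap[sid] += 1
    return [sid for sid, _ in overlap.most_common()]

subtrees = {1: ["a", "b", "c"], 2: ["a", "x"], 3: ["y", "z"]}
index = build_inverted_lists(subtrees)
print(candidates_by_label_overlap(["a", "b"], index))  # [1, 2] -- subtree 3 shares no label
```

Candidates produced in this order would then be verified with the tree edit distance, so that the most label-similar subtrees are checked before less promising ones.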