The CADE ATP System Competition — CASC

Sutcliffe, Geoff

doi:10.1609/aimag.v37i2.2620

Cited by 62 publications

(43 citation statements)

References 36 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In SMT-COMP 2016 there were 603 conflicts (solvers returning different results) on 73 benchmarks caused by three solvers giving incorrect results for various reasons. 5 In the CASC competition [25], there is a period of testing where soundness is checked and resolved, and there have been a number of solvers later disqualified from the competition due to unsoundness. In our experience, adding a new feature to a theorem prover is a highly complex task and it is easy to introduce unsoundness, or general incorrectness, especially in areas of the code that are encountered during proof search infrequently.…”

Section: Introductionmentioning

confidence: 99%

Testing a Saturation-Based Theorem Prover: Experiences and Challenges

2017

View full text Add to dashboard Cite

Abstract. This paper attempts to address the question of how best to assure the correctness of saturation-based automated theorem provers using our experience with developing the theorem prover Vampire. We describe the techniques we currently employ to ensure that Vampire is correct and use this to motivate future challenges that need to be addressed to make this process more straightforward and to achieve better correctness guarantees.

show abstract

Section: Introductionmentioning

confidence: 99%

Testing a Saturation-Based Theorem Prover: Experiences and Challenges

2017

View full text Add to dashboard Cite

show abstract

“…[13], often combined with an SS-portfolio approach. Leading competition versions of solvers for the "main" divisions of the first-order logic theorem proving competition CASC [27] namely E [24], iProver [16] and Vampire [18] are all SS-portfolio solver instances. E subsequently runs several different superposition strategies found by a machine learning approach.…”

Section: Participation Of Portfolios 2016mentioning

confidence: 99%

Do Portfolio Solvers Harm?

Weidenbach

EPiC Series in Computing

View full text Add to dashboard Cite

show abstract

“…The SAT [1], SMT-COMP [2] and CASC [21] competitions respectively focus on comparing SAT solvers, SMT solvers and automated theorem provers (ATPs). Each of them takes benefit of uniform common formats supported by every contestant tool (e.g.…”

Section: Related Workmentioning

confidence: 99%

Online Runtime Verification Competitions: How To Possibly Deal With Their Issues (position paper)

Signoles

Kalpa Publications in Computing

View full text Add to dashboard Cite

show abstract

The CADE ATP System Competition — CASC

Cited by 62 publications

References 36 publications

Testing a Saturation-Based Theorem Prover: Experiences and Challenges

Testing a Saturation-Based Theorem Prover: Experiences and Challenges

Do Portfolio Solvers Harm?

Online Runtime Verification Competitions: How To Possibly Deal With Their Issues (position paper)

Contact Info

Product

Resources

About