An Empirical Evaluation of Automated Theorem Provers in Software Certification


Denney, Ewen, Fischer, Bernd and Schumann, Johann (2006) An Empirical Evaluation of Automated Theorem Provers in Software Certification. International Journal on Artificial Intelligence Tools, 15 (1), pp. 81-107.

Description/Abstract

We describe a system for the automated certification of safety properties of NASA software. The system uses Hoare-style program verification technology to generate proof obligations which are then processed by an automated first-order theorem prover (ATP). We discuss the unique requirements this application places on the ATPs, focusing on automation, proof checking, traceability, and usability, and describe the resulting system architecture, including a certification browser that maintains and displays links between obligations and source code locations. For full automation, the obligations must be aggressively preprocessed and simplified, and we demonstrate how the individual simplification stages, which are implemented by rewriting, influence the ability of the ATPs to solve the proof tasks. Our results are based on 13 comprehensive certification experiments that lead to 366 top-level safety obligations and ultimately to more than 25,000 proof tasks which have been used to determine the suitability of the high-performance provers DCTP, E-Setheo, E, Gandalf, Otter, Setheo, Spass, and Vampire, and our associated infrastructure. The proofs found by Otter have been checked by Ivy.

Item Type: Article
Keywords: software certification, automated theorem proving, program synthesis, proof checking, traceability, verification condition generator, Hoare logic
Organisations: Electronic & Software Systems
ePrint ID: 262355
Date: February 2006 (Published)
Date Deposited: 12 Apr 2006
Last Modified: 23 Feb 2017 13:02
URI: http://eprints.soton.ac.uk/id/eprint/262355
