Learning to find proofs and theorems by learning to refine search strategies

Laurent, Jonathan; Platzer, André

We propose a new approach to automated theorem proving where an AlphaZero-style agent is self-training to refine a generic high-level expert strategy expressed as a nondeterministic program. An analogous teacher agent is self-training to generate tasks of suitable relevance and difficulty for the learner. This allows leveraging minimal amounts of domain knowledge to tackle problems for which training data is unavailable or hard to synthesize. As a specific illustration, we consider loop invariant synthesis for imperative programs and use neural networks to refine both the teacher and solver strategies.

Award ID(s): 1739629

PAR ID: 10374037

Author(s) / Creator(s): Laurent, Jonathan; Platzer, André

Editor(s): Agarwal, Alekh; Belgrave, Danielle; Cho, Kyunghyun; Oh, Alice

Date Published: 2022-11-28

Journal Name: Advances in Neural Information Processing Systems

Volume: 35

ISSN: 1049-5258

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
The DOI is not currently available.
