Improving the Diproche CNL through Autoformalization via Large Language Models

Merlin Carl
(Europa-Universität Flensburg)

The Diproche system is an automated proof checker for texts written in a controlled fragment of German, designed for didactical applications in classes introducing students to proofs for the first time. The first version of the system used a controlled natural language for which a Prolog formalization routine was written. In this paper, we explore the possibility of prompting large language models for autoformalization in the context of Diproche, with encouraging first results.

In Julien Narboux, Walther Neuper and Pedro Quaresma: Proceedings 12th International Workshop on Theorem proving components for Educational software (ThEdu'23), Rome, Italy, 5th July 2023, Electronic Proceedings in Theoretical Computer Science 400, pp. 44–58.
Published: 6th April 2024.

ArXived at: https://dx.doi.org/10.4204/EPTCS.400.4 bibtex PDF
References in reconstructed bibtex, XML and HTML format (approximated).
Comments and questions to: eptcs@eptcs.org
For website issues: webmaster@eptcs.org