Autoformalize Mathematical Statements by Symbolic Equivalence and Semantic Consistency

Publication type: Article, Preprint, Conference object

Date: 01 Jan 2024

Embargo end date: 01 Jan 2024

Publisher: Neural Information Processing Systems Foundation, Inc. (NeurIPS)

Journal: Advances in Neural Information Processing Systems 37

Authors: Zenan Li; Yifan Wu; Zhaoyu Li; Xinming Wei; Xian Zhang; Fan Yang; Xiaoxing Ma

doi: 10.52202/079017-1697, 10.48550/arxiv.2410.20936

arXiv: 2410.20936

Abstract

Autoformalization, the task of automatically translating natural language descriptions into a formal language, poses a significant challenge across various domains, especially in mathematics. Recent advances in large language models (LLMs) have revealed a promising ability to formalize even competition-level math problems. However, we observe a considerable discrepancy between pass@1 and pass@k accuracies in LLM-generated formalizations. To address this gap, we introduce a novel framework that scores and selects the best result from k autoformalization candidates based on two complementary self-consistency methods: symbolic equivalence and semantic consistency. Specifically, symbolic equivalence identifies the logical homogeneity among autoformalization candidates using automated theorem provers, and semantic consistency evaluates the preservation of the original meaning by informalizing the candidates and computing the similarity between the embeddings of the original and informalized texts. Our extensive experiments on the MATH and miniF2F datasets demonstrate that our approach significantly enhances autoformalization accuracy, achieving relative improvements of 0.22-1.35x across various LLMs and baseline methods.
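The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the `alpha` weight, and the weighted-sum combination of the two scores are assumptions made for illustration. The `toy_equivalent` check is a hypothetical stand-in for a call to an automated theorem prover, and the embeddings are assumed to come from any text-embedding model.

```python
import math
from typing import Callable, List, Sequence, Tuple


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either has zero norm)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def equivalence_classes(
    candidates: List[str], equivalent: Callable[[str, str], bool]
) -> List[List[int]]:
    """Group candidate indices into classes of mutually equivalent statements.

    Each candidate is compared only against the first member of each class,
    which assumes the equivalence check behaves transitively.
    """
    classes: List[List[int]] = []
    for i, cand in enumerate(candidates):
        for cls in classes:
            if equivalent(candidates[cls[0]], cand):
                cls.append(i)
                break
        else:
            classes.append([i])
    return classes


def select_best(
    candidates: List[str],
    equivalent: Callable[[str, str], bool],
    original_emb: Sequence[float],
    informalized_embs: List[Sequence[float]],
    alpha: float = 0.5,  # assumed weight between the two scores
) -> Tuple[int, List[float]]:
    """Return the index of the top-scoring candidate and all scores."""
    k = len(candidates)
    # Symbolic-equivalence score: fraction of candidates in the same class.
    sym = [0.0] * k
    for cls in equivalence_classes(candidates, equivalent):
        for i in cls:
            sym[i] = len(cls) / k
    # Semantic-consistency score: similarity of the original text's embedding
    # to the embedding of each candidate's informalized (back-translated) text.
    sem = [cosine(original_emb, e) for e in informalized_embs]
    # Assumed combination: a simple weighted sum of the two scores.
    scores = [alpha * s + (1 - alpha) * m for s, m in zip(sym, sem)]
    return max(range(k), key=scores.__getitem__), scores


def toy_equivalent(a: str, b: str) -> bool:
    """Hypothetical stand-in for a prover call: same characters, any order."""
    return sorted(a.replace(" ", "")) == sorted(b.replace(" ", ""))


if __name__ == "__main__":
    cands = ["x + 1 = 2", "1 + x = 2", "x = 5"]
    embs = [[1.0, 0.0], [0.6, 0.8], [0.0, 1.0]]
    best, scores = select_best(cands, toy_equivalent, [1.0, 0.0], embs)
    print(best, scores)
```

In this sketch, a candidate scores highly when many other candidates are equivalent to it and when its informalized text stays close to the original statement, which mirrors the two self-consistency signals named in the abstract.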

Published as a conference paper at NeurIPS 2024. Code is available at https://github.com/Miracle-Messi/Isa-AutoFormal

Related Organizations

University of Toronto, Canada
Hebei University, China (People's Republic of)
Peking University, China (People's Republic of)
Nanjing University, China (People's Republic of)

Keywords

FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)

Impact by BIP!

Selected citations: 3
Popularity: Top 10%
Influence: Average
Impulse: Average

Related to Research communities

UArctic