Kavli Affiliate: Ke Wang | First 5 Authors: Anjiang Wei, Jiannan Cao, Ran Li, Hongyu Chen, Yuhui Zhang | Summary: Equivalence checking, i.e., determining whether two programs produce identical outputs for all possible inputs, underpins a broad range of applications, including software refactoring, testing, and optimization. We present the task of equivalence checking as a […]
Continue.. EquiBench: Benchmarking Code Reasoning Capabilities of Large Language Models via Equivalence Checking