Kavli Affiliate: Ran Wang | First 5 Authors: Haiming Wang, Mert Unsal, Xiaohan Lin, Mantas Baksys, Junqi Liu | Summary: We introduce Kimina-Prover Preview, a large language model that pioneers a novel reasoning-driven exploration paradigm for formal theorem proving, as showcased in this preview release. Trained with a large-scale reinforcement learning pipeline from Qwen2.5-72B, Kimina-Prover […]
Continue.. Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning