論文 Hugging Face 発表: 2026-06-10 HF ↑61

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

著者: Jiacheng Chen, Xinyu Zhang, Shunkai Zhang, Yanmohan Wang, Lin Li ほか18名

要約

We present MaxProof, a population-level test-time scaling framework for competition-level mathematical proof in the MiniMax-M3 series. M3 first trains three proof-oriented capabilities — proof generation, proof verification, and critique-conditioned proof repair — using a defense-in-depth generati…

#rl

同じカテゴリの記事