論文 Hugging Face 発表: 2026-05-26 HF ↑38

Self-Improving Language Models with Bidirectional Evolutionary Search

Self-Improving Language Models with Bidirectional Evolutionary Search

著者: Guowei Xu, Zhenting Qi, Huangyuan Su, Weirui Ye, Himabindu Lakkaraju ほか2名

要約

Search has been proposed as an effective method for self-improving language models and agentic systems, both for post-training sample generation and for inference. However, widely used methods such as best-of-N sampling and tree search face two fundamental limitations: they are guided by sparse veri…

#agent#robotics#benchmark

同じカテゴリの記事