Self-Improving Language Models with Bidirectional Evolutionary Search
Abstract
Search has been proposed as an effective method for self-improving language models and agentic systems, both for post-training sample generation and for inference. However, widely used methods such as best-of-N sampling and tree search face two fundamental limitations: they are guided by sparse veri…