論文 arXiv 発表: 2026-05-21

AMEL: Accumulated Message Effects on LLM Judgments

著者: Sid-ali Temkit

要約

Large language models are routinely used as automated evaluators: to review code, moderate content, or score outputs, often with many items passing through one conversation. We ask whether the polarity of prior conversation history biases subsequent judgments, an effect we call the accumulated messa…

#llm#benchmark

AMEL: Accumulated Message Effects on LLM Judgments

要約

同じカテゴリの記事

Claw-SWE-Bench: A Benchmark for Evaluating OpenClaw-style Agent Harnesses on Coding Tasks

On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment

World-R1: テキストから動画生成における3D制約の強化学習による整合