← アーカイブ一覧
論文 深掘り arXiv 2026-06-04
Multimodal generative models produce fluent outputs but remain unreliable when generation must respect structured, domain-specific, or safety-critical knowledge. Existing methods incorporate knowledge through mechanisms such as prompt augmentation, guidance, latent editing, or fine-tuning, yet they ...
#multimodal#diffusion#fine-tuning#alignment
論文 深掘り arXiv 2026-06-04
Question answering (QA) systems have achieved notable progress with the advent of large language models (LLMs). However, they still face challenges in accurately extracting and generating precise answers from given contexts, particularly when dealing with complex or ambiguous queries. Existing appro...
#llm#fine-tuning#benchmark
企業動向 Hugging Face 2026-06-05
Thousand Token Wood: shipping a multi-agent economy on a 3B model...
#agent
企業動向 OpenAI 2026-06-04
Learn how Endava is using AI agents, ChatGPT Enterprise, and Codex to accelerate software delivery, automate workflows, and build an AI-native culture across the enterprise....
#agent
論文 深掘り arXiv 2026-06-04
As autonomous LLM agents increasingly hold real credentials and operate infrastructure without a human in the loop, operators have no standard way to tell an agent that a resource is off-limits. Access controls either let the agent in (it has valid credentials) or hard-fail it (indistinguishable fro...
#agent#llm#robotics
論文 arXiv 2026-06-04
Audio encoders are critical to modern audio applications as large language models (LLMs) increasingly rely on a single encoder for diverse inputs. While self-supervised learning (SSL) has yielded strong domain-specific encoders like speech or music experts, multi-domain approaches like USAD and SPEA...
#llm#benchmark#speech
論文 arXiv 2026-06-04
Recent advancements in reasoning language models have been driven by Reinforcement Learning (RL) fine-tuning. Most often, these rely on the Group Relative Policy Optimization (GRPO) algorithm or modifications thereof to steer the models to produce Chain-of-Thought (CoT) traces. The final answer can ...
#rl#fine-tuning
論文 arXiv 2026-06-04
Discrete diffusion language models generate text by iteratively denoising an entire response in parallel. At each step, they predict tentative tokens for every masked position, committing the confident predictions to the output and discarding the unconfident ones. We show that the discarded tokens a...
#diffusion#rag#benchmark
論文 arXiv 2026-06-04
Large language model (LLM) agents are increasingly applied to long-horizon tasks such as scientific discovery and machine learning engineering (MLE), where sustained self-evolution becomes a key capability. However, existing MLE agents suffer from inter-branch information isolation, memoryless searc...
#agent#llm#coding#benchmark
企業動向 Hugging Face 2026-06-06
Five labs, five minds: building a multi-model finance drama on small models...
モデル Google Research 2026-06-05
Data Management...
#agent#rag
企業動向 NVIDIA 2026-06-05
Home to cutting-edge sovereign AI infrastructure and robotics innovators, as well as one of the world’s most passionate gaming communities, South Korea is one of the world’s centers of AI. NVIDIA founder and CEO Jensen Huang is in Seoul this week to meet the partners and builders behind that work. S...
#robotics