論文 Hugging Face 発表: 2026-06-01 HF ↑5

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training

著者: Zelun Zhang, Hongen Liu, Suyin Liang, Yubo Zhang, Yiqing Xiang ほか10名

要約

We introduce PaddleOCR-VL-1.6, an upgraded compact document parsing model built upon PaddleOCR-VL-1.5. Although PaddleOCR-VL-1.5 establishes a strong 0.9B baseline, its remaining errors concentrate in under-optimized regions where model behavior is unstable, data coverage is sparse, or supervision i…

#rl#multimodal

同じカテゴリの記事