LLaVA-OneVision Family
A research suite of fully open multimodal models, from foundational vision encoders to the latest video-language frontier.
Releases
Blog
COMING SOON
Posts coming soon
Deep-dives, training notes, and engineering write-ups from the LLaVA-OneVision team will live here.