Technology Radar

LLaVA

Published : Apr 03, 2024

NOT ON THE CURRENT EDITION

This blip is not on the current edition of the Radar. If it was on one of the last few editions, it is likely that it is still relevant. If the blip is older, it might no longer be relevant and our assessment might be different today. Unfortunately, we simply don't have the bandwidth to continuously review blips from previous editions of the Radar. Understand more

Apr 2024

Assess

LLaVA（Large Language and Vision Assistant） 是一个开源的大型多模态模型，它结合了视觉编码器和大语言模型，用于通用视觉和语言理解。LLaVA 在遵循指令方面的强大能力，使其成为多模态人工智能模型中的有力竞争者。最新版本，LLaVA-NeXT，能进一步提升问答能力。在开源的语言和视觉辅助模型中，与GPT-4 Vision相比，LLaVA 是一个很有前景的选择。我们的团队一直在使用它进行视觉问题解答。

Download the PDF

English | Português

Sign up for the Technology Radar newsletter

Subscribe now

行业

数字出版物和工具

所有洞见

LLaVA

Download the PDF

Sign up for the Technology Radar newsletter

Visit our archive to read previous volumes