Technology Radar
Published : Apr 03, 2024
NOT ON THE CURRENT EDITION
This blip is not on the current edition of the Radar. If it was on one of the last few editions, it is likely that it is still relevant. If the blip is older, it might no longer be relevant and our assessment might be different today. Unfortunately, we simply don't have the bandwidth to continuously review blips from previous editions of the Radar.
Understand more
Apr 2024
Assess
LLaVA(Large Language and Vision Assistant) 是一个开源的大型多模态模型,它结合了视觉编码器和大语言模型,用于通用视觉和语言理解。LLaVA 在遵循指令方面的强大能力,使其成为多模态人工智能模型中的有力竞争者。最新版本,LLaVA-NeXT,能进一步提升问答能力。在开源的语言和视觉辅助模型中,与GPT-4 Vision相比,LLaVA 是一个很有前景的选择。我们的团队一直在使用它进行视觉问题解答。