
Qwen Releases Visual Understanding Model Qwen2-VL-72B


On August 30, Alibaba's Qwen team released the second-generation vision-language model Qwen2-VL, and the API for the flagship model, Qwen2-VL-72B, is now available on the Alibaba Cloud Bailian platform. On several authoritative benchmarks, Qwen2-VL outperformed closed-source models such as GPT-4o and Claude 3.5 Sonnet on some metrics. Compared with the previous generation, Qwen2-VL can understand videos longer than 20 minutes, supporting video-based Q&A, dialogue, and content creation. It can also operate mobile phones and robots autonomously: with complex reasoning and decision-making capabilities, it can be integrated into phones, robots, and other devices to act automatically based on the visual environment and text instructions. In addition, it can understand multilingual text in images and videos, including Chinese, English, most European languages, Japanese, Korean, Arabic, and Vietnamese.


Source: DIYuan
