Alibaba’s Qwen team released a new family of AI models, Qwen2.5-VL, that can perform a number of text and image analysis tasks.



The models can parse files, understand videos, and count objects in images, as well as control a PC, similar to the model powering OpenAI’s recently launched Operator.

Per the Qwen team’s benchmarking, the best Qwen2.5-VL model beats OpenAI’s GPT-4o, Anthropic’s Claude 3.5 Sonnet, and Google’s Gemini 2.0 Flash on a range of video understanding, math, document analysis, and question-answering evaluations.

https://techcrunch.com/2025/01/27/alibabas-qwen-team-releases-ai-models-that-can-control-pcs-and-phones/


DeepSeek Just Lost to Alibaba’s New AI!






