Alibaba’s Qwen team released a new family of AI models, Qwen2.5-VL
... that can perform a number of text and image analysis tasks.
The models can parse files, understand videos, and count objects in images, as well as control a PC —
similar to the model powering OpenAI’s recently launched Operator.
Per the Qwen team’s benchmarking, the best Qwen2.5-VL model beats OpenAI’s GPT-4o, Anthropic’s Claude 3.5 Sonnet, and Google’s Gemini 2.0 Flash
on a range of video understanding, math, document analysis, and question-answering evaluations.
Tillbaka till Rolfs länktips 1-2 Februari 2025
Kommentarer