Alibaba launches multimodal Qwen 3.7 Plus
Alibaba has announced the release of Qwen 3.7 Plus, a new multimodal AI model that integrates vision and language capabilities to facilitate GUI operations, coding, and complex agentic workflows. Available through the Alibaba Cloud Model Studio, this model serves as a cost-effective, versatile option tailored for automated web and desktop tasks, visual coding, and autonomous digital interactions.
Multimodal models are rapidly evolving from passive observation to active control, with Qwen 3.7 Plus demonstrating that the battleground for AI supremacy is shifting from simple text/reasoning to complex, agentic GUI automation.
- –Vision-language integration specifically optimized for GUI interaction enables developers to build tools that can interact with software like a human user.
- –Delivering the model as a proprietary commercial API via Alibaba Cloud Model Studio highlights a shift in Alibaba's approach, prioritizing managed monetization over open-source weights.
- –Offering a balanced mid-tier model like Qwen 3.7 Plus alongside a larger flagship Max model targets developers who need agentic power without the cost of high-parameter alternatives.
DISCOVERED
1h ago
2026-06-03
PUBLISHED
1h ago
2026-06-03
RELEVANCE
AUTHOR
WorldofAI
