Qwen 3.7 Max hits 10x kernel speedup

// 45d agoMODEL RELEASE

Qwen 3.7 Max hits 10x kernel speedup

Alibaba Cloud's flagship model demonstrates 35 hours of unsupervised autonomous engineering, rewriting a GPU kernel for a 10x performance boost on undocumented hardware.

// ANALYSIS

The "agentic marathon" is the new benchmark, shifting focus from chat accuracy to sustained autonomous reasoning and tool use.

–Qwen 3.7 Max made 1,158 tool calls and 432 evaluations over 35 hours without human intervention.
–Rewrote the SGLang Triton attention kernel to achieve a 10.0x geometric mean speedup on the new Zhenwu M890 processor.
–Features a 1M-token context window and full Model Context Protocol (MCP) integration for complex workflows.
–Performance in GPQA Diamond (92.4) surpasses competitors like GLM 5.1 and DeepSeek V4.
–Proprietary release via API ($2.50/$7.50) signals Alibaba's shift toward high-margin enterprise "AI Factories."

// TAGS

qwen-3-7-maxllmagentai-codingcoding-agentreasoningmcpgpu

DISCOVERED

45d ago

2026-05-22

PUBLISHED

45d ago

2026-05-22

RELEVANCE

10/ 10

AUTHOR

AlphaSignalAI

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

OPEN SOURCE5m ago

Alibaba open-sources Zvec in-process vector database

Developed by Alibaba and built upon their battle-tested Proxima search engine, zvec is a high-performance, in-process vector database that functions much like the SQLite of vector databases. It requires no dedicated server infrastructure and supports dense and sparse vectors, hybrid retrieval, and billion-scale search with sub-millisecond latency, making it ideal for RAG pipelines and local AI applications.

OPEN SOURCE6m ago

claude-video brings multimodal video analysis to Claude

claude-video is an open-source plugin that provides video comprehension capabilities to Claude Code and other agentic environments. It downloads videos, extracts keyframes, and transcribes audio to pass as a bundled multimodal prompt, enabling video summarization, QA, and content analysis.

UPDATE18m ago

Mercury Agent launches daemon mode for persistent workflows

Mercury Agent is an open-source, soul-driven AI agent framework designed for long-running, multi-hour workflows. With its daemon mode feature, the agent runs in the background as a system daemon, allowing integrations like Telegram, Discord, Slack, and scheduled tasks to keep running continuously without losing state or context even after the terminal window is closed.