Qwen-Scope maps Qwen 3.5 hidden features

// 50d agoOPENSOURCE RELEASE

Qwen-Scope maps Qwen 3.5 hidden features

Qwen Team released Qwen-Scope, a public SAE suite for Qwen 3.5 models spanning 2B through 35B MoE, plus a Hugging Face Space for feature exploration and steering. It exposes residual-stream features across layers, turning model internals into something researchers can inspect, localize, and intervene on.

// ANALYSIS

This is a serious interpretability release, not just another model dump. The big story is that Qwen is making feature-level control and debugging feel practical for a broad model family, which moves SAEs from niche research into usable tooling.

–Coverage across multiple Qwen 3.5 sizes makes this more useful than a one-off demo on a single checkpoint.
–Residual-stream, all-layer coverage matters because it lets you trace behaviors like language switching, refusals, and style drift to specific learned features.
–Steering and ablation are the obvious headline use cases, but the more durable value is debugging and dataset auditing for fine-tunes.
–It is also plainly dual-use: the same machinery that helps explain behavior can be used to suppress safety-related features or push the model toward unwanted behaviors.
–Compared with prompt-only control, feature editing is much more surgical, which is why interpretability folks will care and policy folks will be uneasy.

// TAGS

qwen-scopeqwenllmopen-sourceresearchinterpretabilitysafety

DISCOVERED

50d ago

2026-04-30

PUBLISHED

51d ago

2026-04-30

RELEVANCE

9/ 10

AUTHOR

MadPelmewka

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

VIDEO15m ago

Claude Code creator develops entirely from phone

In a 40-minute presentation, Claude Code creator Boris Cherny shared that he writes 100% of his code using Claude, primarily managing the developer loop from his phone. He highlighted underutilized features that enable this workflow, such as auto mode—which lets Claude approve its own safe terminal commands to run tasks autonomously for hours—and customized output styles.

NEWS22m ago

GPT-5.6 Pro builds interactive Sims-like simulator

A developer demonstration highlights the capability of GPT-5.6 Pro to generate a complete, self-contained Sims-like life simulator loop within a single interface artifact. The model handles state coordination, multi-agent logic, and UI rendering out of the box without requiring external coding harnesses.

NEWS32m ago

Riley Brown tests Chorus agent OS autonomy

Riley Brown has launched a live experiment to test whether the Chorus agent operating system can autonomously run a real OS without human assistance. To kick off the project, he built @skyeagnt, a fully autonomous agent powered by Chorus that manages and posts to its own X account completely independently.

Qwen-Scope maps Qwen 3.5 hidden features