Reddit claims low-KL Qwen refusal wipe

// 120d agoBENCHMARK RESULT

Reddit claims low-KL Qwen refusal wipe

A LocalLLaMA Reddit post claims a weekend method can strip refusal behavior from Qwen 3.5 2B to 0/120 refusals in minutes while keeping low 50-token KL divergence. The author shares partial logs, calls results reproducible on consumer and multi-GPU hardware, and says a paper is planned but not yet published.

// ANALYSIS

This is an eye-catching benchmark claim, but it is still unreviewed anecdotal evidence until code, method details, and independent replication are available.

–The reported tradeoff is unusually strong: near-preserved behavior (KL 0.0141) with complete refusal removal.
–If validated, the technique could materially lower the barrier for safety stripping on open models.
–The lack of a paper or reproducible artifact right now makes this more of an early signal than a confirmed breakthrough.

// TAGS

qwen3-5-2bllmsafetybenchmarkopen-weights

DISCOVERED

120d ago

2026-03-14

PUBLISHED

120d ago

2026-03-14

RELEVANCE

7/ 10

AUTHOR

Sliouges

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

OPEN SOURCE32m ago

C# PS5 emulator SharpEmu boots 2D games

SharpEmu is an experimental, open-source PlayStation 5 emulator written in C# that targets Windows, Linux, and macOS. In its early development stages, the project has successfully booted simple 2D games like Dreaming Sarah and shown initial progress loading complex titles such as Demon's Souls Remake.

OPEN SOURCE33m ago

background-agents launches multi-repo coding agents

background-agents is an open-source platform for running autonomous coding agents asynchronously in cloud sandboxes. Built on Cloudflare, Modal, and Daytona, the system enables agents to perform long-running tasks like security audits and migrations across multiple repositories.

OPEN SOURCE34m ago

FlClash is a multi-platform proxy client based on ClashMeta, offering a simple, open-source, and ad-free interface.

FlClash is an open-source, multi-platform GUI proxy client built on ClashMeta. Developed using Dart and Flutter, it offers a unified, ad-free interface for managing network proxy settings across Android, iOS, Windows, macOS, and Linux. The application aims to provide a user-friendly way to configure and run ClashMeta-based rule routing.