Uncensored MeroMero 26B MoE drops for low VRAM
The highly requested 26B-A4B version of the MeroMero finetune is now available, offering a faster, low-VRAM alternative to its 31B predecessor. By employing Arbitrary-Rank Ablation, the release slashes the Gemma 4 base refusal rate from 99% to 12% while maintaining creative writing and reasoning performance.
This release highlights the local AI community's priority on accessibility, trading maximum parameter counts for MoE models that fit on consumer GPUs. With only about 4B parameters active per token, generation is fast on hardware with limited VRAM, although the full 26B weights still have to be held in memory or offloaded. Heretic v1.2.0 with Arbitrary-Rank Ablation (ARA) "abliterates" the model's safety guardrails without requiring a full finetune. Merging the result back into the original instruct version to "heal" logic damage illustrates the growing sophistication of community-led post-training pipelines.
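To put the low-VRAM claim in rough numbers, here is a back-of-the-envelope sketch of weight memory at common quantization widths. It is an illustration only: the bit widths are assumptions rather than details of this release, the 31B predecessor is treated as a plain parameter count, and KV cache, activations, and runtime overhead are ignored. The helper name `weight_memory_gib` is hypothetical.

```python
# Rough estimate of memory needed for model weights alone.
# Assumptions (not from the release notes): uniform bits per weight,
# no KV cache, no activations, no runtime overhead.

def weight_memory_gib(params_billions: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GiB for a given parameter count."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30

TOTAL_B = 26.0        # total parameters of the MoE model, in billions
ACTIVE_B = 4.0        # parameters active per token, in billions
PREDECESSOR_B = 31.0  # parameter count of the earlier 31B release

for bits in (16, 8, 4):
    moe = weight_memory_gib(TOTAL_B, bits)
    prev = weight_memory_gib(PREDECESSOR_B, bits)
    print(f"{bits:>2}-bit: 26B = {moe:5.1f} GiB, 31B = {prev:5.1f} GiB")

# Fraction of the weights touched per token. This affects compute and
# bandwidth per token, not how much memory the full model occupies.
active_fraction = ACTIVE_B / TOTAL_B
print(f"active fraction per token: {active_fraction:.1%}")
```

The storage gap between 26B and 31B is modest at any bit width. Under these assumptions, most of the speed benefit comes from the small active fraction per token rather than from the lower total parameter count.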
DISCOVERED
2026-05-23
PUBLISHED
2026-05-23
AUTHOR
LLMFan46
