Claude Code builds Flash-MoE in 24 hours

// 68d agoNEWS

Claude Code builds Flash-MoE in 24 hours

ANNOUNCEMENT PRODUCT PRODUCT HUNT YOUTUBE

Anthropic's agentic CLI tool was used to autonomously build Flash-MoE, a custom C/Metal inference engine that runs a 397B-parameter Qwen 3.5 model on a 48GB MacBook Pro at 5.5 t/s. By automating 90+ optimization experiments in a single day, Claude Code demonstrated the power of agentic engineering in solving complex, low-level systems problems that typically require weeks of human effort.

// ANALYSIS

Claude Code's "autoresearch" capability is a landmark shift from AI-assisted to AI-led engineering, proving that agents can handle brute-force system optimization at scale.

–Flash-MoE implements Apple's "LLM in a Flash" research to stream expert weights from SSD, bypassing the 200GB+ RAM requirement for massive MoE models.
–The engine uses custom Metal kernels and FMA-optimized 4-bit dequantization to achieve usable local inference speeds on consumer hardware.
–Claude Code autonomously discovered the optimal balance of parallel I/O and GPU scheduling, a task involving a massive search space of configuration and code changes.
–This project highlights the transition of AI tools from simple code generators to autonomous research partners capable of validating hypotheses through empirical testing.
–The 24-hour turnaround for a project of this technical depth sets a new benchmark for the speed of AI-driven software development.

// TAGS

claude-codeai-codingagentllminferencegpuedge-aiopen-source

DISCOVERED

68d ago

2026-03-22

PUBLISHED

68d ago

2026-03-22

RELEVANCE

9/ 10

AUTHOR

Github Awesome

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

MODEL1d ago

Anthropic drops Opus 4.8, teases upcoming Mythos model

Anthropic launched Claude Opus 4.8 with adjustable effort controls, dynamic workflows for Claude Code, and a cheaper fast mode. The release serves as a precursor to their highly anticipated Claude Mythos model, which is slated to roll out in the coming weeks.

VIDEO1d ago

Viral video teases Claude Opus 4.8

A viral video directed by Miguel07Code showcases impressive "hyperframes" camera movements, allegedly generated by Claude Opus 4.8. The post has sparked speculation about Claude's video generation capabilities.

LAUNCH1d ago

Browser Use Terminal launches Rust web-agent TUI

Browser Use Terminal is a new Rust-based TUI that lets developers automate and steer browser tasks directly from the command line. It combines a lightweight LLM harness with direct CDP control over Chrome for highly observable, interactive automation.