M4 Max MacBook benchmarked for OpenCode, Qwen3

// 46d ago BENCHMARK RESULT

A developer benchmarks an M4 Max MacBook running local LLMs for agentic coding and shares results for the Qwen3-30B-A3B model. The results show the throughput the 40-core GPU can reach when paired with a Mixture-of-Experts (MoE) model in a local development environment.

// ANALYSIS

The M4 Max's unified memory remains the definitive "killer feature" for running 30B+ parameter models at usable speeds on consumer-grade hardware.

– Benchmarks of ~89 tokens/sec on Qwen3-30B-A3B suggest that MoE models are a sweet spot for high-performance local coding agents: only about 3B of the model's 30B parameters are active per token, so each generated token touches a fraction of the weights.
– OpenCode is emerging as a leading model-agnostic TUI for developers seeking a local-first alternative to proprietary agents like Claude Code.
– While 32GB of RAM is viable for 30B models, the community increasingly recommends 64GB+ to accommodate the long context windows required for multi-file codebases.
– Switching to MLX-native runners reportedly provides a 30-50% performance boost over llama.cpp for Qwen models on Apple Silicon.
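The 32GB versus 64GB point can be illustrated with a back-of-envelope memory estimate. The sketch below is not taken from the benchmark post: the quantization level (about 4.5 bits per weight), the layer count, the KV-head count, the head dimension, and the 16-bit KV cache are all assumptions chosen for illustration and should be checked against the actual model configuration before being relied on.

```python
GIB = 1024 ** 3

# Illustrative assumptions, not figures from the benchmark post.
TOTAL_PARAMS = 30e9      # assumed total parameter count ("30B")
BITS_PER_WEIGHT = 4.5    # assumed effective size of a 4-bit quantization
N_LAYERS = 48            # assumed transformer layer count
N_KV_HEADS = 4           # assumed key/value heads (grouped-query attention)
HEAD_DIM = 128           # assumed per-head dimension
KV_BYTES_PER_VALUE = 2   # assumed 16-bit KV cache entries


def weight_bytes(total_params: float, bits_per_weight: float) -> float:
    """Memory for the quantized weights. All experts must stay resident,
    even though only a few are active for any given token."""
    return total_params * bits_per_weight / 8


def kv_cache_bytes(context_tokens: int, n_layers: int, n_kv_heads: int,
                   head_dim: int, bytes_per_value: int = 2) -> int:
    """Memory for the KV cache: one key and one value vector per layer,
    per KV head, per token in the context."""
    per_token = 2 * n_layers * n_kv_heads * head_dim * bytes_per_value
    return per_token * context_tokens


def estimate_gib(context_tokens: int) -> float:
    """Weights plus KV cache in GiB. Excludes the OS, other applications,
    and runtime overhead, so the real footprint is higher."""
    total = weight_bytes(TOTAL_PARAMS, BITS_PER_WEIGHT) + kv_cache_bytes(
        context_tokens, N_LAYERS, N_KV_HEADS, HEAD_DIM, KV_BYTES_PER_VALUE
    )
    return total / GIB


if __name__ == "__main__":
    for ctx in (8_192, 32_768, 131_072):
        print(f"{ctx:>7} tokens: ~{estimate_gib(ctx):.1f} GiB")
```

Under these assumptions the weights alone take roughly half of a 32GB machine, and the KV cache grows linearly with context length on top of that. Because the estimate leaves out the operating system, the editor, and the agent's own tooling, long contexts leave little headroom at 32GB.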

// TAGS

opencode, ai-coding, m4-max, macbook, qwen3, llm, agent, open-source

DISCOVERED

46d ago

2026-04-12

PUBLISHED

46d ago

2026-04-11

RELEVANCE

8/10

AUTHOR

AnotherDevArchSecOps
