
whichllm ranks the best local LLMs for your hardware
A CLI tool that auto-detects system hardware to rank and launch the best-performing local LLMs from HuggingFace. It optimizes model selection by matching specific quantizations to available VRAM while factoring in real-world speed and quality benchmarks.
whichllm solves the "black box" problem of local LLM performance by providing empirical rankings based on a user's specific GPU and RAM.
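The fit check at the heart of this kind of ranking can be approximated from first principles: a model's weight footprint is roughly its parameter count times bits per weight, plus headroom for the KV cache and runtime buffers. Below is a minimal Python sketch of that idea; the function names, the 20% overhead factor, and the quantization table are illustrative assumptions, not whichllm's actual scoring code.

```python
# Rough VRAM-fit estimate: weights ~= params * bits/8, plus runtime overhead.
# The overhead factor and quant table are illustrative assumptions, not
# whichllm's actual scoring model.

QUANT_BITS = {"Q4_K_M": 4.5, "Q5_K_M": 5.5, "Q8_0": 8.5, "F16": 16.0}

def weight_gb(params_b: float, quant: str) -> float:
    """Approximate weight size in GiB for a model with params_b billion params."""
    bits = QUANT_BITS[quant]
    return params_b * 1e9 * bits / 8 / 2**30

def fits(params_b: float, quant: str, vram_gb: float, overhead: float = 1.2) -> bool:
    """Does the quantized model fit in VRAM, with headroom for KV cache/buffers?"""
    return weight_gb(params_b, quant) * overhead <= vram_gb

if __name__ == "__main__":
    for q in QUANT_BITS:
        need = weight_gb(8.0, q) * 1.2
        print(f"8B @ {q}: ~{need:.1f} GiB needed, fits in 12 GiB: {fits(8.0, q, 12)}")
```

On a 12 GiB card this flags an 8B model as fitting at 4-, 5-, and 8-bit quantizations but not at F16, which is the kind of per-quantization ranking the tool describes.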
- Hardware-specific scoring accounts for VRAM overhead, memory bandwidth, and model architecture for accurate performance predictions (the fit arithmetic is sketched above).
- The "plan" command offers a reverse lookup for hardware buyers, identifying the components needed to run specific models like Llama 3 or Qwen (see the sketch after this list).
- Integrated execution via isolated uv environments makes it a one-command alternative to complex setups for rapid model exploration.
- v0.5.2 improves Apple Silicon estimation and multimodal model scoring, so unified-memory systems and vision-capable LLMs are ranked correctly.
- Live data fetching from HuggingFace keeps rankings tracking the latest releases and benchmark shifts in real time (a fetch sketch follows this list).
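The "plan" reverse lookup can be read as the same arithmetic run backwards: given a target model and quantization, solve for the minimum VRAM. A small illustrative function under the same assumed 20% overhead factor, not whichllm's actual planner:

```python
# Inverse of the fit check: minimum VRAM (GiB) needed for a target model,
# under an assumed 20% overhead for KV cache and buffers. Illustrative only;
# not whichllm's actual "plan" logic.
def min_vram_gb(params_b: float, bits_per_weight: float, overhead: float = 1.2) -> float:
    return params_b * 1e9 * bits_per_weight / 8 / 2**30 * overhead

# e.g. Llama 3 8B at ~4.5 bits/weight (Q4_K_M): roughly 5 GiB
print(f"{min_vram_gb(8.0, 4.5):.1f} GiB")
```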
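Live ranking implies querying the Hub at runtime rather than shipping a baked-in model list. A hedged sketch of what that fetch could look like using the official huggingface_hub client; the "gguf" tag filter and download-count sort are assumptions about how whichllm selects candidates, not confirmed behavior.

```python
# Fetch popular GGUF-tagged models from the HuggingFace Hub, sorted by
# downloads. Uses the official huggingface_hub client; the tag filter and
# sort choice are assumptions about how a tool like whichllm might work.
from huggingface_hub import HfApi

api = HfApi()
for model in api.list_models(filter="gguf", sort="downloads", direction=-1, limit=10):
    print(model.id, model.downloads)
```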
DISCOVERED: 2026-05-15
PUBLISHED: 2026-05-15
AUTHOR: andyyyy64