llama.cpp sm120 CUDA build hits Windows snag
The Reddit post asks whether anyone has a clean sm120 CUDA build of llama.cpp working on Windows, after the poster hit compile friction on newer GPUs. They note that Vulkan is stable as a fallback and want to know whether this is toolchain lag or a real blocker in the project.
This looks less like llama.cpp being fundamentally broken and more like Blackwell/CUDA support still settling on Windows. NVIDIA's CUDA 12.8 release added sm_120 compiler support, so the architecture itself is targetable; the rough edges are in the surrounding build stack and kernels. llama.cpp's build docs already cover CUDA, non-native builds, and setting CMAKE_CUDA_ARCHITECTURES explicitly, which gives supported escape hatches when auto-detection misbehaves. Other Windows reports on RTX 5090-class hardware show CUDA builds compiling and detecting compute capability 12.0, so this reads as a fragile compatibility pocket rather than a total lack of support. Vulkan remains the pragmatic fallback if you want stable local inference now instead of spending time on the newest CUDA edge cases.
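For readers hitting the same wall, the escape hatch mentioned above can be sketched as a build invocation that pins the CUDA architecture instead of relying on auto-detection. This is a hedged example, not the poster's exact command: it assumes a CUDA 12.8+ toolkit is installed (the first release with sm_120 support) and uses llama.cpp's documented GGML_CUDA CMake option.

```shell
# From a llama.cpp checkout, configure a CUDA build that targets
# Blackwell (compute capability 12.0) explicitly, bypassing
# architecture auto-detection. Requires CUDA toolkit 12.8 or newer.
cmake -B build -DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES="120"

# Build in Release mode (the --config flag matters for the
# Visual Studio generator that CMake defaults to on Windows).
cmake --build build --config Release
```

If the toolchain still chokes on native sm_120 kernels, dropping back to an older architecture value (e.g. "89") produces PTX that the driver can JIT-compile for newer GPUs, at some startup and performance cost.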
DISCOVERED
2026-03-29
PUBLISHED
2026-03-29
AUTHOR
prophetadmin