OPEN_SOURCE
REDDIT · 27d ago · INFRASTRUCTURE
Old Titan X Pascal hits 25 tok/s
A developer dusted off an NVIDIA Titan X Pascal GPU and added it to a local server to run LLMs, achieving ~500 tokens/sec prompt processing and 25 tokens/sec generation with llama.cpp and OpenCode, roughly matching a modern AMD RX 9070 XT on prompt processing while generating at about half its speed.
// ANALYSIS
Decade-old Pascal hardware still punches above its weight for local inference, which matters as more developers repurpose aging GPUs rather than buy new.
- 25 tok/s generation on decade-old hardware is genuinely usable for background coding agents or overnight batch tasks
- llama.cpp's CPU+GPU offloading means even cards with limited VRAM can contribute meaningfully to inference throughput
- The ~4x speedup over CPU-only (6 tok/s → 25 tok/s) shows the GPU floor is low but real
- This is a practical benchmark for the "basement server" crowd increasingly running local AI pipelines
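The partial-offload setup described above can be sketched with llama.cpp's CLI. This is a minimal sketch, not the poster's exact configuration: the model path is a placeholder and the layer count is an assumption to be tuned against the Titan X Pascal's 12 GB of VRAM.

```shell
# Minimal llama.cpp invocation with partial GPU offload.
# -ngl: number of transformer layers placed on the GPU
#       (raise until the card's 12 GB of VRAM is full)
# -t:   CPU threads for the layers left on the host
# -c:   context length in tokens
# The model path is hypothetical; any GGUF quantization that fits works.
llama-cli -m ./models/model.gguf -ngl 20 -t 8 -c 4096 \
  -p "Write a unit test for the parser."
```

Even when only some layers fit on the GPU, llama.cpp splits the work so the card accelerates its share while the CPU handles the rest, which is how a limited-VRAM card still lifts overall throughput.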
// TAGS
llm · inference · gpu · open-source · self-hosted
DISCOVERED
2026-03-16
PUBLISHED
2026-03-16
RELEVANCE
5/10
AUTHOR
Lazy-Routine-Handler