Script makes tokens-per-second feel concrete
tokenspeed is a lightweight script and web demo for building intuition around LLM generation speed. It translates raw tokens-per-second numbers into a more human sense of how text, code, and reasoning+code actually feel while you wait. The goal is not to benchmark models, but to make performance claims easier to interpret in day-to-day local LLM use.
This is useful because tokens/sec is objective but not intuitive, especially once you move beyond plain chat.
- 21 tokens/second is usually in the “feels responsive” range for plain text, though longer outputs still benefit from faster throughput.
- 10 tokens/second is not unusable; it is more “noticeably slow” than “broken,” and the delay becomes more obvious on code and reasoning tasks.
- The strongest value here is calibration: it helps people compare claims across workloads instead of arguing from raw numbers alone.
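The intuition above can be sketched as simple arithmetic: generation time is roughly output length divided by throughput. The following is a minimal illustration, not part of tokenspeed itself; the token counts per output type are rough assumptions chosen for illustration.

```python
# Rough, assumed token counts for typical output types (not measured values).
TYPICAL_OUTPUTS = {
    "short chat reply": 150,
    "code snippet": 400,
    "reasoning + code": 1200,
}

def wait_seconds(tokens: int, tokens_per_second: float) -> float:
    """Estimated generation time in seconds, ignoring prompt
    processing and first-token latency."""
    return tokens / tokens_per_second

for tps in (10, 21, 50):
    print(f"--- {tps} tokens/second ---")
    for label, tokens in TYPICAL_OUTPUTS.items():
        print(f"{label:>18}: {wait_seconds(tokens, tps):6.1f} s")
```

For example, a 1200-token reasoning-plus-code answer takes about two minutes at 10 tokens/second but under a minute at 21, which is why the same throughput number can feel fine for chat and painful for longer tasks.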
Discovered: 2026-05-10
Published: 2026-05-10
Author: MikeNonect