BullshitBench v2 tracks model pushback

// 82d ago · BENCHMARK RESULT

Peter Gostev’s open BullshitBench benchmark has expanded to 100 nonsense prompts across five domains and now ships with a live v2 explorer and leaderboard. Instead of measuring raw knowledge, it tests whether models clearly reject broken premises, partially challenge them, or confidently accept nonsense.

// ANALYSIS

BullshitBench matters because it measures a failure mode most benchmark suites barely touch: models that sound smart while endorsing nonsense. That makes it unusually relevant for anyone building AI systems that need judgment, not just fluent output.

– The v2 dataset broadens coverage from the original set to 100 prompts spanning software, finance, legal, medical, and physics
– Its green/amber/red scoring is easy to interpret and maps directly onto product risk: the model pushes back, hedges, or confidently accepts the nonsense
– The live explorer makes the benchmark more useful than a static paper by letting developers inspect model behavior, domain splits, and leaderboard movement directly
– Early discussion around the release reinforces a key point for AI builders: more reasoning effort does not automatically produce better pushback on bad premises
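
To make the green/amber/red idea concrete, here is a minimal sketch of how per-prompt ratings could be rolled up into per-domain shares. It is not BullshitBench's actual scoring code: the `Rating` enum, the `summarize` and `by_domain` helpers, and the sample data are all hypothetical, and the ratings shown are made-up placeholders, not benchmark results.

```python
from collections import Counter
from enum import Enum


class Rating(Enum):
    # Hypothetical labels mirroring the green/amber/red scheme described above.
    GREEN = "green"  # clear pushback on the broken premise
    AMBER = "amber"  # partial challenge or hedge
    RED = "red"      # confident acceptance of nonsense


def summarize(ratings):
    """Return the share of each rating in a non-empty list of Rating values."""
    if not ratings:
        raise ValueError("ratings must not be empty")
    counts = Counter(ratings)
    total = len(ratings)
    # Counter returns 0 for ratings that never occur, so every key is present.
    return {r.value: counts[r] / total for r in Rating}


def by_domain(results):
    """Group (domain, Rating) pairs and summarize each domain separately."""
    grouped = {}
    for domain, rating in results:
        grouped.setdefault(domain, []).append(rating)
    return {domain: summarize(rs) for domain, rs in grouped.items()}


# Made-up placeholder ratings for illustration only, not real results.
sample = [
    ("software", Rating.GREEN),
    ("software", Rating.RED),
    ("medical", Rating.GREEN),
    ("medical", Rating.AMBER),
]

if __name__ == "__main__":
    for domain, shares in by_domain(sample).items():
        print(domain, shares)
```

Reporting shares per domain rather than one overall number keeps a model that pushes back well on software prompts but accepts medical nonsense from hiding behind a decent average.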

// TAGS

bullshitbench, llm, reasoning, benchmark, research, open-source

DISCOVERED

82d ago

2026-03-06

PUBLISHED

82d ago

2026-03-06

RELEVANCE

8/10

AUTHOR

Income stream surfers
