YOU ARE VIEWING ONE ITEM FROM THE AICRIER FEED

Talkie 13B drops, trained on pre-1931 text

AICrier tracks AI developer news across Product Hunt, GitHub, Hacker News, YouTube, X, arXiv, and more. This page keeps the article you opened front and center while giving you a path into the live feed.

// WHAT AICRIER DOES

7+

TRACKED FEEDS

24/7

SCRAPED FEED

Short summaries, external links, screenshots, relevance scoring, tags, and featured picks for AI builders.

Talkie 13B drops, trained on pre-1931 text
OPEN LINK ↗
// 45d agoMODEL RELEASE

Talkie 13B drops, trained on pre-1931 text

Talkie is a 13B parameter "vintage" language model trained on 260B tokens of public domain text from before 1931. Developed by Nick Levine, David Duvenaud, and Alec Radford, it provides a unique "clean" baseline for studying AI generalization and reasoning capabilities without the contamination of modern web data or code.

// ANALYSIS

Talkie is a fascinating research milestone that strips away the modern web to reveal how LLMs reason without the crutch of memorized contemporary data.

  • Demonstrates remarkable temporal consistency, accurately identifying Melbourne as Australia's capital (the seat of government from 1901 to 1927).
  • Solves simple Python problems via in-context learning despite having no exposure to modern programming languages in its pretraining corpus.
  • Highlights the massive overhead of training on raw OCR text, which is only 30% as efficient as human-transcribed data.
  • Introduces "surprisingness" metrics for future events as a novel way to quantify model knowledge cutoffs and identify data leakage.
  • Provides a vital benchmark for researchers to distinguish between emergent reasoning and simple pattern matching of web-scraped content.
// TAGS
talkiellmresearchopen-weightsdatasetpython

DISCOVERED

45d ago

2026-04-28

PUBLISHED

45d ago

2026-04-28

RELEVANCE

9/ 10

AUTHOR

Outside-Iron-8242