Live AI developer news, ranked and linked to original sources.
Markdown sits near the point where human readability and machine readability meet. HTML adds a rendering layer where humans and agents can stop seeing the same artifact.

The post highlights a developer's workflow combining OpenAI's Codex model with the Cursor IDE. The developer notes that an IDE is essential for reviewing Codex's outputs and maintaining a project overview, and praises Cursor's built-in Composer 2.5 model as a highly effective tool for many development tasks.
Grok 4.5, xAI's next-generation large language model, is reportedly in private beta testing at Tesla and SpaceX. It is said to run on a 1.5 trillion-parameter V9 model, and Elon Musk describes its early performance as close to, or perhaps exceeding, Anthropic's Claude 3 Opus, which would mark a significant capability upgrade for xAI's suite of products.
Phosh 0.56.0 has been released, introducing a top-bar CPU load meter plugin and the ability to hide system applications on immutable Linux distributions. The update also upgrades underlying dependencies, now requiring phoc 0.55 or newer.
Claude Design 2.0 is Anthropic's visual canvas environment for design exploration, prototyping, and asset synchronization. The tool allows users to transform text prompts, images, and documents into interactive designs and features seamless integration with Claude Code to streamline the transition from design to development.
Matt Maher evaluates leading AI models like GPT-5.5 and Claude Opus 4.8 using the CARE benchmark to measure how successfully AI coding agents maintain user intent during planning and execution. While top-tier models create excellent initial plans, they frequently lose track of specific user instructions during execution, with specialized long-horizon modes preserving intent best.
Virtuals Protocol has brought AI agent tokenization to the Robinhood Chain, enabling the creation and trading of tokenized AI agents from day one. These AI agents led the opening trading volume on the chain, accompanied by new ecosystem updates including private inference and humanoid investing capabilities.
planning-with-files is an open-source, persistent, file-based planning system for AI coding agents and long-running tasks. It works with more than 60 agents (including Claude Code, Codex, and Cursor) by storing durable Markdown files (task_plan.md, findings.md, and progress.md) directly on disk, so the agent's memory and plan survive context loss or command-line clears. Its recent update introduces opt-in autonomous and gated modes with a deterministic completion gate that prevents the agent from finishing until every planned task is resolved, mimicking Manus-style workflow persistence.
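The completion gate described above can be illustrated with a minimal sketch. It assumes, purely for illustration, that task_plan.md tracks tasks as Markdown checkboxes (`- [ ]` open, `- [x]` done); the project's real file format and gate logic may differ, and the function names `open_tasks` and `may_finish` are hypothetical.

```python
import re
from pathlib import Path

# Assumed convention (not confirmed by the project): one Markdown checkbox per task.
TASK_LINE = re.compile(r"^\s*[-*]\s+\[( |x|X)\]\s+(.*)$")


def open_tasks(plan_text: str) -> list[str]:
    """Return the descriptions of tasks that are still unchecked."""
    remaining = []
    for line in plan_text.splitlines():
        match = TASK_LINE.match(line)
        if match and match.group(1) == " ":
            remaining.append(match.group(2).strip())
    return remaining


def may_finish(plan_path: Path) -> bool:
    """Hypothetical gate: allow completion only if the plan exists and has no open tasks."""
    if not plan_path.exists():
        # No plan on disk means nothing can be verified, so the gate stays closed.
        return False
    return not open_tasks(plan_path.read_text(encoding="utf-8"))
```

Because the check reads only the file on disk, the result does not depend on what the agent currently holds in its context window, which is the property the item attributes to the gate.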
ShieldSuite is entering the X Layer AI Genesis Hackathon to build a security-first agentic infrastructure layer combining OKX Onchain OS and X Layer. The project aims to secure onchain AI agents with tools like transaction interception and real-time threat scanning.
HTMX has released the beta of version 4.0, a major architectural shift that replaces its legacy XMLHttpRequest-based AJAX implementation with the modern fetch API. The update also adds native DOM morphing and support for streaming responses, letting developers build highly interactive user interfaces with lightweight HTML attributes rather than complex client-side JavaScript frameworks.
AI researcher Machina (@EXM7777) has released a free library of 25 documented, flow-mapped agentic loops optimized for Anthropic's Claude Fable 5 model. The resource covers automations for marketing, sales, research, and coding, pairing each loop with ready-to-use prompts, tool requirements, and target goals.
Nous Research's open-source Hermes Agent is shifting from a standalone autonomous agent to a persistent operating layer. As detailed in the Hermes Atlas newsletter, the framework now functions as a self-hosted execution environment managing identity, tools, long-term memory, and multi-platform integrations.
Gemini 3.5 Pro is Google's upcoming frontier model, rumored to feature a freshly pretrained base model, a 2-million-token context window, a specialized Deep Think reasoning layer, and advanced agentic capabilities. The details come from a leak in a WorldofAI YouTube video, which frames the model as Google's attempt to rival competitors such as GPT-5.6 and Fable 5.