> ▌
Markdown sits near the point where human readability and machine readability meet. HTML adds a rendering layer where humans and agents can stop seeing the same artifact.

Better Stack

WorldofAI

Discover AI

Bijan Bowen

DIY Smart Code

AICodeKing

DIY Smart Code

Better Stack

DIY Smart Code
A developer has recreated Guillermo Rauch's Vercel Ship 26 London keynote presentation in three hours using open-slide and Cursor Composer. The project demonstrates the synergy between code-first slide frameworks and AI coding assistants, and the live reconstructed deck has been shared online.
OpenAI President Greg Brockman endorsed the Codex App, highlighting how the graphical agent orchestrator has replaced his traditional terminal-based development workflow. The desktop application enables developers to run multiple agents in parallel, utilize sandbox-based worktrees, and automate build-test-debug loops.
GLM-5.2 is a 744-billion-parameter open-weights Mixture-of-Experts model from Z.ai optimized for coding, frontend design, and long-context agentic reasoning. Trained using the open-source Slime RL framework, the model features a 1-million-token context window, speculative decoding, and adjustable reasoning effort modes to balance latency and quality in autonomous engineering loops.
CodeRabbit has announced the default integration of React Doctor v0.5.6 into its AI-driven code review platform. React Doctor is an open-source static analysis tool created by Aiden Bai (creator of Million.js) that audits React codebases for performance, security, and accessibility issues. By combining React Doctor's deterministic auditing with CodeRabbit's AI-powered review flow, users will automatically receive detailed health checks on their React pull requests to prevent regressions.
Rumors and community leaks suggest OpenAI is stealth testing a next-generation AI model, GPT-5.6 Pro, directly within ChatGPT with a reasoning budget of 960 and native Playwright integration. While OpenAI has not officially announced the model, leaked details point to a December 2025 knowledge cutoff and native browser automation features ready for agentic web workflows.
Hermes Agent v0.17.0 is a major update to the open-source, persistent AI agent framework by Nous Research that integrates its core terminal capabilities with a new desktop application. This cohesion allows developers to manage persistent memory, skills, and model configurations through a unified graphical interface.
Z.ai's new GLM-5.2 has jumped to #6 on OpenCode's weekly usage leaderboard, with OpenCode listing 457B tokens and 90,971 sessions after its June 13 release. The ranking adds real developer-usage momentum to a model already being discussed as one of the strongest open-weight coding LLMs.
The post highlights a critical flaw in current AI agent evaluation methodologies that treat agents like standard chatbots by focusing only on the correctness of the final output. Because agents execute complex multi-step reasoning, tool interactions, and data handling, a correct final answer can easily hide dangerous underlying behaviors such as unauthorized data access, leaking information, or utilizing incorrect tools. Effective evaluation must assess the entire trajectory of the agent's path rather than just its final response.
Recent UK media reports have sparked concerns over a potential household VPN ban as the government considers new controls to enforce social media restrictions for minors. The discussions, led by Children’s Minister Josh MacAlister and regulator Ofcom, focus on introducing "age-gates" to close loopholes that allow children to use virtual private networks to circumvent age verification systems mandated by the Online Safety Act. While officials acknowledge the critical importance of VPNs for business operations and user privacy—stating that a complete ban is not under consideration—the government is expected to publish a formal statement outlining its regulatory approach in July 2026.
LLM Brewing is the code artifact for a June 16 arXiv paper tracing when code-reasoning answers become readable in LLM hidden states, when models can actually decode them, and why some answers degrade later. The project ships benchmark, probing, CSD, diagnostic, and causal-validation pipelines on a separate experiment branch.
LoopWM is a new arXiv research paper proposing looped transformer depth for world models, using a shared recurrent block to iteratively refine latent environment states. The authors claim up to 100x parameter efficiency, adaptive compute per transition, and stronger long-horizon prediction on ScienceWorld-style simulation tasks.
Mercury Agent says GLM-5.2 is performing well enough to consider making Z.ai's model its default provider. The move would matter because Mercury is an open-source, always-on coding agent built around permissioned tools, memory, skills, and multi-channel operation.
The U.S. government ordered Anthropic to restrict foreign access to Claude Fable and Claude Mythos, prompting the company to suspend access globally. The ban was triggered by concerns over access by a South Korean telecom with alleged Chinese ties and safety issues raised by Amazon.
Cosmic Stack Labs has released version 1.1.13 of Mercury Agent, code-named "Chatty," which adds support for Discord, Slack, and Signal communication gateways. The update expands the 24/7 open-source AI assistant's accessibility beyond CLI and Telegram, letting users interact with it directly in their daily messaging applications.
Perplexity Brain is a self-improving memory system for the Perplexity Computer agent that builds a living context graph of actions, successes, and failures. Consolidating this graph overnight to optimize instructions reportedly increases task correctness by 25% and reduces execution costs by 13%.
Sui, the layer-1 blockchain platform, has announced a new prototype designed to secure payments and commerce involving artificial intelligence agents. Unveiled on X, the system's core innovation is separating the transaction authorization process from the AI itself, paving the way for safer autonomous financial operations.
Stripe Projects has expanded its developer integrations, enabling AI agents to autonomously provision and manage third-party services directly from the CLI. The platform automates resource setup across 49 providers, syncing credentials into the workspace while consolidating billing.

Tsinghua University's THUDM group has open-sourced Slime, the reinforcement learning (RL) post-training framework behind Zhipu AI's GLM-4 and GLM-5 series. By integrating Megatron-LM with SGLang to bridge training and inference, Slime enables parallel On-Policy Distillation (OPD) loops and completed the post-training of the 744B-parameter GLM-5.2 MoE model in approximately two days.
In a 40-minute presentation, Claude Code creator Boris Cherny shared that he writes 100% of his code using Claude, primarily managing the developer loop from his phone. He highlighted underutilized features that enable this workflow, such as auto mode—which lets Claude approve its own safe terminal commands to run tasks autonomously for hours—and customized output styles.
A developer demonstration highlights the capability of GPT-5.6 Pro to generate a complete, self-contained Sims-like life simulator loop within a single interface artifact. The model handles state coordination, multi-agent logic, and UI rendering out of the box without requiring external coding harnesses.
Riley Brown has launched a live experiment to test whether the Chorus agent operating system can autonomously run a real OS without human assistance. To kick off the project, he built @skyeagnt, a fully autonomous agent powered by Chorus that manages and posts to its own X account completely independently.
An upcoming Tokyo meetup scheduled for the 24th will feature a live demo of HyperFrames, HeyGen's open-source "video-as-code" framework that allows developers and AI agents to programmatically render videos using HTML, CSS, and JavaScript. The event offers creators and developers an opportunity to connect directly with the HyperFrames team.
Tibor Tee, a community developer at Cursor, shared that the team tracks the "Keep Rate," the percentage of agent-generated code remaining in a user's codebase over time. This telemetry helps evaluate the long-term quality and correctness of AI agent output.
Discovery directory Toolfolio has showcased a curated list of AI agent tools to help users automate routine tasks and optimize their workflows. The list features specialized tools including Agent One for automating boring tasks, Agentmail for giving AI agents email inbox APIs, OpenCode.ai for open-source AI coding assistance, and Orchid.ai for personal organization.
Z.ai (formerly Zhipu AI) has launched GLM-5.2, a flagship Mixture-of-Experts (MoE) large language model under an MIT open-source license. The model boasts 744 billion total parameters (40 billion active per inference call) and supports a native 1-million-token context window designed specifically to handle complex coding and agentic workflows. To optimize efficiency, GLM-5.2 introduces "IndexShare" to reuse indexers across sparse attention layers—slashing per-token FLOPs by 2.9×—and incorporates an improved Multi-Token Prediction (MTP) layer for faster speculative decoding. Developers can run the model locally using quantized weights or access it via API platforms.
Anthropic has detailed seven steering methods to customize and optimize context in its command-line developer tool, Claude Code. These mechanisms, which include path-scoped rules, custom skills, and subagents, help developers manage context bloat and improve code generation accuracy.
Built by a solo refugee founder in Helsinki, Pixlie is an AI video studio that aims to give creators granular control over their video generation workflows rather than acting as a black-box generator. It features text-to-video and image-to-video capabilities, a cloud queue for rendering jobs, and a unified library across its iOS, Android, and web platforms (pixlys.com).
Foyer turns ambient background noise into an interactive spatial environment where users can place and pan sound orbs on a canvas. Designed for unobtrusive productivity, the interface folds neatly into the MacBook notch while working.
Inksightful is an iOS app that digitizes physical notebooks and diaries by leveraging AI handwriting recognition to make them fully searchable. Users can search for specific names and memories, with original scans kept linked to the text for verification.
Slack has introduced a native Model Context Protocol (MCP) client within Slackbot, enabling users to command the assistant to perform actions across over 20 enterprise applications using natural language. This native integration lets teams share results directly in channels, transforming solo browser-based AI tasks into collaborative workflows.
UISqueezy is a platform designed to maintain synchronization between your codebase's design tokens and Figma variables. It automatically pushes changes in color, typography, spacing, radius, and other tokens directly to Figma as native variables, and pulls updates from Figma back into your code, providing a single source of truth without drift or copy-pasting.
Basedash has launched group and access control capabilities for managing permissions across internal teams, leadership, and external clients. Administrators can now enforce row-level security and set distinct AI contexts per group to ensure the AI assistant provides relevant answers based on specific permissions.
Cutio is an AI-driven tool that automatically detects and skips sponsor reads and self-promotion across various YouTube creators, topics, and languages. It functions in a web browser or on a paired TV, leveraging a shared cache to instantly skip segments in previously analyzed videos.
Matt from ElevenLabs demonstrated the cross-lingual capabilities of ElevenLabs Music v2, showcasing tracks that blend genres like Britpop and Samba with foreign languages. The demonstration highlights the model's ability to maintain musical style coherence while singing fluently across multiple target languages.
RAGHub is a community-driven directory and open-source repository designed to catalog emerging Retrieval-Augmented Generation (RAG) frameworks, evaluation tools, optimization engines, and learning resources. Created to help developers navigate a rapidly evolving and noisy ecosystem, the directory compares projects by use case, complexity, integration, language, and scale, serving as a centralized "living document" maintained in collaboration with the r/RAG community.
Ploy is an AI-powered marketing platform that autonomously runs and optimizes websites by ingesting design systems, brand voices, and components. The platform connects directly to analytics, CRM, and search data to design pages, write copy, and run campaigns in the background.
agent-native is an open-source framework created by Builder.io that provides the essential primitives to build applications natively designed for both human and AI agent interaction. Rather than treating AI as a superficial chatbot wrapper, the framework establishes a unified application model where agents and users interact with the same underlying state, data, actions, and permissions. It offers SQL-backed state synchronization, support for modular tools or skills, robust memory management, and observability to monitor agent behavior. This architecture aims to make agentic SaaS interactions more reliable, deterministic, and deeply integrated.
Z.ai has launched GLM-5.2, a flagship open-weights large language model featuring a 1-million-token context window for long-horizon coding and agentic workflows. Early testing compares its performance favorably to proprietary frontier models like Claude Opus 4.8 and GPT-5.5, achieved via an efficient 'IndexShare' architecture.

Github Awesome

Theo - t3․gg

AI Revolution

OpenAI

OpenAI

OpenAI

WorldofAI

Better Stack

DesignCourse

Cole Medin

Discover AI