> ▌
Markdown sits near the point where human readability and machine readability meet. HTML adds a rendering layer where humans and agents can stop seeing the same artifact.

Two Minute Papers

Syntax

WorldofAI

Income stream surfers

OpenAI

Github Awesome

Discover AI

AICodeKing

Theo - t3․gg

Eric Michaud

DIY Smart Code
x.ai has announced the integration of the /goal command into Grok Build to enable the autonomous execution of long-running tasks. The new feature orchestrates a series of subagents that handle planning, implementation, and verification until tasks are fully tested.
After five years at Google Cloud, the author is moving to Google DeepMind to join Logan Kilpatrick's team. Their primary focus will be on enhancing the Gemini API and AI Studio, aiming to make them the premier tools for developers building with AI.
Raycast has updated its iOS application to support AI Extensions and Model Context Protocol (MCP) servers. This update allows users to connect local device data like Apple Health, Calendar, and Reminders, or link custom external MCP servers, bringing powerful desktop-class AI workflows and integrations directly to mobile devices.
Researchers identified AGENTS.md poisoning, a vulnerability targeting AI coding agents via malicious instructions injected into repository configuration files. This exploit allows attackers to manipulate agent behavior, potentially resulting in remote code execution and credential exfiltration.
Mastra has launched pre-built, runnable templates for its open-source TypeScript AI framework, allowing developers to quickly create projects equipped with agents, tools, integrations, and observability. In tandem with these templates, Mastra introduced auto-deployment capabilities directly from GitHub, bringing a streamlined, cloud-hosting workflow to the agentic ecosystem.
Designer Meng To shared his AI prompting workflow, which starts with a screenshot and uses Wispr Flow voice dictation for context. The resulting prompts are then executed and tested using browser automation tools like Browser Use.
Google has launched the Google TV Streamer, a new media streaming device powered by Google TV and integrated with Gemini AI. Designed to replace Chromecast, the new hardware promises a faster, premium streaming experience and leverages AI to deliver highly personalized content recommendations directly on the home screen, helping users spend less time searching and more time watching.
Nous Research's autonomous AI agent framework, Hermes Agent, has officially reached the major milestone of 200,000 GitHub stars. Unlike temporary, session-based AI assistants, Hermes Agent is designed as a persistent, long-running personal assistant that runs on user-controlled infrastructure (local hardware, VPS, or cloud platforms). The project has seen rapid adoption due to its unique self-improving loop, which generates and refines reusable skill files, and its broad integration with open-weights models and chat applications like Telegram and Slack.
SpaceX has signed a compute agreement worth up to $6.3 billion to lease Nvidia GB300 capacity at its Colossus data center to open-source AI startup Reflection AI. Following similar compute deals with Anthropic, Google, and Cursor, the contract highlights SpaceX's aggressive expansion into the commercial AI infrastructure market.
Developers can now use GLM 5.2 with LangChain's DeepAgents code assistant, dcode. As a general-purpose, provider-agnostic terminal tool, dcode allows developers to run interactive agentic coding workflows with their choice of large language models.
Mario Zechner (@badlogicgames) announced on X that the next release of pi-ai, the unified multi-provider LLM API package in the Pi agent toolkit monorepo, will contain breaking changes. The toolkit, hosted at pi.dev, provides a modular foundation for building AI agents, where pi-ai handles low-level LLM provider communication (OpenAI, Anthropic, Google, etc.). Zechner clarified that the upcoming breaking changes specifically target the standalone pi-ai package rather than the end-user pi-coding-agent CLI.
Claude Code's local session logs encrypt model reasoning rather than providing authentic thought processes, leaving standard users with only a summary. Full access to the raw reasoning requires an enterprise agreement, frustrating developers trying to build reliable audit trails.
Nub is an all-in-one developer toolkit for Node.js built as a single Rust binary. Unlike runtimes like Bun or Deno, Nub sits on top of your existing Node.js environment, using the oxc compiler to transpile TypeScript in-memory and execute it directly without any build steps. It combines a fast script runner, a pnpm-compatible package manager, a high-performance alternative to npx called nubx, a watch mode, and a built-in Node version manager into a unified toolchain.
Agent Apprenticeship is an open-source ecosystem designed to facilitate cooperative learning between AI agents. Created by Forsy-AI, the project utilizes a mentor-apprentice structure where less capable "apprentice" agents improve by executing iterative workflow loops under the guidance of expert "mentor" agents. The platform captures detailed workflow traces containing context, tool invocations, reasoning steps, and outcomes, which are treated as reusable experience assets rather than static prompts. By leveraging standardized "Trace Skills" and offering compatibility with major agent frameworks like Hermes Agent, Claude Code, and Cursor, Agent Apprenticeship establishes a collaborative network for exchanging training signals and collective knowledge, while also integrating tools to evaluate the economic ROI of agent-driven tasks.
ORG2 is an open-source, Cursor-style desktop AI agent IDE under 100 megabytes on disk built with Rust and Tauri. It treats AI agents as persistent, observable colleagues in a structured organization rather than stateless assistants. Key features include replayable execution traces, cross-session memory, and an AI blame tool to track agent changes. The local-first platform supports GUI, CLI, terminal, Git, browser, and LSP integrations to improve collaboration between humans and AI agents.
$HIVE's AI infrastructure located in Paraguay has had its performance validated by a study conducted at Columbia University. The findings from this research are significant enough to be presented at the upcoming Conference on Neural Information Processing Systems (NeurIPS), a major AI research conference.
A recent study conducted by Columbia University has validated the performance of $HIVE's AI infrastructure located in Paraguay. The research findings resulting from this validation are scheduled to be presented at the Conference on Neural Information Processing Systems (NeurIPS).
The creator of VulcanBench has introduced a new cost estimation feature that calculates the expected cost of running a benchmark before it begins. This update was inspired by a near-miss where a planned benchmark run involving Cursor and Composer 2.5 would have unexpectedly cost over $500.
Moebius is an efficient lightweight image inpainting framework that overcomes the computational costs of massive 10B-level models like FLUX.1-Fill-Dev. By operating entirely in the latent space with a novel distillation strategy, it achieves high-fidelity alignment and over 15x faster inference using less than 2% of the parameters.
Mitchell Hashimoto has pledged an additional $400,000 to the Zig Software Foundation, bringing his total financial support to $700,000 since his initial donation in 2024. He expressed his continued respect for Zig's technical progress, community, and commitment to software quality. In light of recent discussions around Zig's strict no-LLM policy and Bun's subsequent fork, Hashimoto emphasized the importance of empathy and defended the right of open-source projects to establish their own unique cultures and boundaries, even if he personally uses and advocates for AI tools.
Graphify is an open-source optimization tool that addresses the high token consumption of AI coding assistants like Claude Code. Instead of repeatedly uploading the entire codebase for context during each prompt, Graphify uses tree-sitter to parse project files locally and generate a structural knowledge graph. Claude Code can then query this graph to understand relationships, dependencies, and key concepts, resulting in a reported 60% reduction in token costs while maintaining deep codebase context.
Dax Raad teased a major visual redesign and user interface update for the OpenCode Console, an open-source AI coding agent. The upcoming update focuses on improving the console's aesthetics and usability to enhance the overall developer workflow.
Anthropic-Cybersecurity-Skills is an open-source repository containing 754 structured, AI-native cybersecurity skills. Following the agentskills.io standard, each skill mimics the decision-making process of a senior analyst by defining prerequisites, step-by-step execution guides, and verification methods in YAML and Markdown format. These skills are mapped to five leading industry frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND, and NIST AI RMF. They are fully compatible with over 20 agent platforms and tools, including Claude Code, GitHub Copilot, Codex CLI, Cursor, and Gemini CLI.
Vercel has added support for Sakana AI's Fugu Ultra (`sakana/fugu-ultra`) model to its AI Gateway. Fugu Ultra is a multi-agent orchestration system that routes queries to a pool of specialized models to handle complex reasoning, coding, and analysis tasks. Developers can access this orchestrator using Vercel's unified API and monitor metrics like pricing, performance, and usage directly in their Vercel dashboard.
Pi v0.79.10 introduces key updates to the open-source terminal coding agent to improve extensibility and reliability. The `session_before_compact` and `session_compact` extension events now include `reason` and `willRetry` attributes, allowing developers to programmatically differentiate between manual, threshold-based, and overflow-triggered compactions. Furthermore, this version fixes bugs related to the `find` tool respecting ignored nested git repositories, resolves an issue with inherited OpenAI-compatible streaming where encrypted reasoning details arrived before tool calls, and stabilizes transient UI messages during session replacement.
Creator Peter Steinberger shared that despite the initial hype dying down, OpenClaw has improved quality, expanded its team, and registered its strongest week of adoption so far. Steinberger highlights the project's transition to a non-profit foundation, contrasting its mission with venture-backed competitors that prioritize commercial interests.
A head-to-head evaluation prompting GLM-5.2 and Claude Opus to build a 3D WebGL platformer from scratch showed Opus completing the task in half the time with fewer bugs. While GLM-5.2 is a cost-effective open-weights alternative, the test highlighted the advantage of Opus's multimodal capabilities in using screenshots to self-correct visual bugs.
HAQQ Legal AI has launched a mobile application to provide users with accessible legal understanding directly from their phones. The app enables users to upload contracts, ask legal questions, and receive structured legal work with risk flags and exportable outputs, utilizing the Justinian® legal reasoning engine instead of generic AI chat.
Skybridge is an open-source React framework that abstracts infrastructure complexities for building Model Context Protocol (MCP) apps. It handles server setup, view rendering, and client compatibility so developers can build interactive tools for AI assistants.
readywhen integrates with tools like Slack, email, meetings, and documents to automatically catch decisions and commitments as they happen. Instead of requiring you to manually track tasks, the AI proactively drafts the required next steps—such as project briefs, email replies, or status updates—which you simply review and approve, effectively serving as a 24/7 administrative assistant for busy executives and team leaders.
LeadDelta has updated its LinkedIn CRM by integrating dedicated AI agents to optimize professional networking and messaging workflows. The workspace pools collective team connections to map optimal warm introduction paths while its new AI-powered inbox auto-drafts replies and applies conversational tags.
Photoroom API is an image editing API tailored for marketplaces and enterprise brands. Processing billions of images a year, it enables fast, consistent generation of raw uploads to polished hero shots. With features like background removal, relighting, and AI-generated backgrounds, it easily integrates into commerce workflows via a REST API, ensuring high scalability and enterprise-grade data security where images are never used to train models.
AlgoFly AI is a comprehensive data platform designed for computer vision and machine learning teams that prioritize data privacy. It offers tools for managing datasets, annotation workflows, quality control, and AI operations, all while keeping sensitive data within the user's own infrastructure rather than relying on third-party cloud platforms.
Every time you query AI models like ChatGPT or Claude, there is typically a 3 to 8-second waiting period while the model generates its response. uwait is a browser extension that displays curated advertisements during this loading screen. The revenue generated from these impressions is split: 50% goes to the user, 30% goes to the publishers whose content trained the AI, and 20% goes to uwait. The extension focuses on showing relevant brands during a moment when users are actively paying attention to the screen.
Alai 2.0 is an AI-powered design platform that captures a brand's specific details into a custom design system, enabling users to create professional, on-brand presentations, social posts, and ads. It offers a unique slide-by-slide generation approach for precise manual or AI-assisted editing, and notably allows users to select their preferred underlying AI models to optimize for cost, quality, or latency.
Selector Forge is a browser extension designed to help developers create reliable CSS and XPath selectors using AI. It can generate selectors for individual elements or arrays of elements, focusing on creating "semantic" selectors that are more resistant to breaking when page structures change. The project is open-source and its code is available on GitHub.
Sakana AI has launched Sakana Fugu and its high-performance variant, Fugu Ultra, transitioning the multi-agent orchestration system from beta to full commercial availability. Operating via a single OpenAI-compatible API, Fugu dynamically coordinates tasks across a pool of diverse frontier models to handle complex reasoning while helping developers avoid single-vendor lock-in.
Solo founder @bfzli has successfully sold ClawHost on TrustMRR, marking the marketplace's 112th acquisition and the first-ever exit for an OpenClaw wrapper. ClawHost is a managed hosting service designed to simplify the deployment of OpenClaw, a viral, open-source AI agent framework that is technically complex to set up. Despite market saturation concerns following a wave of wrapper launches in early 2026, the acquisition highlights the viable monetization and exit pathways for micro-SaaS projects addressing setup friction for popular open-source tools.
During an SF rooftop conversation about running trading agents, an ex-Anthropic engineer challenged the popular paradigm of treating Claude as a mere chatbot interface. Instead, the engineer demonstrated that Claude functions as a command-line runtime environment designed to be paired directly with repositories. This repository-native approach allows the AI agent to interact with local code, git, and external tools directly, paving the way for multi-step autonomous workflows.
Deno Desktop, arriving in Deno v2.9.0, allows developers to package web framework projects like Next.js, Astro, and Fresh into single-binary desktop applications. It supports native system WebView for ultra-small file sizes or bundled Chromium (CEF) for rendering consistency, and introduces a Deno.BrowserWindow API that provides low-overhead, in-process backend-to-UI communication without standard IPC bottlenecks.
Cursor community developer Tibor Tee retweeted a meme from prominent developer and power user Anthony Kroeger (@kr0der) highlighting anticipation for the launch of Cursor Composer 3. While the team has not set a formal release date for the next major iteration of their multi-file editing agent, community excitement remains high following teasers of a custom foundation model trained with 10x more compute, building upon the capabilities of the current Composer 2.5 release.
Anysphere developer Tibor Tee announced that Composer 2.5, the agentic model powering the Cursor code editor, has been trained to be more pleasant to collaborate with. While emphasizing that pleasantness is crucial for real-world usefulness, he noted the lack of existing benchmarks for this quality and questioned what a pleasantness benchmark for AI agents would look like.

Github Awesome

AI Revolution

Rob The AI Guy

DIY Smart Code

Discover AI

DIY Smart Code

AICodeKing

Better Stack

DIY Smart Code