Researchers introduce HORMA, a Hierarchical Organize-and-Retrieve Memory Agent that organizes working memory into a file-system-like hierarchy to reduce token usage and latency in long-horizon tasks.

// 45d ago RESEARCH PAPER


HORMA (Hierarchical Organize-and-Retrieve Memory Agent) addresses the challenges LLM agents face in long-horizon tasks, such as context overload and loss of temporal structure. It structures the agent's working memory into a file-system-like workspace where raw interaction trajectories are organized into semantically structured, linked notes using file-system operations. A lightweight retrieval policy trained via reinforcement learning then navigates this hierarchy to extract minimal sufficient context for the current task. Across benchmarks like ALFWorld, LoCoMo, and LongMemEval, HORMA demonstrates superior efficiency-performance trade-offs, reducing token consumption in long conversations to as low as 22% of baseline usage.
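To make the organize-then-retrieve idea concrete, here is a minimal Python sketch of a file-system-like memory workspace with linked notes and a budgeted lookup. It is an illustration only, not HORMA's implementation: `Workspace`, `Note`, `retrieve`, the example paths, and the word-count "token" estimate are all hypothetical, and a simple keyword-overlap score stands in for the RL-trained retrieval policy described above.

```python
from dataclasses import dataclass, field


@dataclass
class Note:
    """A note in the workspace: organized text plus links to related notes."""
    path: str
    text: str
    links: list = field(default_factory=list)


class Workspace:
    """Hypothetical file-system-like memory: absolute paths map to notes."""

    def __init__(self):
        self.notes = {}

    def write(self, path, text, links=()):
        self.notes[path] = Note(path, text, list(links))

    def listdir(self, prefix):
        """Return the immediate children (directories or notes) under a prefix."""
        prefix = prefix.rstrip("/") + "/"
        children = set()
        for path in self.notes:
            if path.startswith(prefix):
                head = path[len(prefix):].split("/", 1)[0]
                children.add(prefix + head)
        return sorted(children)


def score(ws, prefix, query):
    """Keyword overlap between the query and everything under a path.

    Assumption: this heuristic is a stand-in for a learned navigation policy.
    """
    texts = [n.text for p, n in ws.notes.items()
             if p == prefix or p.startswith(prefix + "/")]
    haystack = (prefix + " " + " ".join(texts)).lower().replace("/", " ")
    return len(set(query.lower().split()) & set(haystack.split()))


def retrieve(ws, query, budget_tokens):
    """Descend the hierarchy greedily, then follow links within a token budget."""
    current = ""
    while current not in ws.notes:
        children = ws.listdir(current)
        if not children:
            return []
        current = max(children, key=lambda c: score(ws, c, query))

    selected, used, seen = [], 0, set()
    queue = [current]
    while queue:
        path = queue.pop(0)
        if path in seen or path not in ws.notes:
            continue
        seen.add(path)
        note = ws.notes[path]
        cost = len(note.text.split())  # crude token estimate: one word = one token
        if used + cost > budget_tokens:
            continue
        selected.append(path)
        used += cost
        queue.extend(note.links)
    return selected


ws = Workspace()
ws.write("/kitchen/mug.md", "mug is in the cabinet above the sink",
         links=["/kitchen/sink.md"])
ws.write("/kitchen/sink.md", "sink is next to the stove")
ws.write("/bedroom/lamp.md", "lamp is on the desk")

print(retrieve(ws, "where is the mug", budget_tokens=20))
print(retrieve(ws, "where is the mug", budget_tokens=10))
```

With the larger budget the lookup returns the mug note and its linked sink note; with the smaller budget it returns only the mug note. The point of the sketch is that a tighter budget shrinks the context handed to the agent rather than the memory itself.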

// ANALYSIS

Hierarchical workspaces are the future of complex agentic reasoning, moving beyond simple vector similarity search to structured, stateful memory management.

* The file-system-like abstraction mirrors how people organize files and projects, letting agents manage memory with standard CRUD-like operations.

* Using RL to train a navigation policy helps avoid retrieving massive chunks of unnecessary context, directly targeting the latency and cost bottlenecks of long context windows.

* The system constructs memories dynamically using a skill acquisition process, indicating a push towards self-improving agents.

* Cutting token usage on long conversations to as low as 22% of baseline (a reduction of up to 78%) makes complex agent deployment significantly more economically viable.
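The token figures are two views of the same number: usage at 22% of baseline is a 78% reduction. The short Python sketch below works through that arithmetic. The baseline of 100,000 tokens and the price of 3.0 per million tokens are hypothetical inputs chosen for illustration, not values reported for HORMA.

```python
def token_savings(baseline_tokens, usage_ratio, price_per_million):
    """Convert a usage ratio (fraction of baseline) into tokens and cost saved."""
    used = baseline_tokens * usage_ratio
    saved = baseline_tokens - used
    return {
        "tokens_used": round(used),
        "reduction_pct": round((1 - usage_ratio) * 100, 1),
        "cost_saved": round(saved / 1_000_000 * price_per_million, 4),
    }


# Hypothetical conversation: 100k baseline tokens, priced at 3.0 per million.
result = token_savings(baseline_tokens=100_000, usage_ratio=0.22,
                       price_per_million=3.0)
print(result)
```

Because 22% is the best case reported ("as low as"), the savings computed this way are an upper bound for any given conversation.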

// TAGS

`["llm-agents", "memory-systems", "reinforcement-learning", "token-efficiency", "hierarchical-memory"]` → `["llm-agents", "hierarchical-memory"]`

DISCOVERED

45d ago

2026-06-12

PUBLISHED

45d ago

2026-06-12

RELEVANCE

8/10

AUTHOR

Discover AI
