OPEN_SOURCE
HN · HACKER_NEWS // 12d ago · TUTORIAL
HJB Tutorial Bridges RL, Diffusion Models
Daniel Lopez Montero's post presents the Hamilton-Jacobi-Bellman equation as the continuous-time counterpart of Bellman's optimality equation, then walks through policy iteration, model-free continuous-time Q-learning, and two benchmark problems: stochastic LQR and the Merton portfolio. It closes by showing how reverse-time diffusion sampling can be reframed as a control problem in which the score function acts as the optimal drift correction.
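For reference, the two standard equations the summary alludes to, in their textbook forms (not quoted from the post): the finite-horizon stochastic HJB equation, and Anderson's reverse-time SDE in which the score supplies the drift correction.

```latex
% Finite-horizon stochastic HJB for dynamics dx = f(x,u)\,dt + \sigma\,dW_t
% with running cost \ell(x,u) and value function V(t,x):
-\partial_t V(t,x) = \min_u \Big[ \ell(x,u) + \nabla_x V(t,x)^{\top} f(x,u)
    + \tfrac{1}{2}\,\mathrm{Tr}\big(\sigma \sigma^{\top} \nabla_x^2 V(t,x)\big) \Big]

% Reverse-time sampling SDE: the score \nabla_x \log p_t(x) appears as the
% drift correction that the post recasts as an optimal control:
dx = \big[ f(x,t) - g(t)^2\, \nabla_x \log p_t(x) \big]\,dt + g(t)\,d\bar{W}_t
```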
// ANALYSIS
This is the rare theory-heavy AI tutorial that earns its length: it gives one clean control-theoretic frame for continuous-time RL and diffusion models, which makes both topics feel like different views of the same math.
- The LQR and Merton examples are the right validation cases because they have closed-form optima and let the neural policy-iteration setup prove itself.
- The diffusion section is the most interesting part: reverse-time sampling becomes a finite-horizon control problem, and the score function emerges as the optimal drift correction.
- The post assumes comfort with SDEs, PDEs, and convex duality, so it is more of an advanced bridge piece than a beginner-friendly walkthrough.
- HN traction suggests there is still a hungry audience for rigorous AI math when it pays off with a unifying story.
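The closed-form LQR baseline mentioned above is easy to sanity-check. The sketch below uses a deterministic scalar LQR (the post's benchmark is the stochastic version, and the coefficient values here are arbitrary, not taken from the post) to show the HJB/Riccati mechanics: solve the Riccati equation for the quadratic value function, derive the linear optimal control, and confirm the HJB residual vanishes.

```python
import math

# Hypothetical toy instance of scalar deterministic infinite-horizon LQR.
# Dynamics: dx/dt = a*x + b*u; cost: integral of q*x^2 + r*u^2.
a, b, q, r = -0.5, 1.0, 1.0, 0.1

# Algebraic Riccati equation for V(x) = p*x^2:  b^2*p^2/r - 2*a*p - q = 0,
# whose positive root is:
p = (r / b**2) * (a + math.sqrt(a**2 + b**2 * q / r))

def optimal_u(x):
    # First-order condition of the HJB minimization: u* = -(b*p/r) * x
    return -(b * p / r) * x

def hjb_residual(x):
    # HJB: 0 = min_u [ q*x^2 + r*u^2 + V'(x)*(a*x + b*u) ],  V'(x) = 2*p*x.
    # At the optimal u this should be zero (up to floating point).
    u = optimal_u(x)
    return q * x**2 + r * u**2 + 2 * p * x * (a * x + b * u)

for x in (0.5, 1.0, 2.0):
    print(f"x={x}: HJB residual = {hjb_residual(x):.2e}")
```

A neural policy-iteration scheme like the post's can be validated the same way: compare its learned value and policy against `p` and `optimal_u`, which are exact for this class of problems.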
// TAGS
continuous-rl · research · agent
DISCOVERED
12d ago (2026-03-30)
PUBLISHED
13d ago (2026-03-30)
RELEVANCE
7/10
AUTHOR
sebzuddas