neuropt tunes hyperparameters from full training curves

// 81d agoPRODUCT LAUNCH

neuropt tunes hyperparameters from full training curves

neuropt is an open-source hyperparameter optimization package that sends per-epoch training and validation curves to an LLM after each trial, then uses that reasoning to propose the next configuration. It supports PyTorch, XGBoost, and scikit-learn, auto-detects tunable PyTorch parameters and layers, and claims small-budget benchmark wins over Optuna TPE and random search on FashionMNIST and Covertype.

// ANALYSIS

This is a smart and timely idea, especially for expensive training runs where the learning curve tells you far more than the last metric ever will.

–The curve-aware loop is the real differentiator here; it should be most useful when early stopping signals, instability, or wasted epochs matter.
–Auto-detecting tunables in PyTorch lowers adoption friction a lot, which is often what decides whether a tool gets tried at all.
–The benchmark claim is interesting, but I’d want to see how much of the lift comes from the LLM vs. from simply having richer signals and a better trial-selection workflow.
–Main risk: prompt variance and reproducibility. If the suggestions are sensitive to wording or model choice, it may be harder to trust in serious tuning workflows.

// TAGS

hyperparameter optimizationllmpytorchxgboostscikit-learnmachine learningopen source

DISCOVERED

81d ago

2026-03-21

PUBLISHED

81d ago

2026-03-20

RELEVANCE

8/ 10

AUTHOR

dloevlie

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

MODEL19m ago

Anthropic releases public Claude Mythos model

Anthropic has publicly released a modified version of its frontier AI model, Claude Mythos, under the name Claude Fable 5. The new public version incorporates safety guardrails to restrict offensive cyber capabilities while the unrestricted model remains limited to vetted partners.

MODEL22m ago

Anthropic launches Claude Fable 5

Anthropic has launched Claude Fable 5, a new "Mythos-class" model designed for complex agentic workflows, software engineering, and research synthesis. The model is available via the Claude API, subscription plans, and cloud platforms, with safety guardrails that fallback to Claude Opus for risky queries.

UPDATE30m ago

Vercel v0 adds /improve via Claude Fable 5

Vercel has integrated a new /improve command into its generative UI design tool, v0, to let users leverage Anthropic's new Claude Fable 5 reasoning model. The feature allows developers to invoke the model's advanced reasoning capabilities to iterate, polish, and optimize generated UI code.

neuropt tunes hyperparameters from full training curves