GraphZero v0.2 brings zero-copy GNN training via mmap

// 120d agoOPENSOURCE RELEASE

GraphZero v0.2 brings zero-copy GNN training via mmap

GraphZero is an open-source C++ graph engine that lets you train Graph Neural Networks on 50GB+ datasets on a 16GB laptop by memory-mapping data directly from NVMe instead of loading it into RAM. Version 0.2 adds Node2Vec biased walks and a FeatureStore, with PyPI install and nanobind-powered zero-copy NumPy/PyTorch interop.

// ANALYSIS

The "OOM wall" in GNN training is a real and underserved problem — PyG and DGL assume you have the RAM, and if you don't, you crash. GraphZero's mmap approach is the right systems-level answer.

–Zero-copy via mmap means OS page faults do the heavy lifting — only touched 4KB blocks land in RAM, dropping peak usage from 24GB+ to ~5GB on ogbn-papers100M
–Custom CSR binary format (`.gl`) achieves ~60% compression over raw CSV topology; columnar `.gd` maps directly to PyTorch tensors with zero allocation overhead
–OpenMP neighbor sampling releases the Python GIL, enabling true parallelism across disk I/O, CPU sampling, and GPU compute
–Currently scoped to data loading and sampling, not training itself — a deliberate narrow focus that keeps the library composable with any training loop
–Very early (7 GitHub stars, v0.2), but the roadmap toward dynamic graph updates and ACID multi-process safety signals serious long-term ambition

// TAGS

graphzeroopen-sourcellmdevtooldata-toolsinference

DISCOVERED

120d ago

2026-03-15

PUBLISHED

120d ago

2026-03-15

RELEVANCE

5/ 10

AUTHOR

Important-Trash-4868

// KEEP READING

More AI developer news from the feed

EXPLORE FULL FEED

UPDATE1h ago

Grok Build adds multiline input, scrolling

SpaceXAI has released Grok Build versions 0.2.99 and 0.2.98, introducing multiline input and terminal scrolling for its terminal-based AI coding assistant. The updates allow users to input complex prompts directly on the dashboard and scroll through chat histories using PageUp and PageDown.

INFRA1h ago

GLM-5 runs natively on Ascend via FlagOS

Zhipu AI's GLM-5 has been packaged for native execution on Huawei Ascend NPUs using the FlagOS framework, representing the first CUDA-free deployment of a Chinese general-purpose LLM on domestic hardware. This integration satisfies local sovereignty requirements across hardware, model, and inference runtime in a single package.

INFRA2h ago

Alchemy enables declarative agentic infrastructure

Sam Goodwin shared a declarative workflow for constructing agentic infrastructure using Alchemy, combining English prompts and TypeScript code in a single TypeScript file. By utilizing string template literals and a simple alchemy deploy command, developers can deploy applications directly to the cloud without manual environment setup.