cuLA ships linear-attention CUDA kernels for Hopper, Blackwell
YT · YOUTUBE // 5d ago · OPEN-SOURCE RELEASE


cuLA is a low-level open-source repository of high-performance CUDA kernels for linear attention variants, implemented in CuTe DSL and CUTLASS C++. The project targets NVIDIA Hopper and Blackwell GPUs, and is designed to slot into flash-linear-attention with a minimal import change. The repository is explicitly early-stage, but it already positions itself as a specialized kernel layer for KDA, GLA-style, and related linear-attention workloads.
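To make the workload concrete: linear-attention kernels like these accelerate a recurrence that replaces the quadratic attention matrix with a running key-value state. The sketch below is an illustrative NumPy reference of that recurrence under common simplifying assumptions (single head, no normalization); it is not cuLA's API, and the function name `linear_attention` is ours.

```python
import numpy as np

def linear_attention(q, k, v):
    """Causal linear attention via the recurrent state form.

    Instead of materializing the (T, T) score matrix, keep a running
    (d_k, d_v) state S_t = S_{t-1} + k_t v_t^T and read out o_t = S_t^T q_t.
    Per-step memory is O(d_k * d_v), independent of sequence length T.
    """
    T, d_k = q.shape
    d_v = v.shape[1]
    S = np.zeros((d_k, d_v))
    out = np.empty((T, d_v))
    for t in range(T):
        S += np.outer(k[t], v[t])   # accumulate key-value outer products
        out[t] = S.T @ q[t]         # read the state out against the query
    return out

# Sanity check: the recurrence matches the quadratic causal form
# (q k^T masked to the lower triangle, no softmax) applied to v.
rng = np.random.default_rng(0)
q, k, v = rng.normal(size=(3, 8, 4))
assert np.allclose(linear_attention(q, k, v), np.tril(q @ k.T) @ v)
```

The CUDA kernels' job is to compute this same recurrence in chunked, tensor-core-friendly form on Hopper and Blackwell; the loop above is only the mathematical specification.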

// ANALYSIS

Hot take: this is infrastructure-first work, not a user-facing app, but it matters if you care about squeezing real performance out of long-context attention on recent NVIDIA hardware.

  • The technical angle is strong: CuTe DSL plus CUTLASS suggests the repo is built for hardware-aware kernel tuning rather than portability theater.
  • The positioning is clear: cuLA is meant to complement flash-linear-attention, which lowers adoption friction for teams already in that ecosystem.
  • The scope is narrow but credible: support for Hopper and Blackwell, with explicit mention of linear-attention variants like KDA and gating/delta-style methods.
  • The main risk is maturity: the repo itself says it is early-stage and that APIs and kernels may still change.
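On the GLA-style and gating variants mentioned above: these extend the plain linear-attention recurrence with a data-dependent decay applied to the state before each update. A minimal NumPy sketch, assuming per-key-channel gates in (0, 1) and no normalization (again an illustrative reference, not cuLA's kernel interface):

```python
import numpy as np

def gated_linear_attention(q, k, v, g):
    """GLA-style recurrence: decay the state, then accumulate.

    g[t] holds per-key-channel gates in (0, 1); with g fixed at 1
    this collapses to plain (ungated) linear attention.
    """
    T, d_k = q.shape
    d_v = v.shape[1]
    S = np.zeros((d_k, d_v))
    out = np.empty((T, d_v))
    for t in range(T):
        S = g[t][:, None] * S + np.outer(k[t], v[t])  # gated decay, then update
        out[t] = S.T @ q[t]
    return out

# With all-ones gates the gated form reduces to the ungated causal form.
rng = np.random.default_rng(1)
q, k, v = rng.normal(size=(3, 8, 4))
ones = np.ones((8, 4))
plain = np.tril(q @ k.T) @ v
assert np.allclose(gated_linear_attention(q, k, v, ones), plain)
```

The multiplicative decay is what makes these variants hardware-sensitive: a naive loop like this serializes over time, so efficient kernels recast it into chunked matrix multiplications, which is exactly the kind of work a CuTe/CUTLASS layer targets.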
// TAGS
cuda · linear-attention · cutlass · cute-dsl · nvidia · hopper · blackwell · open-source · gpu-kernels · attention

DISCOVERED

2026-04-06 (5d ago)

PUBLISHED

2026-04-06 (5d ago)

RELEVANCE

9/10

AUTHOR

Github Awesome