Dual-engine AI music detector survives MP3

// 106d ago | BENCHMARK RESULT

A Reddit project pairs a ResNet18 mel-spectrogram classifier with Demucs-based stem separation and reconstruction to spot AI-generated music. It keeps working on MP3, AAC, and OGG, where the CNN alone breaks down, and the author reports about 1.1% human false positives with 80%+ AI detection.

// ANALYSIS

The clever part isn’t just stacking two models; it’s the cheap confidence gate, which runs the expensive separation pass only when the classifier is unsure. That makes the system feel more production-shaped than a single end-to-end detector, even if the edge cases are still messy.
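A minimal sketch of such a confidence gate, assuming both engines return a probability-like score that a track is AI-generated. The names `gated_detect`, `cnn_score`, and `separation_score`, and the 0.2 / 0.8 / 0.5 thresholds, are hypothetical placeholders and not taken from the project:

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Verdict:
    label: str             # "ai" or "human"
    score: float           # score from whichever engine made the final call
    used_separation: bool  # True if the expensive pass was needed


def gated_detect(
    track: object,
    cnn_score: Callable[[object], float],         # cheap classifier (assumed interface)
    separation_score: Callable[[object], float],  # costly separation check (assumed interface)
    low: float = 0.2,   # hypothetical "confidently human" threshold
    high: float = 0.8,  # hypothetical "confidently AI" threshold
) -> Verdict:
    p = cnn_score(track)
    # Confident either way: trust the cheap classifier and skip separation.
    if p <= low or p >= high:
        return Verdict("ai" if p >= high else "human", p, False)
    # Uncertain band: pay for the separation-based second opinion.
    s = separation_score(track)
    return Verdict("ai" if s >= 0.5 else "human", s, True)
```

With stub scorers, `gated_detect("song.mp3", lambda t: 0.95, slow_check)` returns without ever calling `slow_check`, which is where the compute saving would come from.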

– Mel-spectrogram CNNs can look strong on WAV and then lose the signal once lossy compression strips the artifacts they learned.
– Demucs adds a different hypothesis: human recordings leak across stems, while fully synthetic tracks tend to reconstruct too cleanly after separation and remixing.
– The compute tradeoff is sensible, because source separation is costly and shouldn’t run on every track if the CNN already has high confidence.
– The biggest risk is generalization: different generators, mastering chains, and Demucs nondeterminism can all move borderline samples around.
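One way to put a number on "reconstructs too cleanly" is the relative energy left over after summing the separated stems and subtracting the result from the original mix. This is an illustrative sketch only: the post does not specify the project's actual metric, `reconstruction_residual` is a hypothetical helper, and it takes already-separated stems as plain sample lists rather than calling Demucs itself.

```python
from typing import Sequence


def reconstruction_residual(
    mixture: Sequence[float],
    stems: Sequence[Sequence[float]],
) -> float:
    """Relative energy of (mixture - sum of stems); 0.0 means a perfect remix."""
    if any(len(s) != len(mixture) for s in stems):
        raise ValueError("every stem must have the same length as the mixture")
    # Remix: add the stems back together sample by sample.
    remix = [sum(vals) for vals in zip(*stems)] if stems else [0.0] * len(mixture)
    # Energy of what the stems failed to explain, normalised by the mix energy.
    err = sum((m - r) ** 2 for m, r in zip(mixture, remix))
    energy = sum(m * m for m in mixture)
    return err / energy if energy > 0 else 0.0  # silent input is treated as 0.0
```

Under the hypothesis above, a very small residual would count as a hint toward a synthetic track. Any cut-off would have to be calibrated on labelled data, and Demucs nondeterminism means the value can shift between runs.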

// TAGS

ai-music-detector, audio-gen, research, benchmark

DISCOVERED

106d ago

2026-03-28

PUBLISHED

107d ago

2026-03-27

RELEVANCE

8/10

AUTHOR

Leather_Lobster_2558
