OPEN_SOURCE ↗
REDDIT · REDDIT// 20d agoNEWS
DeepCamera tests small VLMs, finds night-IR gaps
An r/LocalLLaMA user is running DeepCamera with Liquid AI’s LFM2.5-VL 1.6B (Q8) on a 4070/Ryzen 7 box to summarize four RTSP cameras. It works surprisingly well in daylight, but 720p night IR still fails to spot obvious late-night arrivals, putting model size, input resolution, and temporal context under the microscope.
// ANALYSIS
There's probably no magic "smallest model" here; night IR is a domain-shift problem, so the stack around the model matters more than another billion parameters. Liquid AI already recommends the 1.6B checkpoint for most vision use cases, which is a good clue that the bottleneck is low-light robustness and temporal context, not just scale.
- –LFM2.5-VL's native 512x512 processing and tiling help throughput, but they don't restore detail lost to IR noise and motion blur.
- –A 3B-class model may improve descriptions a bit, but video-native or sliding-window context is the real unlock for dwell time, arrivals, and multi-camera event stitching.
- –DeepCamera's HomeSec-Bench is the right eval lens here: its 143-test suite includes night IR, fog, break-in-vs-delivery, prompt injection, and alert routing.
- –The practical architecture is detector-first, VLM-second: shortlist suspicious clips with classical CV, then let the model narrate the window instead of every frame.
// TAGS
deepcameramultimodalinferenceedge-aibenchmarkopen-sourceself-hosted
DISCOVERED
20d ago
2026-03-22
PUBLISHED
20d ago
2026-03-22
RELEVANCE
7/ 10
AUTHOR
aiwhiz1154