OPEN_SOURCE
YT · YOUTUBE // 36d ago // PRODUCT LAUNCH
QwenChat iOS brings Qwen offline to iPhone
QwenChat iOS is a native SwiftUI app that runs 4-bit Qwen 3.5 0.8B and 2B models locally on iPhone using Apple’s MLX stack. The project shows that multimodal chat with image input, streamed responses, and zero server-side inference is now practical on consumer iPhones.
// ANALYSIS
Small multimodal models are crossing from lab demo to usable mobile software, and QwenChat iOS is a sharp proof point for AI developers building on-device apps.
- The app runs fully on-device after the initial Hugging Face model download, which makes privacy and offline usage a real product feature instead of a marketing claim.
- It supports both text and photo input, making it a concrete example of vision-language UX on iPhone rather than just a basic local chat shell.
- The stack matters: SwiftUI, MVVM, Apple MLX, and `mlx-swift-lm` give developers a credible template for shipping local inference on Apple hardware.
- Streaming output, stop controls, model switching, and tokens-per-second metrics make it useful as both a demo and a performance reference for edge AI apps.
- The hard limit is still platform friction: iOS 18+, Xcode 16+, and first-run model downloads mean this is a strong developer example today, not yet a mass-market mobile AI app.
// TAGS
qwenchat-ios · llm · multimodal · inference · edge-ai
DISCOVERED
2026-03-07
PUBLISHED
2026-03-07
RELEVANCE
8/10
AUTHOR
Better Stack