Shared from twixb · venturebeat.com

On-device AI agents hit a hard memory limit. Apple's new architecture routes around it.

venturebeat.com·Jun 9, 2026

Apple has introduced its third-generation foundation models, AFM 3. The on-device model has 20 billion parameters and keeps its weights in NAND flash instead of DRAM, which sidesteps the DRAM capacity limit that previously capped on-device model size. Enterprises can run complex AI tasks locally, and server-based options are available for more demanding workloads. Details on deployment and performance metrics remain pending.

The key takeaway is that Apple's AFM 3 Core Advanced model stores its entire 20-billion-parameter set in NAND flash rather than DRAM. This architectural shift allows larger on-device models without relying on cloud infrastructure, which matters for enterprises that need high-capacity models to run without a continuous cloud connection. It could influence how you plan AI workload deployment by making more capable on-device options viable.
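
The summary does not say how AFM 3 reads weights from flash, so the following is only a rough illustration of the general idea, not Apple's implementation. It first works out the raw weight footprint of a 20-billion-parameter model at two assumed precisions (16-bit and 4-bit; neither figure comes from the article), then shows one generic way to read a slice of weights from a file on storage through a memory map instead of loading the whole file into RAM. The function names and the tiny stand-in weight file are hypothetical.

```python
import mmap
import os
import struct
import tempfile


def weight_footprint_gb(num_params: int, bits_per_param: int) -> float:
    """Raw weight storage in GB (1 GB = 1e9 bytes), ignoring activations and caches."""
    return num_params * bits_per_param / 8 / 1e9


def read_weights_from_file(path: str, start: int, count: int) -> list[float]:
    """Read `count` little-endian float32 values starting at index `start`.

    The file is memory-mapped, so only the pages backing the requested
    slice have to be brought into RAM; the rest stays on storage.
    """
    with open(path, "rb") as f, mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
        raw = mm[start * 4:(start + count) * 4]  # float32 = 4 bytes each
    return list(struct.unpack(f"<{count}f", raw))


def demo() -> list[float]:
    # Hypothetical stand-in "weight file": 1,000 float32 values 0.0, 1.0, 2.0, ...
    values = [float(i) for i in range(1000)]
    with tempfile.NamedTemporaryFile(delete=False) as tmp:
        tmp.write(struct.pack("<1000f", *values))
        path = tmp.name
    try:
        return read_weights_from_file(path, start=500, count=3)
    finally:
        os.remove(path)


if __name__ == "__main__":
    print(weight_footprint_gb(20_000_000_000, 16))  # assumed 16-bit weights
    print(weight_footprint_gb(20_000_000_000, 4))   # assumed 4-bit weights
    print(demo())
```

Under those assumed precisions, the weights alone come to 40 GB at 16 bits per parameter and 10 GB at 4 bits, which gives a sense of why holding every weight in DRAM is the constraint the article describes.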
