On-device AI agents hit a hard memory limit. Apple's new architecture routes around it.
Source: venturebeat.com
You are reading a summary. The full content is hosted on venturebeat.com.
Apple’s AFM 3 models aim to work around on-device DRAM limits by keeping the weights of a 20B-parameter model in NAND flash and loading only the selected experts into DRAM once per prompt, so that roughly 1B to 4B parameters are active per task. Apple has not yet specified its offload rules or published key performance metrics; benchmarks are expected in a summer report.
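The flash-to-DRAM idea described above can be illustrated with a toy sketch. This is not Apple's implementation: the class names, the greedy score-based selection, the 20 equal-sized experts, and the 4-bit weight width are all assumptions made for illustration, since the summary states neither the offload rules nor the quantization used.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class FlashExpertStore:
    """Toy stand-in for expert weights kept in NAND flash (hypothetical)."""

    expert_params: dict[str, int]  # expert name -> parameter count

    def read(self, name: str) -> int:
        # A real system would stream weight tensors; this returns only the size.
        return self.expert_params[name]


@dataclass
class DramResidentSet:
    """Experts currently resident in DRAM, capped by a parameter budget."""

    budget_params: int
    loaded: dict[str, int] = field(default_factory=dict)

    def total(self) -> int:
        return sum(self.loaded.values())

    def load_for_prompt(self, store: FlashExpertStore, scores: dict[str, float]) -> list[str]:
        # Selection happens once per prompt: drop the previous set, then admit
        # experts in descending score order, skipping any that would exceed
        # the budget. The scoring rule itself is an assumption.
        self.loaded.clear()
        for name in sorted(scores, key=scores.get, reverse=True):
            size = store.read(name)
            if self.total() + size <= self.budget_params:
                self.loaded[name] = size
        return list(self.loaded)


def weight_bytes(params: int, bits_per_weight: int) -> int:
    """Storage needed for `params` weights at an assumed bit width."""
    return params * bits_per_weight // 8


# Assumed layout: 20 experts of 1B parameters each (20B total in flash),
# with a DRAM budget of 4B parameters, the upper end of the reported range.
store = FlashExpertStore({f"expert_{i}": 1_000_000_000 for i in range(20)})
dram = DramResidentSet(budget_params=4_000_000_000)
scores = {f"expert_{i}": float(i) for i in range(20)}  # made-up router scores

chosen = dram.load_for_prompt(store, scores)
print(chosen, dram.total())
print(weight_bytes(20_000_000_000, 4), weight_bytes(dram.total(), 4))
```

Under the assumed 4-bit width, the full 20B-parameter model would occupy about 10 GB in flash, while a 4B-parameter resident set would need about 2 GB of DRAM. The actual figures depend on details Apple has not published.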