On-device AI agents hit a hard memory limit. Apple's new architecture routes around it.

Source: venturebeat.com

You are reading a summary. The full content is hosted on venturebeat.com.

Apple’s AFM 3 models aim to work around on-device DRAM limits by keeping the weights of a 20B-parameter model in NAND flash and loading only the selected experts into DRAM once per prompt, so that about 1B to 4B parameters are active per task. Apple has not yet clarified the offload rules or published key performance metrics; benchmarks are expected in a summer report.
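
To make the memory argument concrete, here is a minimal toy sketch of the idea as the summary describes it: all experts sit in flash, and a selected subset is copied into DRAM once per prompt. It is not Apple's implementation. The 4-bit weight precision, the layout of 20 experts with 1B parameters each, the `ExpertCache` class, and the hard-coded expert selection are all assumptions made for illustration; the summary specifies none of them.

```python
from dataclasses import dataclass, field

BITS_PER_WEIGHT = 4  # assumption: the summary does not state a weight precision


def weight_gb(params: float, bits: int = BITS_PER_WEIGHT) -> float:
    """Size of `params` weights in decimal gigabytes at the given precision."""
    return params * bits / 8 / 1e9


@dataclass
class ExpertCache:
    """Toy model: every expert lives in 'flash'; a chosen subset is copied to 'DRAM'."""

    flash: dict[str, float]  # expert name -> parameter count
    dram: dict[str, float] = field(default_factory=dict)

    def load_for_prompt(self, selected: list[str]) -> None:
        # One load per prompt: the resident set is replaced by the selected experts.
        self.dram = {name: self.flash[name] for name in selected}

    def resident_params(self) -> float:
        return sum(self.dram.values())


# Hypothetical layout: 20 experts of 1B parameters each (20B in total).
cache = ExpertCache(flash={f"expert_{i}": 1e9 for i in range(20)})

# The real selection rule is not public; this fixed choice is a placeholder.
cache.load_for_prompt(["expert_3", "expert_7"])

total_gb = weight_gb(sum(cache.flash.values()))
resident_gb = weight_gb(cache.resident_params())
print(f"in flash: {total_gb:.1f} GB, in DRAM: {resident_gb:.1f} GB")
```

Under these assumed numbers, the full 20B parameters take 10 GB in flash, while the two selected experts take 1 GB in DRAM. At the same assumed precision, the 1B to 4B active parameters cited above would correspond to roughly 0.5 GB to 2 GB of resident weights.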
