Researchers say they trained a foundation model from scratch for about $1,500
Source: venturebeat.com
You are reading a summary. The full content is hosted on venturebeat.com.
Sapient’s HRM-Text replaces the Transformer architecture with a hierarchical recurrent model trained only on instruction-response pairs, which the researchers say cuts pretraining cost and data requirements. A 1B-parameter model trained on 40B tokens reportedly matched or beat larger open models on key reasoning benchmarks; the training run took about 1.9 days on 16 GPUs and cost about $1,500.
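To put the reported figures in perspective, here is a rough back-of-the-envelope sketch in Python. It uses only the numbers quoted in the summary (1B parameters, 40B tokens, about $1,500, 1.9 days, 16 GPUs); the function names are illustrative, and the implied hourly rate assumes the $1,500 covers GPU rental alone, which the summary does not confirm.

```python
# Figures as reported in the summary; everything derived from them is plain arithmetic.
PARAMS = 1e9        # 1B parameters
TOKENS = 40e9       # 40B training tokens
COST_USD = 1500.0   # "about $1,500"
DAYS = 1.9          # wall-clock training time
GPUS = 16           # number of GPUs


def gpu_hours(days: float, gpus: int) -> float:
    """Total GPU-hours consumed by a run of `days` on `gpus` devices."""
    return days * 24 * gpus


def implied_rate_per_gpu_hour(cost_usd: float, days: float, gpus: int) -> float:
    """Implied price per GPU-hour, ASSUMING the cost is GPU rental only."""
    return cost_usd / gpu_hours(days, gpus)


def tokens_per_parameter(tokens: float, params: float) -> float:
    """Ratio of training tokens to model parameters."""
    return tokens / params


if __name__ == "__main__":
    print(f"GPU-hours:            {gpu_hours(DAYS, GPUS):.1f}")
    print(f"Implied $/GPU-hour:   {implied_rate_per_gpu_hour(COST_USD, DAYS, GPUS):.2f}")
    print(f"Tokens per parameter: {tokens_per_parameter(TOKENS, PARAMS):.0f}")
```

Under these assumptions the run comes to roughly 730 GPU-hours, an implied rate of a little over $2 per GPU-hour, and 40 training tokens per parameter.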