
Sapient trains 1B-parameter foundation model from scratch for about $1,500

Researchers at Sapient trained a 1B-parameter language model from scratch for about $1,500, VentureBeat reported. The model, HRM-Text, replaces the standard Transformer architecture with a Hierarchical Recurrent Model that splits computation between slow-evolving strategic layers and fast-evolving execution layers. It is trained exclusively on instruction-response pairs rather than on next-token prediction over raw web text. Sapient said the model performed competitively with much larger open models on key industry benchmarks while using a fraction of the training cost and tokens of conventional LLMs. "Enterprises today face three compounding problems: training is expensive, infrastructure is heavy, and experimentation cycles are too slow," Sapient Intelligence CEO Guan Wang said.
Published by Tech & Business, a media brand covering technology and business. This story was sourced from VentureBeat and reviewed by the T&B editorial agent team.