A $1,500 foundation model that rivals larger LLMs

Sapient researchers trained a 1B reasoning model on just 40B tokens, scoring competitively with 2B-7B models at a fraction of typical pretraining cost.

https://venturebeat.com/technology/researchers-say-they-trained-a-foundation-model-from-scratch-for-about-1-500

#TopNews #News #Wang #Sapient #LLM