China Telecom released a single dense trillion parameter semantic model...
【数据猿导读】 China Telecom released a single dense trillion parameter semantic model
On June 19, it was learned from China Telecom that China Telecom Artificial Intelligence Research Institute (TeleAI) and Beijing Zhiyuan Artificial Intelligence Research Institute released the world's first single dense trillion parameter semantic model Tele-FLM-1T, becoming the first institution to release dense trillion parameter large model in China. The reporter learned that in response to the problem of high computing power consumption in large model training, Tele-FLM series models jointly developed by TeleAI and Zhiyuan based on key technologies such as model growth and loss prediction only use 9% of the computing power resources of the industry's ordinary training programs, based on 112 A800 servers. Completed the training of 3 models with a total of 2.3T tokens in 4 months.
来源:DIYuan