China Telecom has released its first large-scale voice model supporting 30 dialects...

China Telecom voice model

DIYuan | 2024-05-27 17:13

【数据猿导读】 China Telecom has released its first large-scale voice model supporting 30 dialects

On May 26, the China Telecom Artificial Intelligence Research Institute (TeleAI) released the industry's first large speech recognition model that supports the free mixing of 30 dialects, Star super multi-dialect speech recognition large model, breaking the dilemma that a single model can only identify a specific single dialect, and can simultaneously identify and understand more than 30 dialects such as Cantonese, Shanghai, Sichuan and Wenzhou. It is understood that China Telecom Artificial Intelligence Research Institute has built more than 30 kinds of high-quality dialect databases, more than 300,000 hours, the first "distillation + expansion" joint training algorithm, to solve the problem of pre-training collapse under super-large-scale multi-scene data sets and large-scale parameter conditions, to achieve 1B parameter 80-layer model stable training.

来源：DIYuan

收藏分享

声明：数据猿尊重媒体行业规范，相关内容都会注明来源与作者；转载我们原创内容时，也请务必注明“来源：数据猿”与作者名称，否则将会受到数据猿追责。