Wise Source Research Institute Launches Flag Eval "Scales" Big Model Assessment System...

Scales Big Model Assessment System

DIYuan | 2023-06-09 18:34

【数据猿导读】 Wise Source Research Institute Launches FlagEval "Scales" Big Model Assessment System

June 9 morning news, 2023 Beijing wisdom source conference, wisdom source research institute director Huang Tiejun announced the launch of FlagEval (scales) large language model evaluation system, aimed at "ability, task, indicators" from the three-dimensional evaluation perspective, more than 600 dimensions of the large model for a comprehensive evaluation, to establish a scientific, fair and comprehensive The system aims to establish a scientific, fair and comprehensive technical evaluation system for the Big Model. According to the introduction, the task dimension of the big model currently includes 22 subjective and objective assessment data sets, with as many as 84,433 assessment questions. Currently exploring the use of artificial intelligence technology for scientific evaluation, and strive to reduce more subjective evaluation. It is also exploring the use of large model evaluation to assist in large model pre-training.

来源：DIYuan

收藏分享

声明：数据猿尊重媒体行业规范，相关内容都会注明来源与作者；转载我们原创内容时，也请务必注明“来源：数据猿”与作者名称，否则将会受到数据猿追责。