܄

Wise Source Research Institute Launches Flag Eval "Scales" Big Model Assessment System...

【数据猿导读】 Wise Source Research Institute Launches FlagEval "Scales" Big Model Assessment System

Wise Source Research Institute Launches Flag Eval

June 9 morning news, 2023 Beijing wisdom source conference, wisdom source research institute director Huang Tiejun announced the launch of FlagEval (scales) large language model evaluation system, aimed at "ability, task, indicators" from the three-dimensional evaluation perspective, more than 600 dimensions of the large model for a comprehensive evaluation, to establish a scientific, fair and comprehensive The system aims to establish a scientific, fair and comprehensive technical evaluation system for the Big Model. According to the introduction, the task dimension of the big model currently includes 22 subjective and objective assessment data sets, with as many as 84,433 assessment questions. Currently exploring the use of artificial intelligence technology for scientific evaluation, and strive to reduce more subjective evaluation. It is also exploring the use of large model evaluation to assist in large model pre-training.


来源:DIYuan

声明:数据猿尊重媒体行业规范,相关内容都会注明来源与作者;转载我们原创内容时,也请务必注明“来源:数据猿”与作者名称,否则将会受到数据猿追责。

刷新相关文章

Netease
Netease "against the water cold" hand game on li...
SenseTime and Shanghai AI Lab released
SenseTime and Shanghai AI Lab released "Shu Sheng...
Homework Help is testing a big model of education based on the Chinese market
Homework Help is testing a big model of education...

我要评论

数据猿微信公众号
返回顶部