Wise Source Research Institute Launches Flag Eval "Scales" Big Model Assessment System...
DIYuan | 2023-06-09 18:34
【数据猿导读】 Wise Source Research Institute Launches FlagEval "Scales" Big Model Assessment System
![Wise Source Research Institute Launches Flag Eval](/u/cms/www/202306/09183332aozd.png)
June 9 morning news, 2023 Beijing wisdom source conference, wisdom source research institute director Huang Tiejun announced the launch of FlagEval (scales) large language model evaluation system, aimed at "ability, task, indicators" from the three-dimensional evaluation perspective, more than 600 dimensions of the large model for a comprehensive evaluation, to establish a scientific, fair and comprehensive The system aims to establish a scientific, fair and comprehensive technical evaluation system for the Big Model. According to the introduction, the task dimension of the big model currently includes 22 subjective and objective assessment data sets, with as many as 84,433 assessment questions. Currently exploring the use of artificial intelligence technology for scientific evaluation, and strive to reduce more subjective evaluation. It is also exploring the use of large model evaluation to assist in large model pre-training.
来源:DIYuan
刷新相关文章
我要评论
不容错过的资讯
大家都在搜
![数据猿微信公众号](/r/cms/www/default/images/weixingongzhong.jpg)