
Humans outperform AI in Alibaba math competition

By Chen Meiling and Chen Ye | chinadaily.com.cn | Updated: 2024-06-18 20:29

Although artificial intelligence has demonstrated capabilities surpassing humans in many fields, it still faces significant limitations in the realm of mathematics.

During the preliminary round of the 2024 Alibaba Global Mathematics Competition, 563 teams used AI to answer questions. Much to the surprise of AI advocates, none of the teams scored high enough to advance to the finals.

During the 48-hour preliminary round, AI and human participants were given the same exam questions, including multiple-choice, problem-solving and proof questions. AI teams were asked to submit their models in advance to avoid cheating.

According to the competition's organizing committee, the participating AI teams averaged a score of 18, on par with the average for human competitors. However, the highest score achieved by AI was only 34, far behind the highest human score of 113.

Chen Tianchu, who researches large models at the Computer Architecture Laboratory of Zhejiang University, said that large language models (LLMs) still work by predicting the next word at a fixed rate based on context and outputting the result in a single pass. For tasks such as math competitions, which require repeated trial and error and careful deliberation, LLMs remain limited in complex reasoning and rigorous thinking, The Economic Observer reported. He added that AI cannot yet replace professionally trained humans in math.

About half of the AI team members were born after 2000 and represented institutions such as Peking University, Tsinghua University, the University of Oxford, Amazon Web Services and ByteDance.

Some teams adapted open-source large models, enabling AI to progress from elementary to advanced mathematics; others built AI agents that used prompt engineering to access closed-source models such as GPT-4 and improve their mathematical problem-solving abilities.

Tu Jinhao from Jianping High School in Shanghai achieved the highest score by using AI. Drawing inspiration from the concept of self-debate, Tu applied multiple large models to several rounds of "self-questioning, self-answering, self-verification" to seek the optimal solution to problems.
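
The article does not describe how Tu's system was built, so the following is only a minimal sketch of what a "self-questioning, self-answering, self-verification" loop could look like. Every name in it (`self_debate`, `guess_odd`, `guess_next`, `check_prime_above_20`) is hypothetical, and plain Python functions stand in for the large models so that the example runs without any external service.

```python
from typing import Callable, List, Optional

# Hypothetical roles. In a real system each would wrap a call to a large model.
Proposer = Callable[[str, List[str]], str]    # (problem, objections so far) -> candidate answer
Verifier = Callable[[str, str], Optional[str]]  # (problem, answer) -> objection, or None if accepted


def self_debate(problem: str, proposers: List[Proposer], verify: Verifier,
                max_rounds: int = 3) -> Optional[str]:
    """Let several proposers take turns answering until one answer passes verification."""
    objections: List[str] = []
    for _ in range(max_rounds):
        for propose in proposers:
            answer = propose(problem, objections)   # self-answering
            objection = verify(problem, answer)     # self-verification
            if objection is None:
                return answer
            objections.append(objection)            # feeds the next attempt
    return None  # no candidate survived verification


# Toy stand-ins for models (assumptions for illustration only).
def guess_odd(problem: str, objections: List[str]) -> str:
    return str(21 + 2 * len(objections))


def guess_next(problem: str, objections: List[str]) -> str:
    return str(22 + len(objections))


def check_prime_above_20(problem: str, answer: str) -> Optional[str]:
    n = int(answer)
    if n <= 20:
        return f"{n} is not greater than 20"
    if any(n % d == 0 for d in range(2, int(n ** 0.5) + 1)):
        return f"{n} is not prime"
    return None


problem = "Find a prime number greater than 20."
result = self_debate(problem, [guess_odd, guess_next], check_prime_above_20)
print(result)
```

In this toy run the first proposer's answer is rejected and the objection is passed on to the second proposer, whose answer is accepted. A loop of this shape only helps when checking an answer is more reliable than producing one.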

The top three AI teams earned prizes of $10,000, $5,000 and $2,000.

According to the organizing committee, the annual event will remain open to AI to encourage exploration of its potential limits and drive research and innovation in its application to mathematics.

Yin Wotao, a member of the committee, said in an interview with Shanghai Securities News that opening the competition to AI is a positive attempt to push the limits of AI capabilities and open up more possibilities.
