男女羞羞视频在线观看,国产精品黄色免费,麻豆91在线视频,美女被羞羞免费软件下载,国产的一级片,亚洲熟色妇,天天操夜夜摸,一区二区三区在线电影
Global EditionASIA 中文雙語Fran?ais
Business
Home / Business / Technology

Chinese smaller generative AI tool exhibits robust abilities at much lower cost

Xinhua | Updated: 2025-03-10 16:29
Share
Share - WeChat

BEIJING -- A Chinese open-source AI model is shown to rival top-tier global competitors such as DeepSeek R1, despite its smaller size, representing another step forward in balancing performance and efficiency in AI application.

The QwQ-32B, unveiled last Thursday by Alibaba's Qwen team, operates on just 24 GB of video memory with only 32 billion parameters, while DeepSeek's R1 demands 1,600 GB to run its 671 billion parameters, thus realizing a 98-percent reduction.

Also, compared to OpenAI's o1-mini and Anthropic's Sonnet 3.7, Qwen's AI model has substantially lower computational requirements.

Kyle Corbitt, a former Google engineer, published his testing results on social media platform X, showing that "the smaller, open-weight model can match state-of-the-art reasoning performance."

According to Corbitt's team, QwQ-32B achieved the second-highest score in a deductive reasoning benchmark via a method called reinforcement learning (RL), outperforming R1, o1 and o3-mini, while nearly matching Sonnet 3.7's performance at an inference cost more than 100-fold lower than that required by Sonnet 3.7.

"AI isn't just getting smarter, it's learning how to evolve," commented Shashank Yadav, CEO of Fraction AI. "QwQ-32B proves that reinforcement learning can out-compete brute-force scaling."

"We found RL training enhances performance, particularly in math and coding tasks. Its expansion can enable medium-sized models to match large MoE models' performance," read Qwen's blog article on Github.

Qwen's new model is expected to enhance the feasibility of local operations for generative AI products on computers and even mobile devices in the future.

Awni Hannun, a computer scientist at Apple, has run QwQ-32B on the Apple computer powered by its M4 Max chip, and it appears to be "running nicely."

China's national supercomputing internet platform last Saturday announced the launch of the API interface service for QwQ-32B. In addition, Biren Technology, a Shanghai-based GPU chip designer, announced Sunday that it has launched an all-in-one machine capable of running this model.

QwQ-32B is freely accessible as an open-source model that anyone can run, following DeepSeek's path of facilitating wider application of AI technologies worldwide and contributing China's wisdom to the world.

Alibaba also recently open-sourced its AI video-generating model Wan2.1, which is available for download on Alibaba Cloud's AI model community, Model Scope and the collaborative AI platform Hugging Face.

The e-commerce and cloud-computing giant has announced a plan to invest more than 380 billion yuan (about $52.97 billion) in building cloud and AI hardware infrastructure over the next three years.

Top
BACK TO THE TOP
English
Copyright 1995 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.
License for publishing multimedia online 0108263

Registration Number: 130349
FOLLOW US
CLOSE
 
主站蜘蛛池模板: 嘉义县| 金湖县| 曲松县| 乌兰县| 湄潭县| 景洪市| 方山县| 南华县| 漳平市| 柯坪县| 安义县| 菏泽市| 集贤县| 南召县| 阳曲县| 常州市| 巴塘县| 抚顺市| 长岛县| 四子王旗| 安岳县| 酒泉市| 栾城县| 长沙县| 菏泽市| 景谷| 神农架林区| 武平县| 徐闻县| 稷山县| 镇平县| 渭源县| 连南| 凌云县| 格尔木市| 新丰县| 寿阳县| 墨脱县| 洞口县| 新野县| 房产| 昌邑市| 合阳县| 黎川县| 马尔康县| 乌拉特后旗| 东平县| 保亭| 廉江市| 新津县| 基隆市| 张家川| 巩留县| 马尔康县| 苍梧县| 六盘水市| 台中县| 安福县| 徐汇区| 龙川县| 府谷县| 萨迦县| 当涂县| 安义县| 瑞昌市| 茌平县| 紫金县| 贺州市| 铜山县| 西平县| 同仁县| 临夏市| 蒲江县| 苗栗县| 武穴市| 壶关县| 开封市| 望江县| 永定县| 凤凰县| 古丈县| 攀枝花市|