男女羞羞视频在线观看,国产精品黄色免费,麻豆91在线视频,美女被羞羞免费软件下载,国产的一级片,亚洲熟色妇,天天操夜夜摸,一区二区三区在线电影
Global EditionASIA 中文雙語Fran?ais
China
Home / China / Innovation

Beijing Academy of AI unveils next-gen multimodal model Emu3

By DU JUAN | chinadaily.com.cn | Updated: 2024-10-24 15:37
Share
Share - WeChat

This week, the Beijing Academy of Artificial Intelligence unveiled a self-developed multimodal world model named Emu3, which achieves a unified understanding and generation of video, images and text.

Emu3 successfully validates that next-token prediction can serve as a powerful paradigm for multimodal models, scaling beyond language models and delivering state-of-the-art performance across multimodal tasks. In simple terms, it shows that predicting the next word or element in a sequence can be useful for models that handle both text and images, not just text alone.

Emu3 focuses on predicting the next part of a sequence, removing the necessity for complex methods like diffusion or composition. It converts images, text, and videos into a common format, teaching a single transformer model from the beginning on a mix of different types of sequences containing both text and images.

According to the academy, it has open-sourced Emu3's key technologies and models to the international tech community. Industry experts have expressed that for researchers, Emu3 signifies a new opportunity to explore multimodality through a unified architecture without the need to combine complex diffused models with large language models.

Wang Zhongyuan, director of the academy, said Emu3 has demonstrated high performance in multimodal tasks through next-token prediction, paving the way for the development of multimodal AGI.

"Emu3 has the potential to converge infrastructure development onto a single technical path, laying the foundation for large-scale multimodal training and inference," he said. "This simple architectural design will facilitate industrialization. In the future, multimodal world models will drive applications in scenarios such as robotic cognition, autonomous driving, multimodal conversations and reasoning."

Top
BACK TO THE TOP
English
Copyright 1995 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.
License for publishing multimedia online 0108263

Registration Number: 130349
FOLLOW US
 
主站蜘蛛池模板: 河间市| 增城市| 邹城市| 南阳市| 朝阳区| 郸城县| 房产| 湖州市| 图木舒克市| 洪江市| 加查县| 竹溪县| 永川市| 永德县| 兰考县| 白银市| 兰考县| 凌云县| 扶风县| 高雄县| 尼玛县| 潮安县| 常山县| 德昌县| 临朐县| 环江| 集安市| 喜德县| 出国| 泗洪县| 新兴县| 德格县| 灵山县| 达州市| 彝良县| 苍溪县| 崇左市| 翁源县| 高州市| 梨树县| 河津市| 五华县| 清原| 会同县| 宿州市| 克拉玛依市| 郧西县| 洛隆县| 九台市| 冕宁县| 体育| 五莲县| 疏勒县| 湖口县| 威远县| 陆良县| 泗阳县| 新巴尔虎左旗| 辰溪县| 双流县| 广东省| 辽阳市| 宁安市| 哈密市| 乌鲁木齐市| 孟村| 英吉沙县| 潼南县| 涞源县| 海丰县| 哈巴河县| 从化市| 景泰县| 平江县| 科技| 仙居县| 镇平县| 四会市| 甘谷县| 江口县| 麻栗坡县| 略阳县|