
Chinese developer launches multimodal model unifying video, image, text

Xinhua | Updated: 2024-10-22 11:03

BEIJING -- The Beijing Academy of Artificial Intelligence (BAAI) on Monday released Emu3, a multimodal world model that unifies the understanding and generation of text, image, and video modalities with next-token prediction.

Emu3 successfully validates that next-token prediction can serve as a powerful paradigm for multimodal models, scaling beyond language models and delivering state-of-the-art performance across multimodal tasks, said Wang Zhongyuan, director of BAAI, in a press release.

"By tokenizing images, text, and videos into a discrete space, we train a single transformer from scratch on a mixture of multimodal sequences," Wang said, adding that Emu3 eliminates the need for diffusion or compositional approaches entirely.

Emu3 outperforms several well-established task-specific models in both generation and perception tasks, according to BAAI, which has open-sourced the key technologies and models of Emu3 to the international technology community.

Technology practitioners said the unified architecture opens a new way to explore multimodality, one that does not require combining complex diffusion models with large language models (LLMs).

"In the future, the multimodal world model will promote scenario applications such as robot brains, autonomous driving, multimodal dialogue and inference," Wang said.

Copyright 1995 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.