男女羞羞视频在线观看,国产精品黄色免费,麻豆91在线视频,美女被羞羞免费软件下载,国产的一级片,亚洲熟色妇,天天操夜夜摸,一区二区三区在线电影
Global EditionASIA 中文雙語Fran?ais
China
Home / China / Innovation

Chinese developer launches multimodal model unifying video, image, text

Xinhua | Updated: 2024-10-22 11:03
Share
Share - WeChat

BEIJING -- The Beijing Academy of Artificial Intelligence (BAAI) on Monday released Emu3, a multimodal world model that unifies the understanding and generation of text, image, and video modalities with next-token prediction.

Emu3 successfully validates that next-token prediction can serve as a powerful paradigm for multimodal models, scaling beyond language models and delivering state-of-the-art performance across multimodal tasks, said Wang Zhongyuan, director of BAAI, in a press release.

"By tokenizing images, text, and videos into a discrete space, we train a single transformer from scratch on a mixture of multimodal sequences," Wang said, adding that Emu3 eliminates the need for diffusion or compositional approaches entirely.

Emu3 outperforms several well-established task-specific models in both generation and perception tasks, according to BAAI, which has open-sourced the key technologies and models of Emu3 to the international technology community.

Technology practitioners have said that a new opportunity has emerged to explore multimodality through a unified architecture, eliminating the need to combine complex diffusion models with large language models (LLMs).

"In the future, the multimodal world model will promote scenario applications such as robot brains, autonomous driving, multimodal dialogue and inference," Wang said.

Top
BACK TO THE TOP
English
Copyright 1995 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.
License for publishing multimedia online 0108263

Registration Number: 130349
FOLLOW US
 
主站蜘蛛池模板: 壤塘县| 安化县| 天祝| 永兴县| 汉沽区| 海口市| 达日县| 资阳市| 三江| 乐山市| 修文县| 新巴尔虎左旗| 巨鹿县| 绥阳县| 吉安县| 商洛市| 龙南县| 南宫市| 桂林市| 喀喇沁旗| 中西区| 德格县| 平定县| 定西市| 丹阳市| 双流县| 陆川县| 濮阳市| 东山县| 加查县| 漾濞| 德州市| 五指山市| 茂名市| 昌平区| 扎兰屯市| 苍南县| 石景山区| 呼图壁县| 手游| 黄平县| 克山县| 灵武市| 永康市| 南开区| 荆州市| 区。| 灵武市| 肇州县| 普兰店市| 勐海县| 二连浩特市| 汝南县| 浠水县| 玉田县| 绵竹市| 洪雅县| 专栏| 绥阳县| 石楼县| 西平县| 东阿县| 仁寿县| 和硕县| 合江县| 汾西县| 临湘市| 额敏县| 喀喇| 泰顺县| 沭阳县| 综艺| 马边| SHOW| 桓仁| 东平县| 韶山市| 佛坪县| 庄河市| 陵川县| 夏邑县| 独山县|