伊人婷婷涩六月丁香七月_国产亚洲视频在线免费观看_91本色_久久日本精品字幕区二区_久久久人体_91免费国产视频网站

position: EnglishChannel  > AI ripples> Chinese AI Model Emu3 Handles Text, Image, Video Seamlessly

Chinese AI Model Emu3 Handles Text, Image, Video Seamlessly

Source: Science and Technology Daily | 2024-12-17 15:44:35 | Author: Gong Qian

On October 21, the Beijing Academy of Artificial Intelligence (BAAI), a Chinese non-profit organization engaged in AI R&D, released Emu3, a multimodal AI model that seamlessly integrates text, image, and video modalities into a single, unified framework.

The BAAI research team said Emu3 is expected to be used in scenario applications such as robot brains, autonomous driving, multimodal dialogue and inference.

Emu3, based solely on next-token prediction, proves that next-token prediction can be a powerful paradigm for multimodal models.

The existing multimodal AI models are mostly designed for specific tasks. Each has its corresponding architecture and methods. For instance, in the field of video generation, many developers use the diffusion in time (DiT) architecture, as referenced by Sora. Other models such as Stable Diffusion are used for text-to-image synthesis, Sora for text-to-video conversion, and GPT-4V for image-to-text generation.

In contrast to these models, which have a combination of isolated skills rather than an inherently unified ability, Emu3, eliminates the need for diffusion or compositional approaches. By tokenizing images, text, and videos into a discrete space, BAAI has developed a single transformer from scratch.

Emu3 outperforms several well-established task-specific models in both generation and perception tasks, surpassing flagship models such as SDXL and LLaVA.

In September, BAAI open-sourced the key technologies and models of Emu3 including the chat model and generation model after supervised fine-tuning.

Emu3 has been receiving rave reviews from overseas developers. "For researchers, a new opportunity has emerged to explore multimodality through a unified architecture, eliminating the need to combine complex diffusion models with large language models. This approach is akin to the transformative impact of transformers in vision-related tasks," AI consultant Muhammad Umair said on social media platform Meta.

While next-token prediction is considered a promising path towards artificial general intelligence, it struggled to excel in multimodal tasks, which were dominated by diffusion models such as Stable Diffusion and compositional approaches like CLIP combined with large language models.

Raphael Mansuy, co-founder of QuantaLogic, an AI agent platform, thinks that Em3 has significant implications for Al development. Mansuy wrote on X that Em3's success suggests several key insights: Next-token prediction as a viable path to general multimodal Al; potential for simplified and more scalable model architectures; challenge to the dominance of diffusion and compositional approaches.

Editor:GONG Qian

Top News

Jointly Protecting People's Rights in Digital Era

?Emerging technologies like AI, big data and the Internet of Things are rapidly reshaping the world in this era of digital intelligence. However, they are also bringing challenges to human rights, which makes joint efforts essential. Science and Technology Daily spoke with international experts on these issues against the backdrop of the 2025 China-Europe Seminar on Human Rights hosted by the China Society for Human Rights Studies and Cátedra China Foundation in Madrid, Spain, on June 25 on the theme "Human Rights in the Era of Digital Intelligence."

First Human Clinical Trial of Invasive BCI in China

A major breakthrough in neurotechnology has been achieved with the successful completion of China's first-in-human clinical trial of an invasive brain-computer interface (BCI) system. With that China becomes the second country in the world to reach the clinical stage in this field.

抱歉,您使用的瀏覽器版本過低或開啟了瀏覽器兼容模式,這會影響您正常瀏覽本網頁

您可以進行以下操作:

1.將瀏覽器切換回極速模式

2.點擊下面圖標升級或更換您的瀏覽器

3.暫不升級,繼續瀏覽

繼續瀏覽
主站蜘蛛池模板: 久久精品99视频 | 999久久国精品免费观看网站 | 亚洲一区成人在线观看 | 亚洲成人在线综合 | 在线观看中文字幕日韩 | 久久青草国产精品一区 | "www av在线"| 99久热国产精品视频尤物 | 五月丁香六月婷婷国产视频96视频 | 亚洲精品乱码久久久久久蜜桃麻豆 | 久久福利青草精品资源站免费 | 国产一级黄色大片 | 久热最新 | 国产精品18久久久久久麻辣 | 91色视频在线 | 成人国内精品久久久久影院成.人国产9 | 久久一区二区三区视频 | 一级毛片成人免费看免费不卡 | 特黄特色大片免费播放器9i | 超级乱淫片67194免费看 | 国产啊v在线观看 | 亚洲欧洲自拍拍偷精品美利坚 | 粉嫩国产 | 99精品久久久久久中文字幕 | а天堂中文在线官网在线 | 久久综合免费视频 | 一级黄色录像影片夫妻性生活影片 | 国产精久久一区二区三区 | 欧美亚人xxxx高潮猛交 | 国产亚洲美女精品久久久2020 | 懂色av一区二区三区免费看 | 97国产小视频 | 98k网站毛片成年女人网站 | 国产欧美日韩高清在线不卡 | 狠久久| 天天操很很操夜夜操夜夜 | 欧美国产日韩在线观看成人 | 久久综合亚洲色一区二区三区 | 亚洲永久精品唐人导航网址 | 久久aⅴ乱码一区二区三区 精品毛片一区二区三区 | 国产99热在线观看 |