网易首页 > 网易号 > 正文 申请入驻

AI Will Close the Loop Between Seeing and Doing, Says 'Godmother of AI' Fei-...

0
分享至

TMTPOST--Seeing is not just for understanding the world, but for doing. And one day it will be AGI and spacial intelligence that close the loop between seeing and doing, said Stanford University's artificial intelligence leader Fei-Fei Li at the first Asian American Pioneer Medal Symposium and Ceremony at Stanford University on Friday.

"Nature created perceptual animals like us, but always starting from twilight half a billion years ago, because there is a in imperative in evolution that seeing and doing is a close loop," she remarked. This spatial intelligence involves not just recognizing objects but understanding their relationships and planning actions in a 3D space.

To illustrate this, Li provided examples of AI algorithms capable of reconstructing 3D scenes from 2D images, showcasing the early signs of robust spatial intelligence. These advancements have profound implications for fields like robotics, where machines need to navigate and manipulate their environment.

In the rapidly evolving field of artificial intelligence (AI), a significant divide is emerging between proponents of open source technology and advocates of proprietary solutions. Industry experts and stakeholders from academia, the public sector, venture capital, and entrepreneurial circles are rallying to support open source initiatives, highlighting the critical need for collaborative development in AI.

California Senate Bill 1047 poses a significant threat to the open source community, Li pointed out. "It's actually wrong that this legislation is coming out of California," said Li, who stressed that many are actively working to amend or repeal the bill to protect the interests of the open source community.

In a speech, Li outlined how modern AI has been driven by three converging forces in the past decades: neural networks (or deep learning), advanced chips like Nvidia's GPUs, and big data. These elements have collectively propelled significant advancements in AI, particularly in the realm of computer vision.

Li highlighted the remarkable progress in visual recognition, saying "Machines quickly became able to recognize visual objects on par with human performance." However, this achievement is just the beginning. The past decade has seen tremendous strides in areas such as object segmentation, dynamic tracking, and understanding complex, multi-object scenarios.

Current AI models, such as GPT-4 and Gemini 1.5, have demonstrated impressive capabilities in processing and generating language from multimodal inputs. These models can interpret text, images, and even generate language outputs, Li said so when responding to a question raised by Zhao hejuan, the CEO of TMTPost.

Yet, despite their advancements, these models are still largely confined to two-dimensional representations of the world. For example, the AI-generated video of a Japanese woman walking down a street in Tokyo or Kyoto is limited to a single perspective and lacks the ability to understand and manipulate the scene in three dimensions, she further explained.

The limitation lies in the AI's lack of spatial intelligence—a fundamental aspect of human cognition that enables us to understand depth, shape, and spatial relationships, Li elaborated, saying "Nature evolution made animals to be able to understand and live and plan and interact in this 3D world. And this is as ancient as 540 million years ago, when the first trial by starting to see light in the water, they need to navigate. If they don't navigate the 3D world, they become someone's dinner very quickly. So as evolution goes, animals gained more for spatial intelligence capability."

The integration of spatial intelligence in AI would unlock new possibilities. Li envisioned. In AR and VR, it would enhance the realism and interactivity of virtual environments. For robotics, it would enable machines to better navigate and manipulate objects in the real world. This advancement would also benefit design and creative industries by allowing AI to generate and understand complex three-dimensional designs, Li noted.

She also discussed the creation of image captioning algorithms. "We gave the computer one picture, and through the neural network, it was able to describe the scene in natural language," Li explained. This milestone was followed by the development of algorithms capable of generating images from textual descriptions, showcasing the rapid evolution of generative AI, Li added.

In recent years, the generative AI field has expanded beyond static images to include video generation. Companies like OpenAI and various startups have developed algorithms that can generate videos from single sentences, pushing the boundaries of what AI can achieve. However, Li posed the question: what's next?

Looking to the future, Li envisioned AI that can perform complex tasks through thought alone. A pilot study from Li's lab demonstrated a subject wearing an EEG cap, controlling a robot to make a meal using brain signals. While this technology is not yet ready for commercialization, it represents the cutting-edge potential of AI.

While large language models continue to dominate the AI landscape, Li argued that spatial intelligence will be crucial for the next wave of AI advancements. "It's nature's way of closing the loop between seeing and doing, and it will be AI's way of understanding and interacting with the world," Li elaborated.

Li's team has been collaborating with Nvidia to create dynamic environments that benchmark everyday household activities for robots. Additionally, they have been integrating large language models with visual models to instruct robots in performing tasks, such as opening doors or making sandwiches, based purely on natural language instructions.

When it comes to understanding specific technical issues or details, the consensus is to rely on credible experts in the field, said Li when talking about the question of trust. Experts bring specialized knowledge and are often engaged in ongoing debates and discussions. Peer reviews and expert forums are key mechanisms for ensuring the reliability of technical information, she explained.

However, the situation differs when evaluating broader aspects of technology, such as its safety and societal impact. Historically, government agencies and industry bodies have played significant roles in these evaluations. For instance, the FDA has been crucial in regulating drugs and food products. Yet, there are instances where these institutions have been criticized, and their actions scrutinized, as seen with high-profile cases of wrongdoing and inefficiencies, Li further illustrated.

Technology, whether it's AI, CRISPR, or any other advancement, is not the property of any single entity or group. Instead, it is a collective responsibility, she added. As these technologies become increasingly integrated into various aspects of society, it is essential for all stakeholders—governments, industries, and the public—to engage in continuous dialogue and oversight, Li emphasized.

特别声明:以上内容(如有图片或视频亦包括在内)为自媒体平台“网易号”用户上传并发布,本平台仅提供信息存储服务。

Notice: The content above (including the pictures and videos if any) is uploaded and posted by a user of NetEase Hao, which is a social media platform and only provides information storage services.

相关推荐
热点推荐
正能量!刘强东的小心思:每年发放年货,都在自家门口的大广场

正能量!刘强东的小心思:每年发放年货,都在自家门口的大广场

小淇言说
2025-01-09 12:59:34
湖人宣布不打了!雷迪克损失千万

湖人宣布不打了!雷迪克损失千万

毒舌NBA
2025-01-10 06:58:43
今日要闻!1月10日凌晨,中国传来9条好消息,一起看精彩新闻摘要

今日要闻!1月10日凌晨,中国传来9条好消息,一起看精彩新闻摘要

一群怪咖
2025-01-10 00:48:47
突发内讧!火箭最快速度交易!NBA探花秀终于打出来……

突发内讧!火箭最快速度交易!NBA探花秀终于打出来……

篮球实战宝典
2025-01-10 00:02:54
骑士32胜4负,打出历史级表现,你知道16年勇士同期多少胜场吗?

骑士32胜4负,打出历史级表现,你知道16年勇士同期多少胜场吗?

大西体育
2025-01-09 20:14:38
中国西工大再创黑科技,硬核造出水滴飞行器,《三体》科幻变现实

中国西工大再创黑科技,硬核造出水滴飞行器,《三体》科幻变现实

华山穹剑
2025-01-07 19:59:07
泰国很危险吗?我带娃在泰国旅行两个月的真实感受

泰国很危险吗?我带娃在泰国旅行两个月的真实感受

历史总在押韵
2025-01-10 00:03:14
颜十六老家被扒!妻儿父母在江苏,被骗人母亲报警无望,失声痛哭

颜十六老家被扒!妻儿父母在江苏,被骗人母亲报警无望,失声痛哭

小冠说娱
2025-01-09 16:06:45
感谢发明这个配方的人,让我体会到了上厕所超顺畅的快乐

感谢发明这个配方的人,让我体会到了上厕所超顺畅的快乐

猪猪之家
2025-01-08 19:30:03
失控了!高铁涨幅30%,史上最贵二等座出炉

失控了!高铁涨幅30%,史上最贵二等座出炉

小鹿姐姐情感说
2025-01-09 12:13:39
泰国朋友说实话,他说如果去泰国旅游,最好自称湾湾人!

泰国朋友说实话,他说如果去泰国旅游,最好自称湾湾人!

猫小狸同学
2025-01-09 19:45:03
卡塞米罗转会沙特已达成协议,收好行装准备离开!曼联或收3000万

卡塞米罗转会沙特已达成协议,收好行装准备离开!曼联或收3000万

罗米的曼联博客
2025-01-10 08:14:45
爆大料!A股再现离谱事件,网传融券做空规则未被有效执行!

爆大料!A股再现离谱事件,网传融券做空规则未被有效执行!

云姐财说
2025-01-10 00:00:10
哪里来的妖孽!成都街头惊现“娘文化”,网友:我怕他舔我手指头

哪里来的妖孽!成都街头惊现“娘文化”,网友:我怕他舔我手指头

深析古今
2024-12-16 15:51:38
越南:之前的努力全白费,10平方公里的牛轭礁,已被中国渔民实控

越南:之前的努力全白费,10平方公里的牛轭礁,已被中国渔民实控

苗苗情感说
2025-01-10 01:58:21
轰满分147分,147-0,赵心童连夺3冠,QTour6再次冲冠+职业赛资格

轰满分147分,147-0,赵心童连夺3冠,QTour6再次冲冠+职业赛资格

全能体育柳号
2025-01-10 07:05:02
最新!泰国警方回应25岁模特失联案,家人:“同星星事件几乎一样”

最新!泰国警方回应25岁模特失联案,家人:“同星星事件几乎一样”

新民晚报
2025-01-09 18:43:49
皆大欢喜!67岁香港知名男星官宣结婚,新婚妻子颜值赛过李嘉欣

皆大欢喜!67岁香港知名男星官宣结婚,新婚妻子颜值赛过李嘉欣

白面书誏
2025-01-08 13:14:49
乌军向库尔斯克的后勤中心推进!一天击退俄军百余次进攻

乌军向库尔斯克的后勤中心推进!一天击退俄军百余次进攻

项鹏飞
2025-01-09 19:17:42
事实证明,开演唱会2600万收入全捐出去的刀郎,已走上另一条大道

事实证明,开演唱会2600万收入全捐出去的刀郎,已走上另一条大道

林轻吟
2024-10-18 06:25:03
2025-01-10 10:15:00
钛媒体APP incentive-icons
钛媒体APP
独立财经科技媒体
112502文章数 859767关注度
往期回顾 全部

教育要闻

不要太尊重你的孩子

头条要闻

广州知名月子中心人去楼空 留下产妇们"没饭吃没水喝"

头条要闻

广州知名月子中心人去楼空 留下产妇们"没饭吃没水喝"

体育要闻

纳什:梅西是足坛乔丹 哈维魔笛丁丁像我

娱乐要闻

李明德疑似诈捐!下一步全网封号

财经要闻

人民币,让空头失望了

科技要闻

特斯拉中国推出新款Model Y 26.35万元起售

汽车要闻

10万元级无图智驾 悦也PLUS全路况实测

态度原创

游戏
本地
旅游
艺术
公开课

《巫师4》开发工作氛围健康积极 老手带新手合力打造

本地新闻

食味印象|来太原,先干了这碗牺汤!

旅游要闻

张家口一滑雪场儿童从缆车坠落,景区回应

艺术要闻

故宫珍藏的墨迹《十七帖》,比拓本更精良,这才是地道的魏晋写法

公开课

李玫瑾:为什么性格比能力更重要?

无障碍浏览 进入关怀版