Gemini Robotics

From Wikipedia, the free encyclopedia

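Gemini Robotics is an advanced vision-language-action (VLA) model developed by Google DeepMind[1] in partnership with Apptronik.[2] It is built on the Gemini 2.0 large language model[3] and adapted specifically for robotics applications, which lets it understand and respond to new situations.[4][5] A related version, Gemini Robotics-ER, also exists; "ER" stands for "embodied reasoning".[3] Both models were officially released on March 12, 2025.[5]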

On June 24, 2025, Google DeepMind introduced Gemini Robotics On-Device, a version designed and optimized to run locally on robotic devices.[6]

At present, the Gemini Robotics models are available only to selected trusted testers, including Agile Robots, Agility Robotics, Boston Dynamics, and Enchanted Tools.[2]

References

  1. Gemini Robotics. deepmind.google. Retrieved 2025-03-12.
  2. Parada, Carolina. Gemini Robotics brings AI into the physical world. Google DeepMind. Retrieved 2025-07-11.
  3. Knight, Will. Google's Gemini Robotics AI Model Reaches Into the Physical World. WIRED. March 12, 2025. Retrieved 2025-03-12.
  4. Google introduces new AI models for rapidly growing robotics industry. Reuters. March 12, 2025. Retrieved 2025-03-12.
  5. Roth, Emma. Google DeepMind's new AI models help robots perform physical tasks, even without training. The Verge. March 12, 2025. Retrieved 2025-03-12.
  6. Parada, Carolina. Gemini Robotics On-Device brings AI to local robotic devices. Google DeepMind. Retrieved 2025-07-11.