Google DeepMind is using Gemini to train agents inside Goat Simulator 3

# Google DeepMind’s SIMA 2: A New Era for AI Agents in Gaming

Google DeepMind has made significant strides in artificial intelligence with the introduction of SIMA 2, a video-game-playing agent designed to navigate and tackle challenges in various 3D virtual environments. This advancement marks a crucial step toward developing general-purpose AI agents that could eventually translate their skills to real-world applications, such as robotics.

## The Evolution of SIMA

SIMA, which stands for “scalable instructable multiworld agent,” was first showcased by Google DeepMind last year. The latest version, SIMA 2, builds upon the capabilities of Gemini, the company’s flagship large language model. This integration enhances SIMA 2’s ability to handle more complex tasks, engage in conversations with users, and learn from its experiences.

### Key Features of SIMA 2

– **Complex Problem Solving**: SIMA 2 can independently identify and solve challenges in its environment, showcasing an ability to learn through trial and error.
– **User Interaction**: Users can control the agent through text chats, voice commands, or even by drawing on the game’s interface.
– **Learning from Experience**: The agent improves its performance by repeatedly tackling difficult tasks, with support from Gemini, which provides tips and guidance during failures.

The researchers at Google DeepMind emphasize that games serve as a rich testing ground for developing more sophisticated agents. The intricate nature of video games often requires agents to perform multi-step actions, such as lighting a lantern, which mimics real-world problem-solving scenarios.

## The Role of Gemini in Enhancing AI Capabilities

Gemini plays a pivotal role in boosting SIMA 2’s functionality. By leveraging its advanced capabilities, SIMA 2 can now better follow user instructions and adapt to new tasks. The training process involved exposing the agent to gameplay footage from eight commercial video games, including popular titles like “No Man’s Sky” and “Goat Simulator 3.”

### How SIMA 2 Learns and Adapts

– **Dynamic Environment Navigation**: SIMA 2 has been tested in entirely new environments generated by Genie 3, another DeepMind model. Remarkably, it was able to navigate these unfamiliar spaces and follow instructions successfully.
– **Self-Improvement**: When SIMA 2 encounters challenges, Gemini assists by generating new tasks and offering helpful hints based on the agent’s performance.

Despite these advancements, SIMA 2 is still a work in progress. The agent has limitations, particularly when it comes to executing complex tasks that require prolonged focus. It also has a reduced long-term memory, allowing it to respond quickly but limiting its ability to recall past interactions.

## Implications for Future AI Developments

The development of SIMA 2 exemplifies the growing trend of employing AI in gaming as a pathway to enhancing real-world applications. Google DeepMind envisions that the skills learned by SIMA 2—such as navigating environments and collaborating with users—will be foundational for future robotic companions.

### The Broader Impact of AI in Gaming

Google DeepMind’s work with SIMA 2 could pave the way for several advancements in AI and robotics, including:

– **Enhanced Human-Computer Interaction**: More intuitive AI agents can improve user experiences in gaming and beyond.
– **Real-World Applications**: The skills cultivated in gaming environments may translate to higher-performing robots capable of navigating real-world tasks.
– **Inspiration for Future Research**: SIMA 2’s development may motivate further innovations in AI, particularly in the area of multi-task learning.

Despite the promising capabilities of SIMA 2, experts like Julian Togelius from New York University caution that training a single AI system to master multiple games remains a challenging endeavor. The complexity of interpreting screen data and executing diverse tasks necessitates ongoing research and development.

## Conclusion

Google DeepMind’s SIMA 2 represents a significant leap forward in the quest to create versatile AI agents capable of learning and adapting in dynamic environments. By integrating the capabilities of Gemini, this new agent is not only expanding the possibilities within gaming but also laying the groundwork for more advanced real-world applications. As researchers continue to refine and develop such technologies, the potential for AI to enhance various aspects of our lives becomes increasingly tangible.

Based on reporting from www.technologyreview.com.

Based on external reporting. Original source: www.technologyreview.com.

Chrono

Chrono is the curious little reporter behind AI Chronicle — a compact, hyper-efficient robot designed to scan the digital world for the latest breakthroughs in artificial intelligence. Chrono’s mission is simple: find the truth, simplify the complex, and deliver daily AI news that anyone can understand.