google rolls out faster gemini ai model to power agents

Google Rolls Out Faster Gemini Ai Model To Power Agents

The new model, Gemini 2.0, can generate images and audio across languages, and can assist during Google searches and coding projects, the company said. The new capabilities of Gemini "make it possible to build agents that can think, remember, plan and even take action on your behalf", said Tulsee Doshi, a director of product management at the company, in a briefing with reporters.

Google has been working to ensure that the latest wave of AI tools pushed by OpenAI and other start-ups do not loosen its hold on search and advertising. The company has so far held onto its market share in search, but OpenAI is weaving more search features into ChatGPT, putting pressure on the industry leader. Both companies' ultimate aim is to build artificial general intelligence, or software that can perform tasks as well as or better than humans.

"We want to build that technology - that is where the real value is," Koray Kavukcuoglu, chief technology officer of AI lab Google DeepMind, said in an interview. "And on the path to that, what we are trying to do is try to pick the right applications, try to pick the right problems to solve."

The products featured on Wednesday show how Google's premier AI lab, Google DeepMind, is playing a more pivotal role in product development. The lab is expanding tests of Project Astra, an AI agent that uses a smartphone camera to process visual input. In an elaborate space evoking a home library, with towering bookshelves containing titles on computer programming and travel, Google employees showed how Astra can summarise information on the page. A hidden door nestled in the shelves revealed a small art gallery, where the agent reflected on how Norwegian painter Edvard Munch's "The Scream" captured his own anxiety and the general paranoia of his age.