Google just released the “largest and most capable” AI Model, Gemini, in its AI Chatbot Bard and is expected to compete against Tech Giants in AI Development.
Gemini comes in three sizes: Gemini Ultra, for “highly complex tasks,” Gemini Pro, for “a wide range of tasks,” and Gemini Nano, which can be used for specific tasks and on mobile devices.
Gemini is described by Google as “natively multimodal” because it was trained on images, video, and audio rather than just text, as the large language models at the heart of the recent generative AI boom are. “It’s our largest and most capable model; it’s also our most general.” Eli Collins, vice president of product for Google DeepMind, said at a press briefing announcing Gemini.
An initial version of Gemini starts to roll out today inside Google’s chatbot Bard for the English language setting. It will be available in more than 170 countries and territories. Google says Gemini will be made available to developers through Google Cloud’s API from December 13.
A more compact version of the model will from today power suggested messaging replies from the keyboard of Pixel 8 smartphones. Gemini will be introduced into other Google products including generative search, ads, and Chrome in “coming months,” the company says. The most powerful Gemini version of all will debut in 2024, pending “extensive trust and safety checks,” Google says.
The model’s multimodal capabilities have given Bard new skills and made it better at tasks such as summarizing content, brainstorming, writing, and planning. “These are the biggest single quality improvements of Bard since we’ve launched,” Vice President at Google and general manager for Bard Sissie Hsiao said.
Gemini was named to mark the twinning of Google’s two major AI labs and as a reference to NASA’s Project Gemini, which paved the way for the Apollo Program’s moon landings.