Another day, another generative AI update.
Google’s AI subsidiary DeepMind has previewed Gemini 1.5 Pro, an upgraded model of Google’s renamed Bard chatbot Gemini. Gemini got its new name less than a week ago along with the release of the premium, paid version, Ultra, which Google called “our largest and most capable state-of-the-art AI model.”
Gemini 1.5 Pro is the latest evolution of Google’s chatbot, which also recently gained the ability to generate images from text.
Gemini 1.5 Pro can ingest video, images, audio and text in order to answer questions, and it boasts multiple advantages over its predecessors — but most of us can’t get our hands on it yet. In a call with press on Wednesday, DeepMind announced it’s giving access to developers and enterprise customers first.
Oriol Vinyals, vice president of research at Google DeepMind and co-lead of Gemini, called this a “research release” for “an audience that understands the technology really well.”
“When you create a new model — and especially when we unlock some new capabilities — I think it makes sense to see what creative minds … can do with the model to understand what will this model [do], how will this matter to users ultimately?” Vinyals added.
DeepMind will “roll it out slowly” to regular Joes and Janes via a wait list.
The limited release of Gemini 1.5 Pro comes amid a flurry of activity in a sector projected to reach $1.3 trillion in revenue by 2032. Meanwhile, ChatGPT maker OpenAI has released its GPT-4 Turbo large language model and allows anyone to create custom AI apps for its app store. Microsoft intends to add a dedicated key on Windows 11 laptops and PCs to launch its AI tool, Copilot.
Better performance, new architecture and a longer context window
Gemini 1.5 Pro is “as capable as” the Gemini 1.0 Ultra model, which Google announced on Feb. 8. The 1.5 Pro model has a win rate — a measurement of how many benchmarks it can outperform — of 87% compared to 1.0 Pro and 55% against 1.0 Ultra….
Read the full article here