AI developments are happening pretty fast. If you don’t stop and look around once in a while, you could miss them.
Fortunately, I’m looking around for you and what I saw this week is that competition between OpenAI, maker of ChatGPT and Dall-E, and Google continues to heat up in a way that’s worth paying attention to.
A week after updating its Bard chatbot and changing the name to Gemini, Google’s DeepMind AI subsidiary previewed the next version of its generative AI chatbot. DeepMind told CNET’s Lisa Lacy that Gemini 1.5 will be rolled out “slowly” to regular people who sign up for a wait list and will be available now only to developers and enterprise customers.
Gemini 1.5 Pro, Lacy reports, is “as capable as” the Gemini 1.0 Ultra model, which Google announced on Feb. 8. The 1.5 Pro model has a win rate — a measurement of how many benchmarks it can outperform — of 87% compared with the 1.0 Pro and 55% against the 1.0 Ultra. So the 1.5 Pro is essentially an upgraded version of the best available model now.
Gemini 1.5 Pro can ingest video, images, audio and text to answer questions, added Lacy. Oriol Vinyals, vice president of research at Google DeepMind and co-lead of Gemini, described 1.5 as a “research release” and said the model is “very efficient” thanks to a unique architecture that can answer questions by zeroing in on expert sources in that particular subject rather than seeking the answer from all possible sources.
Meanwhile, OpenAI announced a new text-to-video model called Sora that’s capturing a lot of attention because of the photorealistic videos it’s able to generate. Sora can “create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions.” Following up on a promise it made, along with Google and Meta last week, to watermark AI-generated images and video, OpenAI says it’s also creating tools to detect videos created with Sora so they can be identified as being AI…
Read the full article here