On Wednesday, Google announced the arrival of Gemini, its new multimodal large language model built from the ground up by the company’s AI division, DeepMind. Among its many functions, Gemini will underpin Google Bard, which has previously struggled to emerge from the shadow of its chatbot forerunner, OpenAI’s ChatGPT.
According to a December 6 blog post from Google CEO Sundar Pichai and DeepMind co-founder and CEO Demis Hassabis, the LLM comes in three versions—Gemini Ultra, Pro, and Nano—each meant for different applications. A “fine-tuned” Gemini Pro now underpins Bard, while the Nano variant will appear in products such as the Pixel 8 Pro smartphone. The Gemini variants will also arrive for Google Search, Ads, and Chrome in the coming months, although public access to Ultra will not become available until 2024.
Unlike many of its AI competitors, Gemini was trained to be “multimodal” from the start, meaning it can already handle text, audio, and image-based prompts. In an accompanying video demonstration, Gemini is verbally asked to identify what is placed in front of it (a piece of paper), then correctly identifies a user’s sketch of a duck in real time. Other abilities appear to include inferring what happens next in a paused video, generating music from visual prompts, and assessing children’s homework—often with a slightly cheeky, pun-prone personality. It’s worth noting, however, that the video description includes the disclaimer, “For the purposes of this demo, latency has been reduced and Gemini outputs have been shortened for brevity.”
Gemini’s accompanying technical report indicates the LLM’s most powerful iteration, Ultra, “exceeds current state-of-the-art results on 30 of the 32 widely-used academic benchmarks used in [LLM] research and development.” That said,…