Get the latest tech news

Google's new Gemini models achieve 'near-perfect recall'


The company released three experimental models in the Gemini 1.5 family. Here's what's improved.

Shortly after their release, the latest Gemini 1.5 Pro was ranked #2 and Flash #6 overall in the Chatbot Arena, keeping them essentially neck-and-neck with GPT-4o and GPT-4o mini, respectively. In a technical report from earlier this month, the DeepMind team called their capabilities "unprecedented among contemporary large language models (LLMs)," stating that Gemini 1.5 can process multimodal inputs such as "entire collections of documents, multiple hours of video, and almost five days long of audio." The report also mentioned potential use cases, touting Gemini 1.5's ability to help professionals save up to 75% of their time on tasks across 10 job categories, among some surprising "frontier" skills: "When given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content," the report notes.

Get the Android app

Or read this on ZDNet

Read more on:

Photo of Google

Google

Photo of gemini

gemini

Photo of New Gemini models

New Gemini models

Related news:

News photo

Google’s Gemini AI gets major upgrade with ‘Gems’ assistants and Imagen 3

News photo

Google trains a GenAI model to simulate Doom's game engine in real-ish time

News photo

After 12 years at Google, I have decided it is time to move on