Get the latest tech news

Google's new Gemini models achieve 'near-perfect recall'

The company released three experimental models in the Gemini 1.5 family. Here's what's improved.

Shortly after their release, the latest Gemini 1.5 Pro was ranked #2 and Flash #6 overall in the Chatbot Arena, keeping them essentially neck-and-neck with GPT-4o and GPT-4o mini, respectively. In a technical report from earlier this month, the DeepMind team called their capabilities "unprecedented among contemporary large language models (LLMs)," stating that Gemini 1.5 can process multimodal inputs such as "entire collections of documents, multiple hours of video, and almost five days long of audio." The report also mentioned potential use cases, touting Gemini 1.5's ability to help professionals save up to 75% of their time on tasks across 10 job categories, among some surprising "frontier" skills: "When given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content," the report notes.

Get the Android app

Or read this on ZDNet