Get the latest tech news

OpenAI Announces Benchmarks for AI Life Sciences Research. Its Best Model Failed 63.9% of the Test


This week OpenAI announced a 750-task test to to measure "whether AI systems can support realistic life science research tasks, not just answer biology questions." But while OpenAI's top-performing GPT-Rosalind model led the rankings, Slashdot reader BrianFagioli notes that "it achieved a pass rat...

None

Get the Android app

Or read this on Slashdot

Read more on:

Photo of OpenAI

OpenAI

Photo of test

test

Photo of benchmarks

benchmarks

Related news:

News photo

China puts world’s first smart squid fishing robot to the test

News photo

Amazon drops Sam Altman movie after announcing OpenAI partnership

News photo

Amazon drops Sam Altman movie after announcing OpenAI partnership