We.Love.Privacy.Club @<slashdot https://feeds.twtxt.net/slashdot/twtxt.txt> "**OpenAI Announces Benchmarks for AI Life Sciences Research. Its Best Model Failed 63.9% of the Test** This week OpenAI announced a 750-task tes ..."

feeds.twtxt.net

OpenAI Announces Benchmarks for AI Life Sciences Research. Its Best Model Failed 63.9% of the Test
This week OpenAI announced a 750-task test to to measure “whether AI systems can support realistic life science research tasks, not just answer biology questions.”

But while OpenAI’s top-performing GPT-Rosalind model led the rankings, Slashdot reader BrianFagioli notes that “it a … ⌘ Read more

⤋ Read More

Participate