OpenAI Announces Benchmarks for AI Life Sciences Research. Its Best Model Failed 63.9% of the Test
This week OpenAI announced a 750-task test to to measure “whether AI systems can support realistic life science research tasks, not just answer biology questions.”
But while OpenAI’s top-performing GPT-Rosalind model led the rankings, Slashdot reader BrianFagioli notes that “it a … ⌘ Read more