hacker-news (feeds.twtxt.net), Sun, Nov 9 1:18AM
Study identifies weaknesses in how AI systems are evaluated