OpenAI Accused of Training GPT-4o on Unlicensed O’Reilly Books
A new paper from the AI Disclosures Project claims that OpenAI likely trained its GPT-4o model on paywalled O’Reilly Media books without a licensing agreement. The nonprofit, co-founded by O’Reilly Media CEO Tim O’Reilly, used a method called DE-COP to detect copyrighted content in language model training data.

Researchers ana …
