Researchers Reveal LLMs Memorize Training Books

On Tuesday, researchers at Stanford and Yale revealed that four popular large language models (OpenAI’s GPT, Anthropic’s Claude, Google’s Gemini, and xAI’s Grok) can store and reproduce large portions of the books they were trained on. Claude produced near-complete texts of Harry Potter and several classics, illustrating both memorization and lossy-compression behavior. The finding contradicts company claims and exposes the industry to substantial copyright liability that could cost it billions.
Scoring Rationale
Strong novelty and industry-wide legal impact from credible Stanford/Yale research, though the study is limited to thirteen tested books and four models.

