Microsoft Recommends Training On Harry Potter Dataset

In November 2024, a Microsoft blog post by senior product manager Pooja Kamath linked to a Kaggle dataset containing all seven Harry Potter books incorrectly marked as public domain, and recommended developers train LLMs and generate fan fiction. After backlash and an Ars Technica inquiry, Microsoft removed the post and the Kaggle dataset was deleted; legal experts warned the guidance risked copyright infringement.
Scoring Rationale
Notable corporate misstep with clear legal implications, but limited novelty beyond confirming known copyright risks.
Practice with real FinTech & Trading data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all FinTech & Trading problems

