Large Language Models Exhibit Homogeneous Creative Outputs

Researchers Emily Wenger and Yoed N. Kenett report in PNAS Nexus that across standard creativity tasks, outputs from many LLMs—including Gemini, GPT, and Llama—are significantly more similar to each other than human responses. While single AI answers can score as creative as humans, population-level homogenization persists; raising model temperature increases variability but quickly produces gibberish, implying LLMs may narrow brainstorming and creative diversity.
Key Points
- 1Demonstrates LLM outputs are far more similar to each other than human responses across standard creativity tasks
- 2Shows raising temperature increases variability but quickly produces gibberish, revealing limits to model imagination
- 3Warns relying on LLMs for brainstorming risks narrowing human thinking and reducing creative diversity
Scoring Rationale
Strong peer-reviewed evidence and broad relevance drive the score, limited by modest novelty beyond prior homogenization findings.
Sources
Public references used for this report.
Practice with real Logistics & Shipping data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all Logistics & Shipping problems

