Research · LLM · creativity evaluation · prompting · temperature control

Large Language Models Reach Average Human Creativity

By LDS Team
Relevance Score: 10.0

A large-scale study led by Professor Karim Jerbi, published in Scientific Reports on January 21, 2026, compared leading LLMs (including GPT-4, Claude, and Gemini) with more than 100,000 human participants on divergent creativity tasks. The researchers found that some models now exceed average human scores on the Divergent Association Task and on creative-writing tests, but the top 10% of human creators still outperform every tested model. The study also found that model creativity can be tuned through temperature settings and prompting.
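The summary does not say how the Divergent Association Task is scored. As the task is commonly described, a word list is scored by the average semantic distance between every pair of words, scaled to roughly 0 to 100. The sketch below illustrates that idea only. The three-dimensional vectors and the name `dat_style_score` are hypothetical and chosen for illustration; this is not the study's pipeline, which would use real word embeddings.

```python
import math
from itertools import combinations

def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)

def dat_style_score(vectors):
    """Average pairwise cosine distance, scaled by 100 (illustrative only)."""
    pairs = list(combinations(vectors, 2))
    if not pairs:
        raise ValueError("need at least two word vectors")
    return 100.0 * sum(cosine_distance(u, v) for u, v in pairs) / len(pairs)

# Hypothetical toy embeddings; real scoring would use pretrained word vectors.
toy_embeddings = {
    "cat": [1.0, 0.0, 0.0],
    "dog": [0.9, 0.1, 0.0],
    "galaxy": [0.0, 0.0, 1.0],
    "justice": [0.0, 1.0, 0.0],
}

related = dat_style_score([toy_embeddings["cat"], toy_embeddings["dog"]])
unrelated = dat_style_score(
    [toy_embeddings["cat"], toy_embeddings["galaxy"], toy_embeddings["justice"]]
)
print(f"related words: {related:.2f}, unrelated words: {unrelated:.2f}")
```

Under this scheme, a list of semantically distant words scores higher than a list of near-synonyms, which is why the task is used as a proxy for divergent thinking.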

Key Points

  1. Shows some LLMs (e.g., GPT-4) outperform average humans on divergent creativity tasks.
  2. Finds top human creators (top 10%) still surpass every tested AI model in creativity.
  3. Demonstrates AI creativity is adjustable via temperature and prompt framing, enabling controllable outputs.
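To show why temperature affects output diversity, here is a minimal sketch of temperature-scaled softmax sampling, the standard mechanism behind the temperature setting in most LLM decoders. The logits and function names are hypothetical, and the study's own settings are not given in this summary.

```python
import math

def softmax_with_temperature(logits, temperature):
    """Convert logits to probabilities after dividing by the temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def entropy(probs):
    """Shannon entropy in nats; higher means a flatter distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)

# Hypothetical next-token logits for four candidate words.
logits = [2.0, 1.0, 0.5, 0.1]

low = softmax_with_temperature(logits, 0.5)   # sharper: favors the top token
high = softmax_with_temperature(logits, 1.5)  # flatter: rarer tokens more likely

print("T=0.5:", [round(p, 3) for p in low], "entropy", round(entropy(low), 3))
print("T=1.5:", [round(p, 3) for p in high], "entropy", round(entropy(high), 3))
```

A higher temperature spreads probability mass toward less likely tokens, which tends to produce more varied word choices. A lower temperature concentrates it on the most likely token.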

Scoring Rationale

Peer-reviewed, large-sample study showing that LLMs have reached parity with average humans on creativity tasks; the key limitation is that top human creators still outperform every tested model.
