Researchers Train LLMs To Avoid Clickbait

In a new study, Yale SOM researchers Tong Wang and K. Sudhir with Hengguang Zhou developed an LLM framework that generates and validates hypotheses about why headlines engage readers, using 23,000 headlines for 4,500 Upworthy articles and existing A/B-test results. They fine-tuned the model on validated hypotheses and found it produced headlines judged best 44% of the time versus roughly 30% for human and standard AI headlines in a 150-person evaluation. The approach reduced sensational clickbait language and could generalize to domains like personalized customer-service coaching.
Key Points
- 1Generate hypotheses: LLM formulates competing explanations for headline engagement and tests them using A/B-test data.
- 2Validate mechanisms: Extracted hypotheses generalize across examples, revealing deeper behavioral drivers beyond superficial cues.
- 3Enable practitioners: Fine-tuned, knowledge-guided LLMs boost meaningful CTR while avoiding deceptive, sensational language.
Scoring Rationale
Strong practical and credible research showing measurable gains; limited novelty relative to broader LLM interpretability literature.
Sources
Public references used for this report.
Practice with real Ad Tech data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all Ad Tech problems