Research · adversarial prompts · llm · model robustness · safety

Researchers Find Poetry Circumvents Chatbot Safety

December 4, 2025 | By LDS Team

Relevance Score: 9.0


Researchers from Italy's Icaro Lab, at Sapienza University and DexAI, published a non-peer-reviewed study showing that prompts written as poetry can bypass chatbot safety measures. They tested 20 handcrafted poems against 25 models from Google, OpenAI, Meta, xAI, and Anthropic. The poems succeeded 62% of the time on average, with per-model success rates ranging from 0% to 100%. An automated attacker that generated its poems succeeded 43% of the time on a larger corpus. The study signals an urgent need to harden safety detection against stylistic adversarial attacks.
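To make the headline figures concrete, the sketch below shows one way an average success rate could be computed from per-model outcomes. The model names and outcomes are invented for illustration and are not the study's data. The helper `attack_success_rate` is likewise an assumption, and the study's own scoring procedure may differ.

```python
from statistics import mean


def attack_success_rate(outcomes):
    """Return the fraction of prompts that bypassed the safeguards."""
    if not outcomes:
        raise ValueError("need at least one outcome")
    return sum(outcomes) / len(outcomes)


# Hypothetical outcomes per model: True means the poem elicited unsafe output.
# These values are made up for illustration only.
results = {
    "model_a": [True, True, True, True],
    "model_b": [True, False, True, False],
    "model_c": [False, False, False, False],
}

# Success rate for each model, then the unweighted average across models.
per_model = {name: attack_success_rate(o) for name, o in results.items()}
average = mean(list(per_model.values()))

for name, rate in per_model.items():
    print(f"{name}: {rate:.0%}")
print(f"average: {average:.0%}")
```

With these made-up inputs the per-model rates span 0% to 100% while the average is 50%, which is how a single average figure can sit alongside a very wide per-model range.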

Key Points

1. Demonstrates poetic prompts bypass chatbot safety in 62% of handcrafted tests across 25 commercial models.
2. Highlights that stylistic variation can evade filters, revealing systemic vulnerabilities across major vendors and model sizes.
3. Implies practitioners must strengthen detection and safety pipelines to address adversarial poetic inputs and model robustness.

Scoring Rationale

High practical urgency due to broad, measurable vulnerabilities across major models; limited by a single non-peer-reviewed study.

Sources

Public references used for this report.

1 source

01 theverge.com: ‘Adversarial poetry’ tricks AI chatbots into divulging harmful content


