Researchllmdata poisoningbackdoor attacks

Researchers Demonstrate LLM Training Poisoning Triggers Gibberish

|December 15, 2025|By LDS Team

10.0

Relevance Score

Researchers Demonstrate LLM Training Poisoning Triggers Gibberish — Photo: hackaday.com · rights & takedowns

Researchers at Anthropic, the UK AI Security Institute and the Alan Turing Institute report new experiments showing that inserting just 250 carefully crafted 'poison' training documents can backdoor large language models to output gibberish when triggered by a specific phrase. Tests across models from 600 million to 13 billion parameters used the trigger word 'sudo', demonstrating parts-per-million vulnerability with implications for dataset hygiene and model provenance.

Key Points

1Show vulnerability in LLMs using just 250 poisoned training documents to trigger output corruption.
2Demonstrate attacks at parts-per-million scale across models from 600M to 13B parameters.
3Warn practitioners to verify outputs and harden data pipelines, ingestion, and provenance checks.

Scoring Rationale

Strong empirical demonstration of low-cost poisoning across LLM scales, enabling urgent defenses; limited to gibberish backdoor scenario.

Sources

Public references used for this report.

1 source

01hackaday.comIt Only Takes A Handful Of Samples To Poison Any Size LLM, Anthropic Finds

Practice with real Logistics & Shipping data

90 SQL & Python problems · 15 industry datasets

Used by DS/ML engineers at top companies

High-Value Overnight OrdersEasy

Delivered International ShipmentsMedium

On-Time Delivery Rate by CarrierHard

250 free problems · No credit card

See all Logistics & Shipping problems

Researchllmdata poisoningbackdoor attacks

Researchers Demonstrate LLM Training Poisoning Triggers Gibberish

|December 15, 2025|By LDS Team

10.0

Relevance Score

Key Points

1Show vulnerability in LLMs using just 250 poisoned training documents to trigger output corruption.
2Demonstrate attacks at parts-per-million scale across models from 600M to 13B parameters.
3Warn practitioners to verify outputs and harden data pipelines, ingestion, and provenance checks.

Scoring Rationale

Strong empirical demonstration of low-cost poisoning across LLM scales, enabling urgent defenses; limited to gibberish backdoor scenario.

Sources

Public references used for this report.

1 source

01hackaday.comIt Only Takes A Handful Of Samples To Poison Any Size LLM, Anthropic Finds

Practice with real Logistics & Shipping data

90 SQL & Python problems · 15 industry datasets

Used by DS/ML engineers at top companies

High-Value Overnight OrdersEasy

Delivered International ShipmentsMedium

On-Time Delivery Rate by CarrierHard

250 free problems · No credit card

See all Logistics & Shipping problems

Researchers Demonstrate LLM Training Poisoning Triggers Gibberish

Key Points

Scoring Rationale

Sources

More AI & Data Science News

Portugal Launches Amalia Open Source Portuguese Language Model

UN And ITU Launch AI For Good Global Commission

Author Documents Agentic Coding on Galapogos Island

Sai Insights Explains 30 Ideas Powering AI Agents

Researchers Demonstrate LLM Training Poisoning Triggers Gibberish

Key Points

Scoring Rationale

Sources

More AI & Data Science News

Portugal Launches Amalia Open Source Portuguese Language Model

UN And ITU Launch AI For Good Global Commission

Author Documents Agentic Coding on Galapogos Island

Sai Insights Explains 30 Ideas Powering AI Agents