Researchbio foundation modelsfine tuningprobingbiorisk evaluation

Bio-Foundation Models Retain Harmful Knowledge Despite Filtering

|December 4, 2025|By LDS Team

10.0

Relevance Score

Bio-Foundation Models Retain Harmful Knowledge Despite Filtering — Photo: blog.citp.princeton.edu · rights & takedowns

Researchers from Scale AI, Princeton, University of Maryland, SecureBio, and the Center for AI Safety introduce BioRiskEval, a new framework released to evaluate dual-use risks in open-weight bio-foundation models. Using Evo2-7B, they show fine-tuning can restore filtered viral capabilities within 50 steps (under one hour on a single H100) and linear probing reveals persistent predictive signals, though model accuracy (mutational effect correlation ≈0.2) remains modest.

Key Points

1Demonstrates fine-tuning reintroduces filtered viral knowledge within 50 steps (~1 hour on a single H100).
2Shows linear probing uncovers predictive signals in hidden layers despite prior dataset filtering.
3Implies practitioners must adopt defense-in-depth and lifecycle governance beyond sole reliance on data filtering.

Scoring Rationale

Comprehensive empirical evaluation and reproducible attacks justify a top score, limited by modest current model performance and dataset scope.

Sources

Public references used for this report.

1 source

01blog.citp.princeton.eduThe Limits of Data Filtering in Bio-Foundation Models

Practice interview problems based on real data

1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems

Bio-Foundation Models Retain Harmful Knowledge Despite Filtering

Key Points

Scoring Rationale

Sources

More AI & Data Science News

GitHub Copilot Browser Tools Reach General Availability In VS Code

Cisco Rolls Out AI Agents To All 90,000 Employees

Zoom acquires Common Room to add buyer intelligence

Karen Hao Critiques Sam Altman, OpenAI and AGI Narratives

Bio-Foundation Models Retain Harmful Knowledge Despite Filtering

Key Points

Scoring Rationale

Sources

More AI & Data Science News

GitHub Copilot Browser Tools Reach General Availability In VS Code

Cisco Rolls Out AI Agents To All 90,000 Employees

Zoom acquires Common Room to add buyer intelligence

Karen Hao Critiques Sam Altman, OpenAI and AGI Narratives