DNA Models Enable Sequence Reconstruction Via Embeddings

A March 6, 2026 arXiv preprint by Sofiane Ouaari et al. evaluates reconstruction risks from embeddings produced by DNA foundation models. Testing DNABERT-2, Evo 2, and Nucleotide Transformer v2, the authors find that per-token embeddings enable near-perfect sequence reconstruction, while reconstruction from mean-pooled embeddings degrades with sequence length yet remains above random baselines. The results highlight urgent privacy risks for Embeddings-as-a-Service in genomics.
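
To give a rough intuition for why per-token embeddings expose more than mean-pooled ones, here is a toy sketch in Python. It is not the preprint's attack and uses no real model: the random lookup `table` stands in for an encoder, and the helper names (`toy_embed`, `invert_per_token`, `composition_from_mean`) are hypothetical. Real DNA foundation models produce context-dependent embeddings, so this only illustrates the general idea that one vector per position can be matched back to a nucleotide, while a single averaged vector in this toy setting keeps only base composition.

```python
import numpy as np

ALPHABET = "ACGT"

def toy_embed(seq, table):
    """Look up one vector per nucleotide (a stand-in for per-token embeddings)."""
    idx = [ALPHABET.index(ch) for ch in seq]
    return table[idx]

def invert_per_token(emb, table):
    """Recover each nucleotide by nearest-neighbour search against the table."""
    # Squared distance between every token vector and every table row.
    dists = ((emb[:, None, :] - table[None, :, :]) ** 2).sum(axis=-1)
    return "".join(ALPHABET[i] for i in dists.argmin(axis=1))

def composition_from_mean(pooled, table, length):
    """Estimate base counts from a mean-pooled vector via least squares."""
    # In this toy model, pooled == table.T @ (counts / length).
    freqs, *_ = np.linalg.lstsq(table.T, pooled, rcond=None)
    return np.rint(freqs * length).astype(int)

# Assumption: a random 16-dimensional vector per nucleotide, not a trained model.
rng = np.random.default_rng(0)
table = rng.normal(size=(4, 16))

seq = "ACGTTGCAAC"
per_token = toy_embed(seq, table)   # shape (10, 16): one vector per position
pooled = per_token.mean(axis=0)     # shape (16,): order information is gone

recovered = invert_per_token(per_token, table)
counts = composition_from_mean(pooled, table, len(seq))

print(recovered == seq)                        # per-token view recovers the sequence
print(dict(zip(ALPHABET, counts.tolist())))    # pooled view yields only base counts
```

In this simplified setting the pooled vector carries no positional signal at all. The preprint reports that real mean-pooled embeddings still allow above-random reconstruction, which suggests they retain more than composition.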
Scoring Rationale
Strong methodological novelty and practical attack demonstrations with released code; limited by its preprint status, as the work has not yet been peer reviewed.

