Rice Researchers Unlock Scalable DNA Design

Rice University scientists have unveiled CLASSIC, a technique combining long- and short-read sequencing with machine learning to generate and analyze hundreds of thousands to millions of DNA designs and predict genetic circuit behavior. Using barcoded libraries inserted into human embryonic kidney cells and models trained on massive datasets, the team achieved perfect matches for all 40 predicted sequences and published the results in Nature, enabling faster, scalable design for cell-based therapies.
Key Points
- 1Generate massive DNA libraries: hundreds of thousands to millions of genetic circuits produced and assayed
- 2Combine sequencing methods: hybrid long- and short-read sequencing links full circuits to precise barcoded performance
- 3Train ML models: predictive models matched 40/40 out-of-sample sequences, accelerating design for cell therapies
Scoring Rationale
High novelty and peer-reviewed Nature publication; broad lab-scale advance, but cross-cell validation and industrial translation remain unproven.
Sources
Public references used for this report.
Practice interview problems based on real data
1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems