Research, long context LLM, genomics, protein design
Researchers Release GENERator Long-Context Genomic Model
Relevance Score: 9.1
Researchers released GENERator, a generative genomic foundation model (submitted to arXiv on Jan 22, 2026) that models DNA with a 98,000-nucleotide context window and is pre-trained on 386 billion nucleotides of eukaryotic DNA. Without fine-tuning, it yields phylogenetically coherent embeddings and competitive zero-shot variant effect prediction. With task-specific fine-tuning, it achieves state-of-the-art benchmark results and enables the design of protein-coding sequences and cis-regulatory elements validated by UMI-STARR-seq.


