CALDERA Simplifies Causal Gene Identification In GWAS
Schipper et al. (published March 17, 2026) present CALDERA, a logistic regression–based gene prioritization tool trained on a data-driven truth set of 200 GWAS loci that uses just four input features. In independent benchmarks CALDERA matched or outperformed FLAMES, L2G, and cS2G, produced well-calibrated causal probabilities, and when applied to 93 UK Biobank traits predicted 11,956 putative causal genes, resolving up to 52% of loci.
Scoring Rationale
Strong practical advance with open-source validation; scope primarily limited to GWAS gene prioritization applications.
Practice interview problems based on real data
1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems


