Research | viral genomics | host prediction | dataset
Researchers Publish Improved Mammal-Infection Virus Dataset
Relevance Score: 8.3

On March 27, 2026, Reddy et al. publish an improved, openly available dataset that nearly doubles the number of curated host-virus records and adds primate- and mammal-infection labels for machine learning. They benchmark eight ML models and report that human-infection ROC AUC improves from 0.663 ± 0.070 to 0.784 ± 0.013 under reduced phylogenetic distance. Mammal-level prediction reaches 0.850 ± 0.020, while predictions across novel viral families perform at chance (≈0.50).
Scoring Rationale
High practical value from expanded, shared dataset; limited novelty beyond dataset curation and out-of-sample generalization challenges.
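To make the evaluation terms concrete, here is a minimal sketch of how ROC AUC and a held-out-family split could be computed. It is not the authors' pipeline: the function names `roc_auc` and `leave_one_family_out` and the toy family labels are hypothetical, and the paper's actual models and splitting protocol may differ.

```python
from collections import defaultdict


def roc_auc(labels, scores):
    """ROC AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win,
    so an uninformative constant score yields exactly 0.5 (chance).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("ROC AUC needs at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def leave_one_family_out(families):
    """Yield (held_out_family, train_indices, test_indices).

    Every record from the held-out family goes to the test set, so the
    model never sees that family during training.
    """
    by_family = defaultdict(list)
    for i, fam in enumerate(families):
        by_family[fam].append(i)
    for fam in sorted(by_family):
        train_idx = [i for i, f in enumerate(families) if f != fam]
        yield fam, train_idx, by_family[fam]


if __name__ == "__main__":
    # Toy placeholder labels, not records from the dataset.
    families = ["famA", "famA", "famB", "famC"]
    for fam, train_idx, test_idx in leave_one_family_out(families):
        print(fam, train_idx, test_idx)
    # A constant score carries no ranking information.
    print(roc_auc([0, 1, 0, 1], [0.5, 0.5, 0.5, 0.5]))
```

Splitting by family rather than by random record is what separates the "novel viral families" setting from an ordinary test split: close relatives of the test viruses cannot leak into training.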
Sources
- An improved dataset for predicting mammal infecting viruses from genetic sequence information (journals.plos.org)



