Conifer Applies Machine Learning To Target High Risk Members

Conifer Health Solutions argues population health must move beyond retrospective claims-based risk scores to disease-centered, machine learning driven identification of emerging clinical risk. By aggregating medical, pharmacy, and lab signals into longitudinal disease profiles and embedding clinical informatics into model development, teams can detect deterioration earlier and allocate limited care management capacity to the members most likely to drive avoidable utilization. The shift prioritizes predictive precision in the top percentiles of risk, operational metrics like lead time and positive predictive value, and integration into clinician workflows to make predictions actionable rather than academic.
What happened
Conifer Health Solutions, led by Mary Bacaj, Ph.D., outlines a shift in population health from blunt, historical claims scoring toward disease-centered, machine learning driven risk identification focused on the top 5% of members who drive costs. The argument centers on replacing event-counting claims models with longitudinal disease profiles that surface emerging clinical deterioration early enough for intervention.
Technical details
Machine learning is positioned to improve pattern detection in complex, nonlinear clinical data by combining disparate sources into condition-level profiles. Key modeling and operational elements practitioners should note include:
- •Feature engineering that aggregates pharmacy fills, laboratory trends, specialty utilization, and diagnostic codes into longitudinal disease trajectories
- •Evaluation metrics oriented to operational utility, prioritizing positive predictive value and lead time within the highest risk percentiles rather than global AUC alone
- •Close collaboration between clinical informatics and data science so label design, cohort definitions, and feature sets reflect clinical reasoning and care pathways
- •Practical deployment requirements: near-real-time feeds or regular ingestion pipelines, model retraining cadence driven by concept drift, and explainability to support clinician triage
Context and significance
This is an applied reframing rather than a new algorithmic breakthrough. The meaningful change is methodological and organizational: treat risk models as decision instruments embedded in care management workflows, not static vendor scores. That aligns with broader trends toward disease-centric phenotyping, temporal modeling, and operational ML that measures impact in avoided admissions and clinician time saved. For ML practitioners, the brief reinforces moving beyond proprietary claims features to richer, temporally-aware inputs and evaluation practices tuned to population health operations.
What to watch
Adoption depends on robust data pipelines, prospective validation or pilot outcomes, and governance for fairness and explainability in member selection. Monitor how organizations quantify lead time gains, clinician acceptance, and measurable reductions in preventable utilization as models move from retrospective validation to live deployment.
Scoring Rationale
This is a notable applied shift with practical implications for ML teams building healthcare risk models, but not a frontier-model or infrastructure milestone. It scores as useful and actionable guidance for practitioners, hence a mid-tier impact.
Practice with real Health & Insurance data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all Health & Insurance problemsStep-by-step roadmaps from zero to job-ready — curated courses, salary data, and the exact learning order that gets you hired.

