Models & Researchradiologyelectronic health recordsmultimodal mltransformers

Researchers integrate radiology text and EHR for renal malignancy prediction

|May 27, 2026|By LDS Team

6.6

Relevance Score

Researchers integrate radiology text and EHR for renal malignancy prediction — Photo: asset.jmir.pub · rights & takedowns

A JMIR preprint by Fan et al. reports a retrospective cohort study that develops a multimodal pipeline combining structured electronic health record (EHR) variables with features extracted from radiology report text using large language models and a pretrained biomedical transformer, RadBERT. The preprint evaluates early, middle and late fusion strategies and reports that early fusion achieved an area under the ROC curve (AUC) of 0.818 (± 0.010), with RadBERT-derived contextual embeddings providing the largest performance gain and LLM-extracted abnormality characteristics adding modest incremental improvement, per the preprint. For clinical ML practitioners, the study illustrates how contextual text embeddings fused with structured EHR data can improve preoperative malignancy classification and highlights practical questions about extraction cost, interpretability, and deployment in clinical workflows.

What happened

A JMIR preprint by Fan, Liang, Sun, Pan, Terry, and Xu presents a retrospective cohort study that builds a multimodal prediction pipeline to estimate renal tumor malignancy from combined radiology report text and structured EHR data. The authors report using large language models to extract abnormality characteristics and a pretrained biomedical transformer, RadBERT, to generate contextual text embeddings. The preprint evaluates three fusion strategies, and reports that early fusion achieved an area under the ROC curve of 0.818 (± 0.010), with textual features from RadBERT driving the largest improvement and LLM-extracted structured characteristics yielding modest additional gains, per the preprint.

Technical details

Per the preprint, the study fuses features from structured EHR variables with text-derived features using early, middle, and late fusion approaches. Performance was measured using standard classification metrics including:

•accuracy
•precision
•recall
•specificity
•AUC
•F1-score

The manuscript reports that RadBERT contextual embeddings outperformed simpler extracted features, while the LLM-based abnormality extraction contributed incremental benefit when combined with embeddings, according to the authors.

Industry context

For practitioners

What to watch

Editorial analysis

Multimodal approaches that combine contextual transformer embeddings with tabular EHR features are increasingly common in clinical ML research because they can capture complementary information from narrative reports and structured records. Observers in the field note that text embeddings from domain-pretrained transformers often yield larger gains than handcrafted or rule-extracted features, while extraction pipelines based on LLMs can add value but introduce additional computational and validation burdens.

Key practical considerations for adoption include the cost of running domain LLMs at scale, the need to validate text-extracted labels against chart review, and model interpretability requirements in clinical decision support. The preprint focuses on model performance metrics; it does not, in the available manuscript, provide deployment or prospective validation details.

Readers should watch for the peer-reviewed JMIR Med Inform version for expanded methodological details, dataset size and characteristics, external validation results, and any code or model release that would enable reproduction and comparative benchmarking.

Key Points

1Combining contextual transformer embeddings from radiology reports with structured EHR features often improves malignancy classification performance.
2Domain-pretrained transformers like RadBERT typically provide larger gains than LLM-extracted structured features, which add modest incremental value.
3Practical adoption requires attention to extraction cost, validation against chart review, and interpretability for clinical decision support.

Scoring Rationale

Notable clinical ML research: the preprint demonstrates measurable AUC improvement from fusing RadBERT embeddings with structured EHRs, which is relevant to practitioners building diagnostic models but is not a landmark model release.

Sources

Public references used for this report.

2 sources

preprints.jmir.orgJMIR Preprints #84396: Multimodal Prediction of Renal Tumor ...

sciprofiles.comMs. Chengkun Sun | Author - SciProfiles

Practice with real Ad Tech data

90 SQL & Python problems · 15 industry datasets

Used by DS/ML engineers at top companies

Active Search Campaigns by BudgetEasy

High CPC Clicks & Poor Landing PagesMedium

Campaign ROAS by Attribution ModelHard

250 free problems · No credit card

See all Ad Tech problems

Models & Researchradiologyelectronic health recordsmultimodal mltransformers