Industry Applicationshealthcareambient aipilot study

Health System Pilots Ambient AI for Clinical Value

|June 26, 2026|By LDS Team

5.6

Relevance Score

Health System Pilots Ambient AI for Clinical Value — Photo: asset.jmir.pub · rights & takedowns

McLeod Health, a nonacademic multihospital system, published a peer-reviewed pilot study in JMIR Medical Informatics describing a structured, multiphase framework for selecting, scaling, and measuring ambient AI for clinical documentation. The system-wide rollout achieved 81% adoption with more than 150,000 notes generated, and coding patterns shifted toward higher-complexity visits (3.8% increase in level 4 established patient visits). The study offers a replicable evaluation methodology for the 84% of US health systems that are nonacademic and face thin margins, physician burnout, and limited vendor evaluation infrastructure.

Background

Most US health systems are nonacademic and operate under financial and staffing pressures that academic medical centers rarely face. Per the JMIR Medical Informatics paper, 84% of the country's 639 health systems operate in a single state, have a median of 399 beds, and run at thin margins. Ambient AI documentation tools promise to reduce physician burden and improve note quality, but vendor selection is often undermined by cognitive biases and unvalidated marketing claims.

What McLeod Health did

McLeod Health developed a structured, multiphase evaluation framework led by its chief medical informatics officer, with a coalition that included system and regional CMOs, ambulatory practice leadership, and the CIO. Phase 1 tasked 25 evaluators (physicians, revenue cycle experts, and nonclinical reviewers) with blind-scoring notes from three competing vendors across 45 sample encounters for accuracy, billing quality, and readability; all 25 submitted rankings within two weeks (100% response rate). Vendor 1 won with 57 points (45.2% of the vote) versus Vendor 2 at 44 (34.9%) and Vendor 3 at 25 (19.8%). Phase 2 ran a 90-day pilot of the selected vendor across five ambulatory specialties beginning October 2024, using key performance indicators including documentation time, coding patterns, and patient and provider satisfaction. Phase 3 was system-wide rollout.

Results

Per the published paper, the system-wide rollout achieved 81% adoption, with more than 150,000 notes generated. Coding patterns shifted toward higher-complexity visits, with a 3.8% increase in level 4 established patient visits. The paper describes substantial gains in documentation efficiency, clinician and patient satisfaction, and financial performance.

Practitioner implications

The structured three-phase evaluation methodology is the most transferable asset: blind vendor scoring reduces selection bias, revenue-cycle reviewers catch billing-quality issues that clinician reviewers may miss, and broad stakeholder buy-in before rollout drives the high adoption rate. For ML and health IT practitioners, the 81% adoption figure at a community system is notable. It also introduces a monitoring obligation: coding up-classification toward higher-complexity visits should be verified against actual clinical encounter data to rule out AI-assisted documentation inflation rather than genuine complexity increases.

Limitations

Single-system pilot at McLeod Health; outcomes may differ at systems with different EMR configurations, specialty mixes, or payer environments. The paper does not name the winning vendor, limiting direct comparability. External replication has not yet been published.

Key Points

1McLeod Health's three-phase framework - blind vendor scoring, limited pilot, then rollout - achieved 81% adoption and 150,000+ ambient AI notes in a nonacademic system.
2Coding shifts toward higher-complexity visits (3.8% more level 4 encounters) signal both efficiency gains and a monitoring obligation: verify AI documentation is not inflating coding.
3The methodology is replicable for the 84% of US health systems that are nonacademic and lack academic medical center resources for vendor vetting.

Scoring Rationale

Peer-reviewed multiphase pilot at a real nonacademic health system with concrete adoption metrics (81%, 150K+ notes). Replicable evaluation methodology valuable for health IT practitioners; single-system scope and vendor anonymity limit generalizability.

MoreHealthcare AI news

Sources

Primary source and supporting public references used for this report.

3 sources

Primary sourcemedinform.jmir.orgSelecting, Scaling, and Measuring the Value of Ambient AI in a Nonacademic Health System: Multiphase Pilot Study

View 2 more sources

Practice with real Health & Insurance data

90 SQL & Python problems · 15 industry datasets

Used by DS/ML engineers at top companies

Active PPO Plans with Rx CoverageEasy

Approved High-Value ClaimsMedium

Denial Rate by Plan TypeHard

250 free problems · No credit card

See all Health & Insurance problems