Research · llm · cardiology · open source · benchmark

DeepSeek R1 Outperforms LLMs On ASCVD Responses

January 7, 2026 | By LDS Team

Relevance Score: 8.0

Researchers conducted a cross-sectional evaluation from May 15 to 30, 2025, comparing DeepSeek R1, ChatGPT-4o, and Gemini on 25 atherosclerotic cardiovascular disease (ASCVD) patient questions in English and Chinese. The models generated 750 responses, which three cardiologists scored. DeepSeek R1 achieved a 96% good-response rate (24/25) with higher accuracy and completeness than the other two models, but none of the models reliably provided guideline-concordant treatment regimens, indicating a need for expert oversight.
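
The headline figures can be checked with simple arithmetic. The sketch below recomputes the 96% rate from the reported 24/25 count. It also derives how many responses each question, model, and language combination would have produced if the 750 responses were spread evenly. That balanced design is an assumption, not something the summary states, and the function names are illustrative only.

```python
from fractions import Fraction

# Figures reported in the summary.
QUESTIONS = 25
MODELS = ("DeepSeek R1", "ChatGPT-4o", "Gemini")
LANGUAGES = ("English", "Chinese")
TOTAL_RESPONSES = 750
GOOD_QUESTIONS = 24  # DeepSeek R1 questions rated good, per language


def good_response_rate(good: int, total: int) -> float:
    """Return the share of questions rated good, as a percentage."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100 * good / total


def responses_per_cell(total_responses: int, questions: int,
                       models: int, languages: int) -> Fraction:
    """Responses per question/model/language combination.

    ASSUMPTION: responses are spread evenly across all combinations.
    A non-integer result would mean the design cannot be balanced.
    """
    cells = questions * models * languages
    return Fraction(total_responses, cells)


rate = good_response_rate(GOOD_QUESTIONS, QUESTIONS)
per_cell = responses_per_cell(
    TOTAL_RESPONSES, QUESTIONS, len(MODELS), len(LANGUAGES)
)
print(f"Good-response rate: {rate:.0f}%")
print(f"Responses per combination (if balanced): {per_cell}")
```

Using `Fraction` keeps the division exact, so an uneven split would show up as a fraction rather than being silently rounded.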

Key Points

1. Demonstrates that DeepSeek R1 achieved a 96% good-response rate (24/25) in both languages.
2. Highlights its superior accuracy and completeness versus ChatGPT-4o and Gemini (P<.001).
3. Warns that all models failed to reliably provide guideline-concordant ASCVD treatment regimens, so expert oversight is required.

Scoring Rationale

Strong comparative evaluation and robust methods, but limited by its narrow focus on ASCVD and by the models' weak guideline concordance.

Sources

Public references used for this report.

1 source

1. medinform.jmir.org: Large Language Models in Patient Health Communication for Atherosclerotic Cardiovascular Disease: Pilot Cross-Sectional Comparative Analysis
