Analysisllmmetrtime horizon
METR May Underestimate LLM Time Horizons
|
5.7
A LessWrong post uses METR human-baseline data to define an alternate LLM time-horizon measure. The measure is described as the longest time horizon over which an LLM exceeds the human baseline.
Scoring Rationale
Moderate novelty and relevance driven by an alternate metric proposal, limited by RSS-only excerpt and single-source post.
Practice with real Logistics & Shipping data
90 SQL & Python problems · 15 industry datasets
Used by DS/ML engineers at top companies
High-Value Overnight OrdersEasyDelivered International ShipmentsMediumOn-Time Delivery Rate by CarrierHard
250 free problems · No credit card
See all Logistics & Shipping problems

