Karpathy Demonstrates Massive GPT-2 Training Cost Reduction
Andrej Karpathy reports that the original 2019 GPT-2 training run used 32 TPU v3 chips for 168 hours (~$43,000) to reach a CORE score of 0.256525. He says recent improvements merged into nanochat (from modded-nanogpt) now achieve a higher CORE score in 3.04 hours (~$73) on a single 8x H100 node, representing roughly a 600× cost reduction, or an estimated 2.5× annual decline in cost over seven years.
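The headline ratios follow from quick arithmetic on the two figures quoted above (a sketch; the ~$43,000 and ~$73 cost estimates are taken directly from the summary, not independently verified):

```python
# Back-of-the-envelope check of the reported cost reduction.
cost_2019 = 43_000   # ~$43k: 32 TPU v3 chips for 168 hours (2019 run)
cost_now = 73        # ~$73: 3.04 hours on a single 8x H100 node

reduction = cost_2019 / cost_now       # overall cost ratio, ~589x
annual = reduction ** (1 / 7)          # geometric mean over seven years, ~2.49x

print(f"cost reduction: ~{reduction:.0f}x")
print(f"annual decline: ~{annual:.2f}x per year")
```

This yields roughly 589× overall and about 2.49× per year, matching the "about 600×" and "2.5× annual" figures in the report.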
Scoring Rationale
High practical impact and authoritative source, limited by single-source claim and brief, non-reproducible technical detail.
Sources
- "A quote from Andrej Karpathy" — simonwillison.net



