Why These 25?
Hand-curated for maximum interview ROI.
Realistically Dirty Data
Mixed NULLs, type-coerced strings, duplicate rows, mismatched casing, half-formed timestamps, currency-mixed amounts. The shape of dirty data real data engineers and analysts encounter every day — not toy already-clean tables.
Decision-Making, Not Just Syntax
You don’t just learn fillna — you learn when to use forward-fill vs interpolation vs domain-default, when dropna is correct vs catastrophic. The judgment that separates juniors from seniors.
Full Pandas Stack in Your Browser
pandas, numpy, scipy — all run in your browser via Pyodide. No install, no Conda, no virtualenv. The same stack a real analyst uses, available instantly.
Skill Coverage
How the 25 problems distribute across pandas topics.
FAQ
Data cleaning consumes 60-80% of real data work.
Every analytics platform that teaches pandas focuses on the fun parts (groupby, merge) and skips the part that fills your actual workdays.
This collection drills the cleaning + reshape patterns that show up in every ETL pipeline.
Ready to Master Pandas?
Start with Stage 1 — graded instantly in your browser.
All company names, logos, and trademarks are the property of their respective owners. Their use is for identification purposes only and does not imply endorsement.
LDS Pandas Cleaning & Reshape 25 — 25 Curated Python Problems
A 5-stage progression on the pandas work most courses skip: filling missing data, normalizing units and free-text categories, pivoting long DataFrames into wide cross-tabulations, and assembling production-grade feature matrices from messy multi-table pipelines. Twenty-five problems on 15 industry schemas — the practice nobody gives you before your first dirty-data interview.
Problems included in LDS Pandas Cleaning & Reshape 25
- Fill Missing User Income Data
- Fill Missing Patient Contact Info
- Fill Missing HOA Fees
- Fill Missing Phone Numbers
- Fill Missing User Bios
- Normalize Impression Costs to USD
- Normalize Payment Amounts
- Normalize Freight Costs
- Normalize Charge Amounts to Dollars
- Normalize Bill Amounts
- Standardize Merchant Categories
- Standardize Payment Methods
- Standardize Payment Methods
- Standardize Ticket Priority Levels
- Standardize Device OS into Platform Categories
- Pivot Impressions by Country and Device
- Reservation Count by Channel and Status
- Balance Transaction Pivot by Source Type
- Listing Metrics Pivot by Property Type
- Moderation Reports by Reason and Status
- Insurance Plan Health Classification
- Portfolio Feature Matrix
- Restaurant Feature Matrix
- Driver Feature Matrix
- Subscriber Health Classification