Industry Applicationscomputer visionannotation workflowsdata scaling
ML@CMU Builds Video Caption Pipeline at Scale
|
5.6

ML@CMU built a year-long video caption pipeline with 100+ professional creators, documenting the development process and outcomes. The project taught the team that scaling supervision, rather than scaling models, was the key lesson for improving caption production and workflows.
Scoring Rationale
Practical, hands-on case study with significant relevance for teams building captioning pipelines and data workflows; moderate impact for the broader AI research community.
Practice with real Hotels & Lodging data
90 SQL & Python problems · 15 industry datasets
Used by DS/ML engineers at top companies
250 free problems · No credit card
See all Hotels & Lodging problems


