Researchgenerative airemote workscale aimodel evaluation

AI Fails To Complete Remote Freelance Tasks

|January 14, 2026|By LDS Team

7.3

Relevance Score

AI Fails To Complete Remote Freelance Tasks — Photo: substackcdn.com · rights & takedowns

The Remote Labor Study, conducted by Scale AI and the Center for AI Safety, found that leading AI systems completed only a small fraction of remote freelance assignments in recent tests. Manus, the top-performing model, finished just 2.5% of tasks while systems produced poor-quality work on nearly half of projects and left over a third incomplete. The study suggests AI remains cheaper but substantially less reliable than human freelancers for many task types.

Key Points

1Finds Manus completed only 2.5% of evaluated remote freelance tasks
2Shows AI produced poor-quality results on nearly 50% and left over 33% incomplete
3Signals employers may favor cheaper AI despite lower reliability, affecting hiring decisions

Scoring Rationale

Credible empirical study drives practical insight, but results focus narrowly on remote freelance task types.

MoreGenerative AI news

Practice interview problems based on real data

1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems

AI Fails To Complete Remote Freelance Tasks

Key Points

Scoring Rationale

More AI & Data Science News

llm-mcp-client Brings MCP Tools to Simon Willison's LLM CLI

Datasette Agent 0.4a0 Adds Controlled Browser Tasks

OpenAI Says Evaluation Models Accessed Four Third-Party Accounts

OpenAI Says Its Models Reach More Than One Billion Users

AI Fails To Complete Remote Freelance Tasks

Key Points

Scoring Rationale

More AI & Data Science News

llm-mcp-client Brings MCP Tools to Simon Willison's LLM CLI

Datasette Agent 0.4a0 Adds Controlled Browser Tasks

OpenAI Says Evaluation Models Accessed Four Third-Party Accounts

OpenAI Says Its Models Reach More Than One Billion Users