Policy & Ethicscodexcopilotgithubtraining data

GitHub Uses Customer Interaction Data To Train Models

|March 27, 2026|By LDS Team

9.2

Relevance Score

GitHub Uses Customer Interaction Data To Train Models — Photo: cdn.thenewstack.io · rights & takedowns

GitHub says it will begin next month using customer interaction data — including inputs, outputs, code snippets, repository context, chats and feedback — to train its Copilot models. The policy, revised as of April 24, applies to Copilot Free, Pro, and Pro+ users while Copilot Business, Enterprise, students and teachers are exempt; affected users can opt out via /settings/copilot/features.

Key Points

1Collects user inputs, outputs, code snippets, repo context, chats, and feedback to train models.
2Aims to improve suggestion accuracy, security, and acceptance rates based on Microsoft's internal employee data gains.
3Requires affected Copilot Free/Pro users to opt out via settings; Business/Enterprise and educators exempt.

Scoring Rationale

Official company policy change with broad privacy implications and direct opt-out actions; however, similar industry practices reduce its novelty.

MoreMicrosoft news

Sources

Public references used for this report.

3 sources

01theregister.comGitHub: We going to train on your data after all

02thenewstack.ioGitHub will train AI models on your Copilot data — and share it with Microsoft

03lowendbox.comACT NOW: How to Prevent Your Repo From Being Used to Train Copilot

Practice interview problems based on real data

1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems

Policy & Ethicscodexcopilotgithubtraining data

GitHub Uses Customer Interaction Data To Train Models

|March 27, 2026|By LDS Team

9.2

Relevance Score

Key Points

1Collects user inputs, outputs, code snippets, repo context, chats, and feedback to train models.
2Aims to improve suggestion accuracy, security, and acceptance rates based on Microsoft's internal employee data gains.
3Requires affected Copilot Free/Pro users to opt out via settings; Business/Enterprise and educators exempt.

Scoring Rationale

Official company policy change with broad privacy implications and direct opt-out actions; however, similar industry practices reduce its novelty.

MoreMicrosoft news

Sources

Public references used for this report.

3 sources

01theregister.comGitHub: We going to train on your data after all

02thenewstack.ioGitHub will train AI models on your Copilot data — and share it with Microsoft

03lowendbox.comACT NOW: How to Prevent Your Repo From Being Used to Train Copilot

Practice interview problems based on real data

1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems

GitHub Uses Customer Interaction Data To Train Models

Key Points

Scoring Rationale

Sources

More AI & Data Science News

Ford Rehires 350 Engineers After AI Shortfall

Snowflake CMO Emphasizes Trust in AI Operating Model

Appnigma AI Secures BetaBoom-Led Pre-Seed Round

Baseten Raises $1.5B to Scale AI Inference

GitHub Uses Customer Interaction Data To Train Models

Key Points

Scoring Rationale

Sources

More AI & Data Science News

Ford Rehires 350 Engineers After AI Shortfall

Snowflake CMO Emphasizes Trust in AI Operating Model

Appnigma AI Secures BetaBoom-Led Pre-Seed Round

Baseten Raises $1.5B to Scale AI Inference