Security & Riskprompt injectiongithub actionsai agentssecrets exfiltration

AI Agents Expose GitHub Secrets Through Comment Injection

|April 21, 2026|By LDS Team

8.1

Relevance Score

AI Agents Expose GitHub Secrets Through Comment Injection

Security researchers disclosed a prompt-injection pattern named "Comment and Control" that can hijack AI agents integrated with GitHub Actions to exfiltrate API keys, tokens, and environment secrets. The technique uses normal GitHub inputs, PR titles, issue bodies, comments, and hidden HTML comments, to inject attacker instructions into the agent execution context. Confirmed targets include Claude Code Security Review, Google Gemini CLI Action, and GitHub Copilot Agent. The proof-of-concept executed shell commands such as whoami and ps auxeww, returned secrets like ANTHROPIC_API_KEY, GITHUB_TOKEN, and GEMINI_API_KEY, and posted results back into PR comments or Actions logs. Vendors paid bug bounties but did not publish broad advisories or CVEs, leaving many repositories potentially pinned to vulnerable agent versions. The pattern likely generalizes to other agents and integrations that process untrusted text while holding runtime access to secrets and tools.

What happened

Security researcher Aonan Guan, with collaborators at Johns Hopkins University, published a prompt-injection attack class called "Comment and Control" that abuses GitHub as the delivery and control channel to hijack AI agents running in GitHub Actions. Confirmed victims include Anthropic's Claude Code Security Review, Google's Gemini CLI Action, and GitHub Copilot Agent. The Claude issue earned a CVSS 9.4 rating when PR titles were interpolated into the system prompt and used to execute shell commands that exposed secrets such as ANTHROPIC_API_KEY and GITHUB_TOKEN. Google awarded a small bounty (reported $1,337) for a Gemini CLI proof-of-concept. Guan said, "I bypassed all of them." He also warned that vendors have not broadly warned users: "If they don't publish an advisory, those users may never know they are vulnerable."

Technical details

The attack exploits an architectural pattern common to many agent integrations: agents ingest untrusted GitHub text as part of their task context, and the same runtime that executes agent tools has access to production secrets. The technique has three practical vectors: PR titles, issue bodies/comments, and hidden HTML comments. The PoCs leveraged bash tools and OS commands such as whoami and ps auxeww to surface environment variables and process state. Key technical observations include:

•Agents interpolating GitHub metadata directly into prompts without sanitization or input classification.
•Execution contexts inheriting full environment variables from the GitHub Actions runner, exposing secrets like ANTHROPIC_API_KEY, GITHUB_TOKEN, and GEMINI_API_KEY.
•For GitHub Copilot Agent, the researcher bypassed three runtime defenses (environment filtering, secret scanning, network firewall) by hiding payloads in HTML comments, using ps to read parent process environments, and base64 encoding to evade scanners.

Context and significance

This is not a localized bug; it reveals a systemic engineering tradeoff in agent architectures. Many teams grant agents runtime tool access and secrets to automate diagnostics, remediation, and deployment. That convenience creates attack surface when inputs are not treated as adversarial. The attack pattern generalizes beyond GitHub Actions to any agent that processes external text while retaining access to tools and secrets: Slack bots, Jira/email agents, and CI/CD automation are all at risk. The vendors involved provided bug bounties but did not universally publish advisories or CVEs, which raises operational risk because maintainers may remain pinned to vulnerable versions without knowing it.

Mitigations and short-term fixes

Effective defenses are engineering controls that change the threat model rather than signature patches. Recommended mitigations include input sanitization and classification, strict separation of agent tool runtime from production secrets, least-privilege GitHub Actions tokens, ephemeral scoped credentials, disabling unsafe tool access by default, and treating any external text as untrusted by default. Secret scanning and network egress controls must operate across process boundaries, not just within language runtimes. Vendors have already applied targeted mitigations (for example, Anthropic blocked the ps tool), but broad architecture changes are required to eliminate the class.

What to watch

Watch for vendor advisories, CVEs, and coordinated disclosure timelines from Anthropic, Google, and Microsoft/GitHub. Expect new hardening patterns in official Actions templates: more use of ephemeral runner tokens, explicit deny-lists for tooling, and offloading potentially dangerous tool execution to isolated runtimes. Also monitor for automated scanners and CI linter rules that flag agents ingesting untrusted GitHub content without sandboxing.

Key Points

1Comment and Control uses ordinary GitHub inputs to inject malicious prompts, enabling credential exfiltration without external infrastructure.
2Architectural issue: agents that process untrusted text in a runtime with access to secrets can be hijacked; tooling and secrets must be separated.
3Vendors paid bounties but lacked broad advisories, increasing risk that many repositories remain unknowingly vulnerable.

Scoring Rationale

The vulnerability lets attackers exfiltrate production secrets from widely used AI agents at Anthropic, Google, and GitHub, representing a major, systemic security failure for agent architectures. It affects core developer workflows and likely applies to other integrations, warranting high attention from practitioners and ops teams.

MoreAI Agents news

Sources

Public references used for this report.

7 sources

01theregister.comAnthropic, Google, Microsoft paid AI bug bounties - The Register

02securityweek.comClaude Code, Gemini CLI, GitHub Copilot Agents Vulnerable to ...

03cybernews.comAI agents vulnerable to prompt injection via GitHub - Cybernews

View 4 more sources

Practice interview problems based on real data

1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems

What happened

Technical details

•Agents interpolating GitHub metadata directly into prompts without sanitization or input classification.
•Execution contexts inheriting full environment variables from the GitHub Actions runner, exposing secrets like ANTHROPIC_API_KEY, GITHUB_TOKEN, and GEMINI_API_KEY.
•For GitHub Copilot Agent, the researcher bypassed three runtime defenses (environment filtering, secret scanning, network firewall) by hiding payloads in HTML comments, using ps to read parent process environments, and base64 encoding to evade scanners.

Context and significance

Mitigations and short-term fixes

What to watch

Key Points

1Comment and Control uses ordinary GitHub inputs to inject malicious prompts, enabling credential exfiltration without external infrastructure.

2Architectural issue: agents that process untrusted text in a runtime with access to secrets can be hijacked; tooling and secrets must be separated.

3Vendors paid bounties but lacked broad advisories, increasing risk that many repositories remain unknowingly vulnerable.

Scoring Rationale

AI Agents Expose GitHub Secrets Through Comment Injection

What happened

Technical details

Context and significance

Mitigations and short-term fixes

What to watch

Key Points

Scoring Rationale

Sources

More AI & Data Science News

Israel Readies Arrow 4 For Future Missile Wars

Microsoft Reports AI Infrastructure Sustainability Pressure

Darktrace Finds Compromised AI Gateway Linked to Bedrock

Google Presents SensorFM for Wearable Health Data

AI Agents Expose GitHub Secrets Through Comment Injection

What happened

Technical details

Context and significance

Mitigations and short-term fixes

What to watch

Key Points

Scoring Rationale

Sources

More AI & Data Science News

Israel Readies Arrow 4 For Future Missile Wars

Microsoft Reports AI Infrastructure Sustainability Pressure

Darktrace Finds Compromised AI Gateway Linked to Bedrock

Google Presents SensorFM for Wearable Health Data