VS Code Adds BYOK for Copilot and Streaming

Visual Studio Code 1.117 introduces three practical updates for developers: Bring Your Own Key (BYOK) for Copilot Business and Enterprise users, experimental incremental chat rendering to stream responses block-by-block, and a terminal fix that lets the Copilot CLI launch from any terminal profile. BYOK lets teams connect OpenRouter, Ollama, OpenAI, Google and other providers via their own API keys, giving organizations control over model choice, compliance, cost, and local inference. Incremental rendering is configurable via chat.experimental.incrementalRendering.* settings and reduces perceived latency for long responses. The release targets enterprise workflow flexibility and improved UX for Copilot-powered chat and CLI flows.
What happened
Visual Studio Code 1.117, released April 22, 2026, adds three targeted improvements: Bring Your Own Key (BYOK) for Copilot Business and Copilot Enterprise, experimental incremental chat rendering, and the ability to launch the Copilot CLI from any terminal profile. The BYOK feature lets teams plug their own model API keys directly into VS Code chat, supporting providers such as OpenRouter, Ollama, OpenAI, and Google. Incremental rendering streams chat responses block-by-block to reduce perceived wait time. The terminal change fixes a long-standing startup failure when non-default shells like fish or Git Bash are used.
Technical details
BYOK is enabled by default and is controllable via the administrative policy named "Bring Your Own Language Model Key" in the organization settings on GitHub.com. After admin enablement, members can configure or install language model provider extensions and supply provider API keys inside VS Code, routing chat requests to those external endpoints rather than Microsoft-hosted model endpoints. This supports both cloud-hosted providers and on-prem or local inference stacks like Ollama.
Technical details
Incremental response rendering is exposed as experimental settings. Key config options are chat.experimental.incrementalRendering.enabled (default: true), chat.experimental.incrementalRendering.animationStyle (options: none, fade, rise, blur, scale, slide, reveal; default: fade), and chat.experimental.incrementalRendering.buffering (options: off, word, paragraph; default: off). The implementation renders content block-by-block as tokens arrive instead of relying on timer-based flushes, so longer outputs appear sooner and feel more conversational.
Technical details
The Copilot CLI can now be launched from any terminal profile in VS Code, addressing a bug where non-default shells prevented the terminal-based CLI from starting. This is a UX and developer workflow improvement rather than a change to the CLI itself.
Context and significance
The BYOK capability is a concrete step in the broader industry trend toward developer and enterprise control over model choice and data locality. By letting organizations use their own keys and model providers, Microsoft reduces backend load and gives teams options for compliance, cost optimization, and latency improvements. Integration with local inference tools like Ollama lets developers run open-source models on-prem or on-device, which matters for sensitive code and regulated industries.
Context and significance
Incremental rendering is a user-facing optimization that mirrors improvements seen across chat UIs and LLM clients; practitioners get a smoother developer experience without changing model architectures. The configurable buffering and animation settings let teams balance immediacy against the risk of displaying partial or malformed Markdown.
What to watch
Administrators should audit and decide whether to enable BYOK and which providers to allow via policy, since BYOK changes where inference occurs. Watch for extension ecosystem updates that add provider integrations, and monitor adoption of local inference stacks like Ollama inside enterprise developer environments. Security and compliance teams will need to update guidance on secret management and API key rotation when keys are stored or referenced from developer machines.
Bottom line
This is a pragmatic release: 1.117 prioritizes operational flexibility and incremental UX gains over new model capabilities. For teams that need model choice, on-prem inference, or better Copilot streaming behavior, the update materially improves daily developer workflows.
Scoring Rationale
This update is a meaningful productivity and governance improvement for enterprise developers rather than a frontier model or infrastructure milestone. BYOK materially affects operational choices and compliance, meriting a mid-high product score. Recent timing subtracts a small freshness penalty.
Practice interview problems based on real data
1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problemsStep-by-step roadmaps from zero to job-ready — curated courses, salary data, and the exact learning order that gets you hired.


