Gamer Hosts Open-Source LLM With Vulkan Backend

A developer experimented with hosting small LLMs on a Windows gaming PC (AMD RX 9070 XT, Ryzen 7 9700X) using Jan (llama.cpp) with a Vulkan-compiled backend to leverage GPU acceleration. They ran Jan in server mode behind a NetBird VPN, exposed an OpenAI-compatible API, and reported roughly 50 tokens on Mistral 3 14B. The post documents a practical home-hosted LLM and remote access workflow.
Key Points
- 1Runs Jan (llama.cpp) with a Vulkan-compiled backend on an AMD RX 9070 XT GPU.
- 2Avoids proprietary tools; enables GPU acceleration where Ollama lacks AMD support on Windows.
- 3Exposes an OpenAI-compatible API via Jan server and NetBird VPN for encrypted remote access.
Scoring Rationale
Practical, reproducible home-hosting guide with clear steps; limited by single-user anecdote and modest benchmark reporting.
Sources
Public references used for this report.
Practice interview problems based on real data
1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems