Nexthink deploys OpenSearch vector search for Spark agent

According to a guest post on the AWS Blog by Nexthink, the company combined Amazon OpenSearch Service vector search, Amazon Bedrock embeddings and infrastructure-as-code to power its autonomous IT-support agent Spark. The blog reports Spark achieves a 77% resolution rate at first contact and is deployed across 12 AWS Regions. The post frames vector search as necessary for semantic retrieval at sub-second latency across millions of documents and warns that incorrect retrieval can produce harmful automated actions if an agent executes the wrong remediation. The writeup attributes embeddings generation to V2 via Amazon Bedrock and describes the retrieval layer as the foundation for safe, accurate autonomous resolution.
What happened
Per a guest post on the AWS Blog authored by Nexthink, the company implemented Amazon OpenSearch Service vector search together with Amazon Bedrock embeddings and infrastructure-as-code to support its autonomous IT-support agent Spark. The blog reports Spark resolves issues at a 77% first-contact resolution rate and is deployed across 12 AWS Regions. The post says embeddings were generated using V2 through Amazon Bedrock and that retrieval must operate at sub-second latency across millions of documents to serve enterprise users.
Technical details
Per the AWS blog post, Nexthink built a retrieval layer on OpenSearch Service with vector search to enable semantic matching of user queries to documentation, historical resolutions, and remediation scripts. The post emphasises semantic retrieval over keyword matching and cites the operational risk of incorrect retrieval leading to dangerous automated commands as a guardrail rationale for high-precision vectors and verification before action.
Editorial analysis
Industry-pattern observations: production AI agents increasingly rely on vector search plus high-quality embeddings to bridge natural-language queries and operational knowledge bases. Practitioners deploying autonomous remediation should expect engineering effort around embedding quality, indexing strategies, latency optimization, and safe execution gating.
What to watch
For practitioners: monitor retrieval latency at scale, embedding drift over time, and mechanisms the team uses for verification before action. Public metrics such as first-contact resolution and regional deployments provide useful operational benchmarks when evaluating similar agent architectures.
Scoring Rationale
This is a notable production case study showing semantic vector search and managed cloud embeddings powering a live autonomous IT-support agent at scale, with concrete operational metrics (77% first-contact resolution, 12 AWS Regions). Practically useful for infrastructure and retrieval architects but not a frontier research contribution.
Practice with real Retail & eCommerce data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all Retail & eCommerce problems


