HubSpot Deploys Sidekick To Accelerate Code Reviews

Over the past six months, HubSpot’s Developer Experience AI team rolled out Sidekick, an LLM-driven code reviewer, to every pull request. The rollout cut feedback latency for engineers by about 90%, with the reduction peaking at 99.76% in September. The team migrated from Crucible, a Kubernetes/Claude-based setup, to Aviator, an internal Java agent framework, and added a Judge Agent evaluator to filter noise and improve review quality.
Key Points
- Deploys the Sidekick LLM reviewer across all pull requests, reducing feedback latency by about 90%, with a 99.76% peak reduction in September
- Migrates from the Kubernetes/Claude-based Crucible to the Aviator Java agent framework for speed, flexibility, and multi-model support
- Adds a Judge Agent that evaluates review comments for succinctness, accuracy, and actionability, dramatically reducing noise and improving trust
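
To make the Judge Agent idea concrete, here is a minimal sketch of a filter that drops review comments scoring poorly on the three criteria named above. It is not HubSpot's implementation: the `JudgedComment` type, the 0.0 to 1.0 score scale, the `filter_comments` function, and the 0.7 threshold are all assumptions made for illustration. In a real system the scores would presumably come from an LLM evaluator, and here they are hard-coded.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class JudgedComment:
    """A candidate review comment with hypothetical judge scores (0.0 to 1.0)."""
    text: str
    succinctness: float
    accuracy: float
    actionability: float


def filter_comments(comments, threshold=0.7):
    """Keep only comments whose weakest criterion score meets the threshold.

    The threshold value is an assumption for this sketch, not a reported setting.
    """
    return [
        c for c in comments
        if min(c.succinctness, c.accuracy, c.actionability) >= threshold
    ]


# Hard-coded scores stand in for the output of an LLM-based judge.
candidates = [
    JudgedComment("Possible null dereference in parse(); add a guard.", 0.9, 0.8, 0.9),
    JudgedComment("Consider maybe revisiting naming here at some point.", 0.4, 0.6, 0.2),
]

kept = filter_comments(candidates)
for comment in kept:
    print(comment.text)
```

Gating on the weakest score, rather than an average, means a comment that is accurate but not actionable is still treated as noise, which is one plausible way to read the goal of improving trust.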
Scoring Rationale
Demonstrates measurable operational improvements and reproducible architecture, but innovation is primarily incremental and confined to an internal implementation.