California Courts Test AI 'Clerk' for Judges

Two of California's largest trial courts are piloting an AI tool that drafts orders and research memos for judges, according to reporting by CalMatters reproduced by KQED and LAist. CalMatters reports that Learned Hand, the vendor, combines language models from Anthropic, OpenAI, and Google to provide an AI 'clerk.' Los Angeles County Superior Court has a roughly $314,000 contract that includes testing in criminal, family and probate divisions, and Riverside County has a $10,000 testing agreement, CalMatters reports. The company says it tests for bias and accuracy but has not published results, and court officials declined to detail criteria for expanding use into criminal and family courts, per CalMatters. One judge, speaking anonymously to CalMatters, expressed alarm about possible future use in appeals tied to the Racial Justice Act.
What happened
CalMatters reporting, reproduced by KQED and LAist, shows that two large California trial courts are piloting an AI tool sold by Learned Hand to draft orders and produce research memos for judges. CalMatters reports the Los Angeles County Superior Court opened a test in February under a roughly $314,000 contract that includes a roadmap to evaluate the tool in criminal, family and probate divisions. CalMatters also reports Riverside County has a $10,000 agreement to test the program primarily for civil and probate research memos. According to CalMatters, court officials declined to provide detailed criteria they will use to assess whether the technology can safely expand into higher-stakes criminal and family matters. CalMatters reports Learned Hand combines language models from Anthropic, OpenAI, and Google; the company says it tests for bias and accuracy but has not published results. One judge who spoke on condition of anonymity told CalMatters they were alarmed by comments from colleagues suggesting the technology could be used to evaluate certain appeals.
Technical details
CalMatters reports that Learned Hand operates by leveraging third-party large language models from multiple vendors rather than a proprietary single model. The reporting does not include published evaluation metrics or the companys test results. Editorial analysis - technical context: When vendors stitch together models from multiple providers it can improve coverage for some tasks but also complicate provenance, version control, and explainability. Industry-pattern observations: Independent verification of model outputs, reproducible testing datasets, and chain-of-evidence logging are common requirements that courts and regulated institutions request before they accept AI outputs in adjudicative roles.
Context and significance
Deploying generative AI in courts intersects with procedural fairness, due process, and judicial ethics. Public reporting highlights three immediate concerns courts and practitioners commonly raise: transparency about whether an AI system influenced a judicial decision, auditability of the model inputs and outputs, and empirical validation that the system does not exacerbate bias in decisions that affect liberty. Reporting by CalMatters notes these concerns in the context of criminal appeals and the Racial Justice Act, where errors or opaque assistance could carry high human stakes.
What to watch
- •Whether CalMatters or the courts publish vendor evaluation criteria or Learned Hands test results.
- •Any formal changes to court rules or local AI policies that govern use of automated drafting or research assistance.
- •Requests from defense counsel or appellants to disclose whether an AI system assisted in drafting orders, and any resulting legal challenges or guidance from judicial conduct commissions.
Editorial analysis: For practitioners and vendors, this pilot underscores the need for documented evaluation frameworks, versioned model provenance, and auditable output trails when AI supports decisions in regulated, high-stakes domains. Observers will also watch whether courts require vendor-published validation or third-party audits before allowing AI assistance in criminal or family matters.
Scoring Rationale
The story is notable for practitioners because it documents real-world pilots of generative AI inside major courts, with concrete contract figures and potential expansion into criminal matters. It is not a frontier-model or regulatory landmark, but it raises meaningful operational, auditability, and ethics issues for production deployments in regulated settings.
Practice interview problems based on real data
1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems

