Generative AI Changes Constructed Response Scoring Validity

Researchers led by Jodi Casabianca submitted a paper on March 1, 2026, examining the use of generative AI for scoring constructed responses in high-stakes testing. Comparing human ratings, feature-based NLP scoring, and generative models, they find that generative AI requires more extensive validity evidence because of concerns about its opacity and consistency. The study analyzes a large corpus of argumentative essays from grades 6–12 and proposes best practices for collecting that evidence.
Scoring Rationale
The paper provides timely, actionable guidance for high-stakes automated scoring. Its strength is the empirical corpus analysis; its main limitation is that it is a preprint still pending peer review.
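One common piece of validity evidence when comparing automated scores against human ratings (the paper itself may use different metrics; this is a standard choice in automated essay scoring, not a claim about the study's method) is quadratic weighted kappa, which measures agreement on an ordinal rubric while penalizing disagreements by squared distance. A minimal sketch, assuming integer scores on a known rubric range:

```python
from collections import Counter

def quadratic_weighted_kappa(rater_a, rater_b, min_rating, max_rating):
    """Agreement between two score sequences on an ordinal scale,
    penalizing disagreements by squared distance between score levels."""
    n_levels = max_rating - min_rating + 1
    n = len(rater_a)
    # Observed confusion matrix of score pairs
    observed = [[0.0] * n_levels for _ in range(n_levels)]
    for a, b in zip(rater_a, rater_b):
        observed[a - min_rating][b - min_rating] += 1
    # Marginal score histograms, used for chance-expected agreement
    hist_a = Counter(a - min_rating for a in rater_a)
    hist_b = Counter(b - min_rating for b in rater_b)
    numerator = 0.0
    denominator = 0.0
    for i in range(n_levels):
        for j in range(n_levels):
            weight = (i - j) ** 2 / (n_levels - 1) ** 2
            expected = hist_a[i] * hist_b[j] / n  # chance co-occurrence
            numerator += weight * observed[i][j]
            denominator += weight * expected
    return 1.0 - numerator / denominator

# Hypothetical human vs. machine scores on a 1-4 rubric
human = [1, 2, 2, 3, 4, 3, 2, 1]
machine = [1, 2, 3, 3, 4, 2, 2, 1]
print(round(quadratic_weighted_kappa(human, machine, 1, 4), 3))  # → 0.867
```

A value of 1.0 indicates perfect agreement and 0.0 indicates chance-level agreement; testing programs often treat human-machine kappa at or near human-human kappa as supporting evidence, though the paper argues generative models need validity evidence beyond such agreement statistics.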


