Mod-Guide Applies LLM RAG Feedback to Moderation
A new arXiv paper (arXiv:2606.13397, Dipto Das et al., submitted June 11, 2026) introduces Mod-Guide, an LLM-based content-moderation feedback system that folds community-created narratives into a retrieval-augmented generation (RAG) pipeline, built and evaluated with members of Bangladesh's Hindu and Chakma communities - the country's largest religious and Indigenous ethnic minorities, per the paper. The authors report that RAG-enhanced moderation responses were rated more contextually accurate and were perceived differently across ethnic lines in mixed-method evaluations with minority and majority participants. The study frames its contribution around surfacing culturally specific interpretations of insensitive speech that a base LLM might otherwise miss.
What happened
For content-moderation teams serving underrepresented communities, a new arXiv paper offers a concrete method rather than a general call for more inclusive data: Mod-Guide (arXiv:2606.13397, Dipto Das et al., submitted June 11, 2026) integrates a community co-created corpus into a retrieval-augmented generation (RAG) pipeline, built and evaluated with members of Bangladesh's Hindu and Chakma communities - the country's largest religious and Indigenous ethnic minorities. The authors report that RAG-enhanced moderation responses were rated more contextually accurate, and were perceived differently across ethnic lines, in mixed-method evaluations with minority and majority participants.
Technical context
Per the paper, the authors co-created a culturally grounded corpus of insensitive-speech examples directly with community members, then used it as retrieval context so the base LLM can surface culturally specific interpretations of implicit or normative harms it might otherwise miss. This is a concrete instance of a broader pattern: RAG is increasingly used to inject external, curated context into LLM outputs to improve factuality and domain sensitivity, but that benefit is bounded by retrieval quality, corpus representativeness, and prompt design.
Industry context
Human-computer-interaction and AI-ethics research increasingly calls for participatory data practices when models affect marginalized communities. Co-creating datasets with impacted groups can raise model relevance and community trust, but it also raises open questions about scope, governance, and long-term maintenance of culturally specific corpora - questions this paper does not resolve.
What to watch
Whether the authors release the co-created corpus or evaluation data, how the RAG pipeline's reproducibility holds up under peer review, and whether the approach generalizes beyond the Bangladesh Hindu and Chakma context studied here.
Key Points
- 1Co-created, culturally grounded corpora can supply RAG layers with context that improves detection of implicit, culturally specific harms.
- 2Mixed-method evaluation across majority and minority participants reveals perception gaps that matter for moderation calibration and community trust.
- 3Participatory dataset construction raises governance and maintenance questions practitioners must treat as ongoing operational concerns.
Scoring Rationale
Verified single-source arXiv paper with a genuine participatory methodology across a real minority community. Relevant to AI-ethics and content-moderation practitioners, but an early academic study, not a production system. Single-source (paper is the origin document; only an author-hosted PDF mirror found, not independent coverage).
Sources
Public references used for this report.
Practice with real FinTech & Trading data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all FinTech & Trading problems