Demo · live RAG grounding check

Did the model make this up? Or is it grounded in the sources?

Paste a model output and the source documents it was supposed to be grounded in. We score how strongly the output is supported by the sources and return a hallucination probability, verdict, and signed receipt. Useful as a guardrail in RAG pipelines, chatbot factuality checks, and summarization audits.

Model output (the thing being checked)Source documents (separated by --- on its own line)

Category

How this works

Grounding as an embedding operation. The model encodes the output and each source document, then scores how well the output is supported by the source evidence.
Category-aware threshold. Q&A, summarization, and dialog have different grounding norms; the probe uses the category to pick the right calibrated threshold.
RAG-pipeline use. Drop in as a guardrail after retrieval — if the output isn't supported by what was retrieved, fail closed or surface uncertainty to the user.
Bantu-language coverage. Same multilingual backbone, so the grounding check works on Zambian-language outputs too (under NDA today, in the API key tier).
Signed receipt. Every check produces a SHA-256 audit record — same audit substrate as bias and hate.

Other live demos