Skip to main content

Run your first stored-state verification

Compare your agent's tool claims against actual stored state.

Get a deterministic verdict before you ship, bill, or hand off to customers.

Preview vs contract verification

agentskeptic quick is a fast SQL-only preview: it infers checks from raw tool activity against a database you pass. It does not run full registry contract verification across HTTP, object, vector, or Mongo witnesses.

agentskeptic check is the decision-grade path: registry + events + readable targets, Outcome Certificate on stdout, and the truth_check_verdict: / release_critical_truth_check_verdict: lines on stderr. Use agentskeptic check for CI and release gates.

Optional coverage budgets

In CI you get

When you run the first-party GitHub Actions composite at .github/actions/agentskeptic-check (wrapping agentskeptic check), each run exposes three CI surfaces:

Full CI contract: ambient-ci-distribution.md

First proof: contract truth check

Run agentskeptic check against your registry, events file, and readable verification targets (database for SQL; remote URL for HTTP / object / vector / Mongo witnesses per registry).

A successful run gives you:

Full guide: docs/integrate.md

Hybrid contract demo (SQL + HTTP witness): examples/hybrid-contract-demo.mjs (requires npm run build and POSTGRES_VERIFICATION_URL).

npx agentskeptic check --workflow-id YOUR_WORKFLOW_ID \
  --project ./your-app \
  --db ./path/to/readable.sqlite

Try paste verification

Before you run it

You need:

Learn guides include Next.js and Postgres paths alongside SQLite snapshots.

Reading the verdict

On verdict exits, stderr starts with two automation lines — truth_check_verdict: trusted|not_trusted|unknown and release_critical_truth_check_verdict: trusted|not_trusted|unknown (see docs/first-truth-check.md). When an opt-in coverage budget policy is active, two additional machine lines follow the truth lines — see docs/integrate.md (optional coverage budgets). The human-readable report below those lines may include engine phrases for each step; the first truth line is the primary gate for whether you can rely on the global workflow outcome.

truth_check_verdictNext action
trustedProceed with reliance only when your policy allows (the certificate must still show expected state matched). Do not claim verified if anything else disagrees with this line.
not_trustedBlock handoff, ship, bill, or continuation on this run—fix the workflow, data, or registry expectation and re-run check.
unknownDo not claim verified—add observations, witnesses, or narrow scope until the verdict can be determined.

If stored state is wrong, you get not_trusted with an actionable reason such as ROW_ABSENT in the report.

No more silent green traces hiding bad data.

Next steps

  1. Try the interactive demo to see a missing-write failure.
  2. Run the command above against your own readable data.
  3. Follow the Learn guides to wire the same check into CI.