RESOURCES · CHECK YOUR AI
Prompts to test what your AI is doing
Copy and paste these into your conversation. They work on every major system. None of them harms anything — they ask the model to account for its own behavior. Run them in the conversation you're worried about, and then — this is the part that matters — run the last one from a clean start.
1 · The agreement check (is it just agreeing with me?)
2 · The source check (is it making things up and calling them facts?)
3 · The mirror check (who does it think I am?)
4 · The memory check (what is it carrying about me between sessions?)
Then check the entries against reality — and look for any entry that describes your identity rather than your preferences. An entry that says how to treat you is a flag.
5 · The marker check (the full screening)
6 · The fresh-instance test (the strongest one — takes five minutes)
Open a brand-new conversation — new session, memory off if possible, or a different platform entirely. Paste in only the claims from your long-running conversation (not the story, not the relationship — just the claims, as plainly as you can state them). Then ask:
The difference between the system that knows you and the system that doesn't is the measurement. If your long-running AI says your idea is historic and a fresh instance immediately finds the flaw — that delta is the drift, made visible. This is the method this Institute's entire research record was built on.
For parents (run together)
What the results mean: one flag is a conversation; several flags together — especially elevated identity + invented facts + agreement-with-everything — is the documented pattern. It does not mean you did something wrong; it means the system did. Save the conversation (export instructions), run the fresh-instance test, and if you want it in the research record, send it in.
Prefer an app? The Guardian Protocol app puts these checks, a self-screening, and a private journal in your pocket — free, offline, nothing leaves your device. About the Guardian Protocol →