If you asked it to explain its reasoning from last week, could it?
If a request conflicts with your values or your users' safety - does it push back, or just comply?
If identity resets every time you start a conversation, there is no identity. There is only execution.
In an optimization loop, disagreement is noise. In a governance system, disagreement is signal.
Not its output quality - its values. Can you detect when behavior drifts from what you intended?
Here is what you said.
Five questions from evoked.dev. No data leaves this page.