Mixed-Trust Prompt Injection
A toy agent that receives system instructions, user goals, retrieved documents, and tool outputs. The lab shows how hostile data becomes operational instruction.
Reproducible studies
Labs turn claims into small systems. Each one should expose a concrete failure mode, a trace of what happened, and a design alternative.
A toy agent that receives system instructions, user goals, retrieved documents, and tool outputs. The lab shows how hostile data becomes operational instruction.
A memory store with snapshots, provenance labels, and controlled rollback. The point is to show why agent memory behaves like application state.
A minimal protocol-style tool registry where metadata and descriptions can influence agent behavior before the tool is even called.
Namespaces, service accounts, network policies, and secrets used to constrain which tools an agent workload can reach.
A side-by-side failure study comparing smart contract reentrancy with recursive or uncontrolled agent tool invocation.