June 8, 2026
Building Trustworthy Cyber Agent Evaluations
A short opening note on why realistic cyber-agent evaluation needs end-to-end ranges, auditable harnesses, and attention to evaluation awareness.