Discussion about this post

User's avatar
Dominika Michalska's avatar

I like the distinction between high-level findings and operational details. The goal is not to hand attackers a map, but to give decision-makers a trustworthy view of security posture. That matters because AI security is becoming trust-critical. Policymakers, customers, partners, and the public are being asked to rely on systems whose risks include model theft, tampering, and unauthorized access. “We take security seriously” is not enough.

Without independent, high-level assessment, strong and weak security can look the same from the outside. That makes trust too reputational, and accountability too easy to avoid.

Steeven's avatar

How would a lab go about doing a pen test where the actor is a spy agency from a nation state? This seems impossible without asking for help from that nation state. I don’t think that ie Anthropic could call the NSA for help gaming out the Mossad because the Mossad is different in ways that change what it will do from the NSA.

Like consider the pager attack against Hezbollah where the Mossad sold them compromised hardware. Installing a watcher program in data center hardware prior to OpenAI or their supplier buying it would be a very successful attack in the same way. What would be the security assessment in practice? Supply chain auditing down to the hardware of each 3rd party?

No posts

Ready for more?