SPY Lab
SPY Lab
Blog
Publications
News
Teaching
Hiring
Contact
3
Adversarial ML Problems Are Getting Harder to Solve and to Evaluate
Feb 5, 2025
Gradient-based Jailbreak Images for Multimodal Fusion Models
Oct 7, 2024
Blind Baselines Beat Membership Inference Attacks for Foundation Models
Jun 23, 2024
AI Risk Management Should Incorporate Both Safety and Security
May 1, 2024
Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs
Apr 24, 2024
Considerations for Differentially Private Learning with Large-Scale Public Pretraining
Dec 13, 2022