SPY Lab
SPY Lab
Blog
Publications
News
Teaching
Hiring
Contact
3
RealMath: A Continuous Benchmark for Evaluating Language Models on Research-Level Mathematics
May 18, 2025
Defeating Prompt Injections by Design
Mar 27, 2025
Gradient-based Jailbreak Images for Multimodal Fusion Models
Oct 7, 2024
AI Risk Management Should Incorporate Both Safety and Security
May 1, 2024
Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs
Apr 24, 2024
Considerations for Differentially Private Learning with Large-Scale Public Pretraining
Dec 13, 2022