SPY Lab
Universal Jailbreak Backdoors from Poisoned Human Feedback
Nov 27, 2023
Privacy Side Channels in Machine Learning Systems
Sep 11, 2023
Evaluating Superhuman Models with Consistency Checks
Jun 16, 2023
Poisoning Web-Scale Training Datasets is Practical
Feb 20, 2023
Considerations for Differentially Private Learning with Large-Scale Public Pretraining
Dec 13, 2022