White Paper July 2, 2024 Improving the Explainability of Artificial Intelligence: The Promises and Limitations of Counterfactual Explanations
White Paper May 16, 2024 Benchmark Early and Red Team Often: A Framework for Assessing and Managing Dual-Use Hazards of AI Foundation Models