We will study how to harden machine learning classifiers against adversarial attack. Current defenses fail badly against an attacker who deliberately tries to fool or manipulate the model, so stronger defenses are needed. We will explore general mechanisms for making deep-learning classifiers more robust, with a special focus on security for autonomous vehicles, and will study three specific defense approaches: generative models, internal-consistency checks, and improvements to adversarial training (see the sketch after this summary).
Grant / January 2020
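
As a concrete reference point for the third approach, the sketch below shows one training step of adversarial training in PyTorch, using the fast gradient sign method (FGSM) to craft the perturbed inputs. This is a minimal illustration, not the proposal's method: the function names, the epsilon value, and the choice of FGSM rather than a stronger multi-step attack are all assumptions made for the example.

```python
import torch
import torch.nn.functional as F


def fgsm_perturb(model, x, y, epsilon):
    """Craft an FGSM adversarial example: take one step in the
    direction of the sign of the loss gradient w.r.t. the input."""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    # Signed-gradient step, clipped back to the valid pixel range [0, 1].
    return (x_adv + epsilon * x_adv.grad.sign()).clamp(0.0, 1.0).detach()


def adversarial_training_step(model, optimizer, x, y, epsilon=8 / 255):
    """One adversarial-training update: generate perturbed inputs on the
    fly, then minimize the loss on those inputs instead of the clean ones.
    The epsilon default is an illustrative choice, not a recommendation."""
    model.train()
    x_adv = fgsm_perturb(model, x, y, epsilon)
    optimizer.zero_grad()  # discard gradients accumulated while attacking
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    optimizer.step()
    return loss.item()
```

In practice this step would replace the standard clean-data update inside the training loop; the proposal's planned improvements would build on this baseline rather than on training with clean examples alone.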