Bridging machine learning and cryptography in defence against adversarial attacks

Bridging machine learning and cryptography in defence against adversarial attacks.png

O. Taran, S. Rezaeifar, and S. Voloshynovskiy

Workshop on Objectionable Content and Misinformation (WOCM), ECCV 2018

Most of the deep learning architectures are vulnerable to so called adversarial examples. This questions the security of deep neural networks (DNN) for many security- and trust-sensitive domains. The majority of the proposed existing adversarial attacks are based on the differentiability of the DNN cost function. Defence strategies are mostly based on machine learning and signal processing principles that either try to detect-reject or filter out the adversarial perturbations and completely neglect the classical cryptographic component in the defence. In this work, we propose a new defence mechanism based on the second Kerckhoffs’s cryptographic principle which states that the defence and classification algorithm are supposed to be known, but not the key. To be compliant with the assumption that the attacker does not have access to the secret key, we will primarily focus on a gray-box scenario and do not address a white-box one.

PDF