
Black-box decision-based attacks on images
In the previous post we reviewed a series of black-box score-based adversarial attacks, in which the adversary estimates the gradient by querying the target model and retrieving the confidence scores of its labels. In this post we explore the third category of black-box attacks: decision-based attacks. Under this setting, the only knowledge the attacker has about the model is discrete …