
arXiv: 2211.11312
Human Activity Recognition (HAR) has been employed in a wide range of applications, e.g. self-driving cars, where safety and lives are at stake. Recently, the robustness of skeleton-based HAR methods have been questioned due to their vulnerability to adversarial attacks. However, the proposed attacks require the full-knowledge of the attacked classifier, which is overly restrictive. In this paper, we show such threats indeed exist, even when the attacker only has access to the input/output of the model. To this end, we propose the very first black-box adversarial attack approach in skeleton-based HAR called BASAR. BASAR explores the interplay between the classification boundary and the natural motion manifold. To our best knowledge, this is the first time data manifold is introduced in adversarial attacks on time series. Via BASAR, we find on-manifold adversarial samples are extremely deceitful and rather common in skeletal motions, in contrast to the common belief that adversarial samples only exist off-manifold. Through exhaustive evaluation, we show that BASAR can deliver successful attacks across classifiers, datasets, and attack modes. By attack, BASAR helps identify the potential causes of the model vulnerability and provides insights on possible improvements. Finally, to mitigate the newly identified threat, we propose a new adversarial training approach by leveraging the sophisticated distributions of on/off-manifold adversarial samples, called mixed manifold-based adversarial training (MMAT). MMAT can successfully help defend against adversarial attacks without compromising classification accuracy.
Accepted in Pattern Recognition. arXiv admin note: substantial text overlap with arXiv:2103.05266
/dk/atira/pure/subjectarea/asjc/1700/1702; name=Artificial Intelligence, FOS: Computer and information sciences, Adversarial robustness, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, skeletal action recognition, Skeletal action recognition, /dk/atira/pure/subjectarea/asjc/1700/1707; name=Computer Vision and Pattern Recognition, Black-box attack, On-manifold adversarial samples, /dk/atira/pure/subjectarea/asjc/1700/1711; name=Signal Processing, adversarial robustness, on-manifold adversarial samples, /dk/atira/pure/subjectarea/asjc/1700/1712; name=Software
/dk/atira/pure/subjectarea/asjc/1700/1702; name=Artificial Intelligence, FOS: Computer and information sciences, Adversarial robustness, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, skeletal action recognition, Skeletal action recognition, /dk/atira/pure/subjectarea/asjc/1700/1707; name=Computer Vision and Pattern Recognition, Black-box attack, On-manifold adversarial samples, /dk/atira/pure/subjectarea/asjc/1700/1711; name=Signal Processing, adversarial robustness, on-manifold adversarial samples, /dk/atira/pure/subjectarea/asjc/1700/1712; name=Software
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 7 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
