Towards logical specification of adversarial examples in machine learning

Marwa Zeroual; Brahim Hamid; Morayo Adedjouma; Jason Jaskolka

doi:10.1109/TrustCom56396.2022.00226

Poster De Conférence Année : 2022

Towards logical specification of adversarial examples in machine learning

(1) , (2, 3) , (1) , (4)

1
2
3
4

Marwa Zeroual

Fonction : Auteur
PersonId : 1291630
ORCID : 0000-0002-4990-4145

Département Ingénierie Logiciels et Systèmes

Brahim Hamid

Fonction : Auteur
PersonId : 750865
IdHAL : brahim-hamid
ORCID : 0000-0002-2199-3916
IdRef : 119018586

Advancing Rigorous Software and System Engineering

Université Toulouse - Jean Jaurès

Morayo Adedjouma

Fonction : Auteur
PersonId : 1144104
ORCID : 0000-0003-0218-028X
IdRef : 162109113

Département Ingénierie Logiciels et Systèmes

Jason Jaskolka

Fonction : Auteur

Laboratoire d'Intégration des Systèmes et des Technologies

Résumé

The use of Artificial Intelligence (AI)-based systems, using particularly Machine Learning (ML) classifiers, is growing rapidly and finding uses in many industries. Most of these industries have critical safety, security, and dependability requirements. Despite this rapid growth, interest in the security of these systems has only arisen in the last few years and it is not yet well-studied. There is a want for a formal notion of security for ML systems, similar to that used in classical information security. We took this statement toward security threat modeling and analysis in ML-based systems, focusing on the adversarial example threat. An adversarial example threat is an input of the classifier that was maliciously modified to induce a misclassification. Identifying this threat at the architecture design stage before proceeding with system development is a critical milestone in the development process of secure ML systems. In this paper, we propose an approach to adversarial example threat specification and detection in component-based software architecture models. We use first-order and modal logic as an abstract and technology-independent formalism. The general idea of the approach is to specify the threat as property of a modeled system such that the violation of the specified property indicates the presence of the threat. We demonstrate the applicability of the method through a classifier used in a recommendation system.

Mots clés

adversarial examples machine learning classifiers threat logical specification arguments

Domaines

Intelligence artificielle [cs.AI] Théorie et langage formel [cs.FL]

Fichier principal

Towards_logical_specification_of_adversarial_examples_in_machine_learning.pdf (978.24 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Contributeur MAP CEA : Connectez-vous pour contacter le contributeur

https://cea.hal.science/cea-04292759

Soumis le : vendredi 17 novembre 2023-19:00:15

Dernière modification le : mardi 3 septembre 2024-11:16:05

Dates et versions

cea-04292759 , version 1 (17-11-2023)

Identifiants

HAL Id : cea-04292759 , version 1
DOI : 10.1109/TrustCom56396.2022.00226

Citer

Marwa Zeroual, Brahim Hamid, Morayo Adedjouma, Jason Jaskolka. Towards logical specification of adversarial examples in machine learning. IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom 2022), Dec 2022, Wuhan, China. IEEE, 2022 IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom), pp.1575-1580, 2022, ⟨10.1109/TrustCom56396.2022.00226⟩. ⟨cea-04292759⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CEA UNIV-TLSE2 CNRS UT1-CAPITOLE DRT CEA-UPSAY UNIV-PARIS-SACLAY LIST IRIT IRIT-ARGOS IRIT-FSL GS-COMPUTER-SCIENCE GS-SPORT-HUMAN-MOVEMENT IRIT-UT2J TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP

277 Consultations

65 Téléchargements

Towards logical specification of adversarial examples in machine learning

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager