Transformers in Natural Language Processing
Abstract
This chapter presents an overview of the state of the art in natural language processing, focusing on one specific computational architecture, the Transformer model, which plays a central role in a wide range of applications. This architecture condenses many advances in neural learning methods and can be exploited in many ways: to learn representations for linguistic entities; to generate coherent utterances and answer questions; or to perform utterance transformations, as illustrated by machine translation. These different facets of the architecture will be presented in turn, which will also allow us to discuss its limitations.