STYLEWAVEGAN: STYLE-BASED SYNTHESIS OF DRUM SOUNDS WITH EXTENSIVE CONTROLS USING GENERATIVE ADVERSARIAL NETWORKS

Antoine Lavault; Axel Roebel; Matthieu Voiry

Communication Dans Un Congrès Année : 2022

STYLEWAVEGAN: STYLE-BASED SYNTHESIS OF DRUM SOUNDS WITH EXTENSIVE CONTROLS USING GENERATIVE ADVERSARIAL NETWORKS

(1) , (1) , (2)

1
2

Antoine Lavault

Fonction : Auteur
PersonId : 1142551

Analyse et synthèse sonores [Paris]

Axel Roebel

Fonction : Auteur

Analyse et synthèse sonores [Paris]

Matthieu Voiry

Fonction : Auteur

Apeira Technologies

Résumé

In this paper we introduce StyleWaveGAN, a style-based drum sound generator that is a variation of StyleGAN, a state-of-the-art image generator. By conditioning StyleWaveGAN on both the type of drum and several audio descriptors, we are able to synthesize waveforms faster than real-time on a GPU directly in CD quality up to a duration of 1.5s while retaining a considerable amount of control over the generation. We also introduce an alternative to the progressive growing of GANs and experimented on the effect of dataset balancing for generative tasks. The experiments are carried out on an augmented subset of a publicly available dataset comprised of different drums and cymbals. We evaluate against two recent drum generators, WaveGAN and NeuroDrum, demonstrating significantly improved generation quality (measured with the Frechet Audio Distance) and interesting results with perceptual features.

Domaines

Son [cs.SD]

Fichier principal

47.pdf (587.39 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Antoine Lavault : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03693950

Soumis le : lundi 13 juin 2022-11:37:12

Dernière modification le : samedi 7 octobre 2023-21:36:22

Dates et versions

hal-03693950 , version 1 (13-06-2022)

Identifiants

HAL Id : hal-03693950 , version 1

Citer

Antoine Lavault, Axel Roebel, Matthieu Voiry. STYLEWAVEGAN: STYLE-BASED SYNTHESIS OF DRUM SOUNDS WITH EXTENSIVE CONTROLS USING GENERATIVE ADVERSARIAL NETWORKS. 19th Sound and Music Computing Conference (SMC 2022), Jun 2022, Saint-Etienne, France. ⟨hal-03693950⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS IRCAM STMS SORBONNE-UNIVERSITE SU-SCIENCES

97 Consultations

125 Téléchargements

STYLEWAVEGAN: STYLE-BASED SYNTHESIS OF DRUM SOUNDS WITH EXTENSIVE CONTROLS USING GENERATIVE ADVERSARIAL NETWORKS

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager