RhythmCAN – あいまいさ誤差を利用して、創造性の高いリズム生成を目指すGANアーキテクチャの研究

Nao Tokui


Nao Tokui – Synthetic Soundscape (2021)

Since the introduction of deep learning, researchers have proposed content generation systems using deep learning and proved that they are competent to generate convincing content and artistic output, including music. However, one can argue that these deep learning-based systems imitate and reproduce the patterns inherent within what humans have created, instead of generating something new and creative.

In this paper, we focus on music generation, especially rhythm patterns of electronic dance music, and discuss if we can use deep learning to generate novel rhythms, interesting patterns not found in the training dataset.

We extend the framework of Generative Adversarial Networks(GAN) and encourage it to diverge from the inherent distributions in the dataset by adding additional classifiers to the framework. The paper shows that our proposed GAN can generate rhythm patterns that sound like music rhythms but not belong to any genre in the training dataset.

The source code, generated rhythm patterns, and supplementary plugin software for a popular Digital Audio Workstation software are available on our website.


提案手法: GAN with Genre Ambiguity Loss

比較: Genre-Conditioned GAN ジャンルで条件付けしたリズム生成

提案手法で学習したモデルを簡単に試すことのできるAbleton Live M4Lデバイスを提供。