RhythmCAN – あいまいさ誤差を利用して、創造性の高いリズム生成を目指すGANアーキテクチャの研究

徳井直生

Tokui, Nao. 2020. “Can GAN Originate New Electronic Dance Music Genres? — Generating Novel Rhythm Patterns Using GAN with Genre Ambiguity Loss.” arXiv [cs.SD]. arXiv. http://arxiv.org/abs/2011.13062.

AIを用いて (人の模倣ではない) 新しい音楽ジャンルを生み出すことができるか?

プロジェクトの概要

Since the introduction of deep learning, researchers have proposed content generation systems using deep learning and proved that they are competent to generate convincing content and artistic output, including music. However, one can argue that these deep learning-based systems imitate and reproduce the patterns inherent within what humans have created, instead of generating something new and creative.

In this paper, we focus on music generation, especially rhythm patterns of electronic dance music, and discuss if we can use deep learning to generate novel rhythms, interesting patterns not found in the training dataset.

We extend the framework of Generative Adversarial Networks(GAN) and encourage it to diverge from the inherent distributions in the dataset by adding additional classifiers to the framework. The paper shows that our proposed GAN can generate rhythm patterns that sound like music rhythms but not belong to any genre in the training dataset.

The source code, generated rhythm patterns, and supplementary plugin software for a popular Digital Audio Workstation software are available on our website.


生成されたリズムのサンプル

提案手法: GAN with Genre Ambiguity Loss

比較: Genre-Conditioned GAN ジャンルで条件付けしたリズム生成


提案手法で学習したモデルを簡単に試すことのできるAbleton Live M4Lデバイスを提供。

提案手法を用いて生成したリズムをベースにした楽曲

Nao Tokui – Synthetic Soundscape (2021)