Chrome Extension
WeChat Mini Program
Use on ChatGLM

Application Research of Short-Time Fourier Transform in Music Generation Based on the Parallel WaveGan System

IEEE Transactions on Industrial Informatics(2024)

Cited 0|Views0
No score
Abstract
Despite the widespread use of Fourier transform (FT) networks and generative adversarial networks (GANs) in audio signal processing, their practical effectiveness in unsupervised offline systems has not yet reached a fully satisfying level. Accumulating substantial experience in recent years, this article showcases how to construct an optimized, efficient music generation system. In the proposed system, the short-time Fourier transform is employed to divide a long music signal into equally sized short melodic segments. Each short melodic segment undergoes FT, and a nonautoregressive parallel WaveGAN system is trained by jointly optimizing multiresolution spectrograms and adversarial loss functions. This approach effectively captures the time–frequency distribution of real music waveforms. In essence, the proposed music generation system is a self-feedback unsupervised model relying on specific melody and note model pruning techniques. To further refine the music evaluation mechanism, in addition to conducting data analysis on the output melodies, subjective evaluation mechanisms are also incorporated.
More
Translated text
Key words
Music evaluation,music generation,parallel WaveGAN,short-time Fourier transform (STFT),unsupervised models
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined