Regression to Classification: Waveform Encoding for Neural Field-Based Audio Signal Representation

TaeSoo Kim,Daniel Rho, Gahui Lee, JaeHan Park,Jong Hwan Ko

ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)(2023)

引用 0|浏览0
暂无评分
摘要
Neural fields, also known as coordinate-based representations, are an emerging signal representation framework. This approach has also been used to represent audio signals, but the generated audio often contains noise. To reduce noise and improve representation quality, we propose using waveform encoding in the neural field. Instead of yielding real numbers for each temporal coordinate, this involves using discrete integers as outputs, with waveform-encoded integers as target classes, and treating the representation problem as a classification task rather than a regression problem. The experimental results show that waveform encoding can improve the audio quality of neural fields across a variety of audio datasets.
更多
查看译文
关键词
neural fields,implicit neural representation,audio representations,waveform coding
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要