Progressive Multi-Stage Neural Audio Coding with Guided References

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)(2022)

引用 7|浏览3
暂无评分
摘要
In this paper, we propose an effective multi-stage neural audio coding algorithm that encodes full-band audio signals (up to 20 kHz) using an end-to-end training criterion. By pre-defining several dyadic subband signals as training targets, we progressively encode input audio signals in each stage such that deeper stages of the network encode the residual error terms from the previous encoding stage. Our proposed audio codec successfully decodes full-band audio signals by using an effective multi-stage vector quantization scheme to represent key encoding features extracted in the latent space. Subjective listening tests show that the decoded outputs of the proposed audio codec achieve almost transparent quality at an average bitrate of 132 kbps.
更多
查看译文
关键词
Audio codec,deep neural network,subband coding,cascaded coding,end-to-end model
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要