谷歌浏览器插件
订阅小程序
在清言上使用

Fréchet Audio Distance: A Reference-Free Metric for Evaluating Music Enhancement Algorithms

arXiv: Audio and Speech Processing(2019)

引用 54|浏览15
暂无评分
摘要
We propose the Fr ' echet Audio Distance (FAD), a novel, reference-free evaluation metric for music enhancement algorithms. We demonstrate how typical evaluation metrics for speech enhancement and blind source separation can fail to accurately measure the perceived effect of a wide variety of distortions. As an alternative, we propose adapting the Fr ' echet Inception Distance (FID) metric used to evaluate generative image models to the audio domain. FAD is validated using a wide variety of artificial distortions and is compared to the signal based metrics signal to distortion ratio (SDR), cosine distance, and magnitude L2 distance. We show that, with a correlation coefficient of 0:52, FAD correlates more closely with human perception than either SDR, cosine distance or magnitude L2 distance, with correlation coefficients of 0.39, -0.15 and -0.01 and respectively.
更多
查看译文
关键词
music,enhancement,algorithms,distance,reference-free
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要