Renovating Names in Open-Vocabulary Segmentation Benchmarks
arxiv(2024)
摘要
Names are essential to both human cognition and vision-language models.
Open-vocabulary models utilize class names as text prompts to generalize to
categories unseen during training. However, name qualities are often overlooked
and lack sufficient precision in existing datasets. In this paper, we address
this underexplored problem by presenting a framework for "renovating" names in
open-vocabulary segmentation benchmarks (RENOVATE). Through human study, we
demonstrate that the names generated by our model are more precise descriptions
of the visual segments and hence enhance the quality of existing datasets by
means of simple renaming. We further demonstrate that using our renovated names
enables training of stronger open-vocabulary segmentation models. Using
open-vocabulary segmentation for name quality evaluation, we show that our
renovated names lead to up to 16
on various benchmarks across various state-of-the-art models. We provide our
code and relabelings for several popular segmentation datasets (ADE20K,
Cityscapes, PASCAL Context) to the research community.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要