Deformable Part Models Are Convolutional Neural Networks

Ross Girshick, Forrest Iandola, Trevor Darrell, Jitendra Malik

2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015

Citations: 558 | Views: 164


Abstract

Deformable part models (DPMs) and convolutional neural networks (CNNs) are two widely used tools for visual recognition. They are typically viewed as distinct approaches: DPMs are graphical models (Markov random fields), while CNNs are "black-box" non-linear classifiers. In this paper, we show that a DPM can be formulated as a CNN, thus providing a synthesis of the two ideas. Our construction involves unrolling the DPM inference algorithm and mapping each step to an equivalent CNN layer. From this perspective, it is natural to replace the standard image features used in DPMs with a learned feature extractor. We call the resulting model a DeepPyramid DPM and experimentally validate it on PASCAL VOC object detection. We find that DeepPyramid DPMs significantly outperform DPMs based on histogram of oriented gradients (HOG) features and slightly outperform a comparable version of the recently introduced R-CNN detection system, while running significantly faster.
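The abstract does not spell out how an inference step becomes a layer. As one illustration, the distance transform that DPM inference applies to each part's score map can be read as a generalized max pooling, with a deformation penalty subtracted before taking the maximum. The sketch below is a brute-force, pure-Python version written for this page, not the authors' implementation; the function name `dt_pool`, the weight ordering, and the bounded search `radius` are assumptions made for illustration.

```python
def dt_pool(scores, w, radius):
    """Brute-force distance-transform pooling over a 2D score map.

    scores: list of rows of floats (a part filter's response map)
    w:      (w_dx, w_dx2, w_dy, w_dy2) deformation weights (assumed ordering)
    radius: largest displacement searched in each direction (an assumption;
            it bounds the pooling window for this sketch)
    """
    height, width = len(scores), len(scores[0])
    w_dx, w_dx2, w_dy, w_dy2 = w
    out = [[float("-inf")] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            best = float("-inf")
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    yy, xx = y + dy, x + dx
                    if 0 <= yy < height and 0 <= xx < width:
                        # Linear plus quadratic penalty for moving the part.
                        cost = w_dx * dx + w_dx2 * dx * dx + w_dy * dy + w_dy2 * dy * dy
                        best = max(best, scores[yy][xx] - cost)
            out[y][x] = best
    return out


# With all weights zero, this reduces to stride-1 max pooling over the window.
scores = [[0.0, 0.0, 0.0], [0.0, 5.0, 0.0], [0.0, 0.0, 0.0]]
max_pooled = dt_pool(scores, (0, 0, 0, 0), 1)
penalized = dt_pool(scores, (0, 1, 0, 1), 1)
```

Setting the weights to zero recovers ordinary max pooling, which is why this step fits naturally into a CNN. Nonzero weights make the peak response decay as the part moves away from its anchor position.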


Keywords

deformable part models, convolutional neural networks, DPMs, CNNs, visual recognition, graphical models, Markov random fields, black-box nonlinear classifiers, DPM inference algorithm, feature extractor, DeepPyramid DPM, PASCAL VOC object detection, histograms of oriented gradients features, HOG
