Transcription of Spatial Pyramid Pooling in Deep Convolutional Networks …
{{id}} {{{paragraph}}}
1 Spatial Pyramid Pooling in Deep ConvolutionalNetworks for Visual RecognitionKaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian SunAbstract Existing deep Convolutional neural Networks (CNNs) require a fixed-size ( , 224 224) input image. This require-ment is artificial and may reduce the recognition accuracy for the images or sub-images of an arbitrary size/ scale . In thiswork, we equip the Networks with another Pooling strategy, Spatial Pyramid Pooling , to eliminate the above requirement. Thenew network structure, called SPP-net, can generate a fixed-length representation regardless of image size/ scale . Pyramidpooling is also robust to object deformations. With these advantages, SPP-net should in general improve all CNN-based imageclassification methods. On the ImageNet 2012 dataset, we demonstrate that SPP-net boosts the accuracy of a variety of CNNarchitectures despite their different designs.
Abstract—Existing deep convolutional neural networks (CNNs) require a fixed-size (e.g., 224 224) input image. This require- ... new network structure, called SPP-net, can generate a fixed-length representation regardless of image size/scale. ... networks [3] cannot; 2) SPP uses multi-level spatial bins, while the sliding window pooling uses ...
Domain:
Source:
Link to this page:
Please notify us if you found a problem with this document:
{{id}} {{{paragraph}}}