Heterogeneous Multi-column ConvNets with a fusion framework for object recognition

Y. Li; F. Sohel; M. Bennamoun; H. Lei

doi:10.1109/WACV.2015.108

Conference paper

2015 IEEE Winter Conference on Applications of Computer Vision, pp.773-780

IEEE Winter Conference on Applications of Computer Vision (WACV) 2015 (Waikoloa, HI, 05/01/2015–09/01/2015)

2015

DOI: https://doi.org/10.1109/WACV.2015.108

Abstract

This paper investigates heterogeneous multi-column ConvNets (MCCNN) and the fusion methods used with them. We first construct a heterogeneous MCCNN by combining ConvNets with different structures. We then apply several fusion methods and compare their performance to determine how the choice of fusion method affects the MCCNN. We also propose a novel sliding-window-based fusion framework, which selects a specific subset of columns from the MCCNN for fusion. Two strategies (exhaustive sliding window and sliding window from training) are investigated to determine which gives the best fusion performance. We tested the heterogeneous MCCNN and sliding window fusion on the MNIST dataset for optical character recognition. Experiments show that the MCCNN improves recognition accuracy over a single-column ConvNet. Moreover, sliding window fusion is a more general fusion method and consistently achieves better results than traditional fusion methods. We also tested the MCCNN and sliding window fusion on the CIFAR-10 and Caltech-256 datasets, where we achieved results superior to existing state-of-the-art techniques.
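
The abstract does not spell out how the sliding window picks its columns, so the following is only an illustrative sketch of one possible reading, not the authors' implementation. It assumes that each column outputs per-class probabilities, that the columns are held in a fixed order, that a "window" is a contiguous run of columns, and that fusion is a plain average. The names `fuse_average`, `accuracy` and `best_window` are hypothetical.

```python
def fuse_average(probs, columns):
    """Average the class probabilities of the chosen columns, return argmax labels.

    probs[c][i][k] is the probability column c assigns to class k for sample i
    (an assumed layout, not taken from the paper).
    """
    columns = list(columns)
    n_samples = len(probs[0])
    n_classes = len(probs[0][0])
    preds = []
    for i in range(n_samples):
        mean = [sum(probs[c][i][k] for c in columns) / len(columns)
                for k in range(n_classes)]
        preds.append(max(range(n_classes), key=mean.__getitem__))
    return preds


def accuracy(preds, labels):
    """Fraction of predictions that match the labels."""
    return sum(p == y for p, y in zip(preds, labels)) / len(labels)


def best_window(probs, labels):
    """Try every contiguous window of columns; return (accuracy, start, size)."""
    n_columns = len(probs)
    best = (-1.0, 0, 0)
    for size in range(1, n_columns + 1):
        for start in range(n_columns - size + 1):
            window = range(start, start + size)
            acc = accuracy(fuse_average(probs, window), labels)
            if acc > best[0]:  # keep the first window that reaches the top score
                best = (acc, start, size)
    return best


# Toy data: 3 columns, 2 samples, 2 classes (made up for illustration).
probs = [
    [[0.9, 0.1], [0.6, 0.4]],
    [[0.4, 0.6], [0.2, 0.8]],
    [[0.8, 0.2], [0.3, 0.7]],
]
labels = [0, 1]
print(best_window(probs, labels))
```

Under these assumptions, running `best_window` on labelled held-out data and reusing the chosen window at test time would loosely correspond to selecting the window "from training", while sweeping every window corresponds to the exhaustive strategy. Whether the paper defines the two strategies this way is not stated in the abstract.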

Details

Title: Heterogeneous Multi-column ConvNets with a fusion framework for object recognition
Authors/Creators: Y. Li (Author/Creator) - University of Electronic Science and Technology of China
F. Sohel (Author/Creator) - School of Computer Science and Software Engineering
M. Bennamoun (Author/Creator) - UWA Oceans Institute
H. Lei (Author/Creator) - University of Electronic Science and Technology of China
Publication Details: 2015 IEEE Winter Conference on Applications of Computer Vision, pp.773-780
Conference: IEEE Winter Conference on Applications of Computer Vision (WACV) 2015 (Waikoloa, HI, 05/01/2015–09/01/2015)
Identifiers: 991005545108107891
Murdoch Affiliation: Murdoch University
Language: English
Resource Type: Conference paper
