Linear regression-based classifier for audio visual person identification

M.R. Alam; R. Togneri; F. Sohel; M. Bennamoun; I. Naseem

doi:10.1109/ICCSPA.2013.6487281

Conference paper

2013 1st International Conference on Communications, Signal Processing, and their Applications (ICCSPA), pp.1-5

1st International Conference on Communications, Signal Processing, and their Applications (ICCSPA) 2013 (Sharjah, 12/02/2013–14/02/2013)

2013

DOI: https://doi.org/10.1109/ICCSPA.2013.6487281

Abstract

This paper presents an audio-visual (AV) person identification system based on the Linear Regression-based Classifier (LRC). Class-specific models are created by stacking q-dimensional speech and image vectors from the training data. Person identification is treated as a linear regression problem: a test (speech or image) feature vector is expressed as a linear combination of the (speech or image) model of the class it belongs to. The Euclidean distances between a test feature vector and the estimated response vectors for all the class-specific models are used as matching scores. The matching scores from both modalities are normalized using min-max score normalization and then combined using the sum rule of fusion. The system was tested on 88 subjects from the AusTalk AV database. Experimental results show that identification accuracy after AV fusion is higher than that of either individual modality.
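
The pipeline summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions: the estimated response vector is taken to be the least-squares projection of the test vector onto the span of a class's stacked training vectors (computed here by Gram-Schmidt orthogonalization rather than an explicit matrix inverse), lower fused distance is taken to mean a better match, and all function names, subject labels and toy 3-dimensional vectors are hypothetical.

```python
import math

def project_onto_span(columns, y):
    """Least-squares estimate of y from the span of a class's training vectors."""
    basis = []
    for col in columns:
        v = list(col)
        for q in basis:  # remove components along earlier basis vectors
            coeff = sum(a * b for a, b in zip(q, v))
            v = [a - coeff * b for a, b in zip(v, q)]
        norm = math.sqrt(sum(a * a for a in v))
        if norm > 1e-12:  # skip (near-)linearly dependent columns
            basis.append([a / norm for a in v])
    y_hat = [0.0] * len(y)
    for q in basis:
        coeff = sum(a * b for a, b in zip(q, y))
        y_hat = [a + coeff * b for a, b in zip(y_hat, q)]
    return y_hat

def lrc_distances(class_models, y):
    """Euclidean distance between y and its estimate under each class model."""
    distances = {}
    for label, columns in class_models.items():
        y_hat = project_onto_span(columns, y)
        distances[label] = math.sqrt(sum((a - b) ** 2 for a, b in zip(y, y_hat)))
    return distances

def min_max(scores):
    """Min-max normalization of one modality's scores to the range [0, 1]."""
    lo, hi = min(scores.values()), max(scores.values())
    span = hi - lo
    return {k: (v - lo) / span if span else 0.0 for k, v in scores.items()}

def fuse_and_identify(audio_scores, visual_scores):
    """Sum-rule fusion of normalized distances; smallest fused score wins (assumed)."""
    a, v = min_max(audio_scores), min_max(visual_scores)
    fused = {k: a[k] + v[k] for k in a}
    return min(fused, key=fused.get), fused

# Hypothetical toy data: each class model is a list of training vectors (columns).
audio_models = {
    "alice": [[1, 0, 0], [0, 1, 0]],
    "bob":   [[0, 0, 1], [0, 1, 0]],
    "carol": [[1, 0, 0], [0, 0, 1]],
}
visual_models = {
    "alice": [[1, 1, 0], [0, 0, 1]],
    "bob":   [[1, -1, 0], [0, 0, 1]],
    "carol": [[1, 0, 0], [0, 0, 1]],
}
audio_d = lrc_distances(audio_models, [1.0, 0.5, 0.1])
visual_d = lrc_distances(visual_models, [2.0, 2.0, 0.5])
identity, fused = fuse_and_identify(audio_d, visual_d)
print(identity, fused)
```

The toy vectors are constructed so that the test samples lie closest to the first subject's subspace in both modalities. With real data, each class model would hold that subject's q-dimensional speech or image feature vectors.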

Details

Title: Linear regression-based classifier for audio visual person identification
Authors/Creators: M.R. Alam (Author/Creator) - The University of Western Australia
R. Togneri (Author/Creator) - The University of Western Australia
F. Sohel (Author/Creator) - The University of Western Australia
M. Bennamoun (Author/Creator) - The University of Western Australia
I. Naseem (Author/Creator) - Karachi Institute of Economics and Technology
Publication Details: 2013 1st International Conference on Communications, Signal Processing, and their Applications (ICCSPA), pp.1-5
Conference: 1st International Conference on Communications, Signal Processing, and their Applications (ICCSPA) 2013 (Sharjah, 12/02/2013–14/02/2013)
Identifiers: 991005540377407891
Murdoch Affiliation: Murdoch University
Language: English
Resource Type: Conference paper
