Learning-based composite metrics for improved caption evaluation

N. Sharif; M. Bennamoun; L.R. White; S.A.A. Shah

Conference paper

Open access

56th Annual Meeting of the Association for Computational Linguistics (Melbourne, Australia, 15/07/2018–20/07/2018)

2018

Abstract

The evaluation of image caption quality is a challenging task, which requires the assessment of two main aspects of a caption: adequacy and fluency. These quality aspects can be judged using a combination of several linguistic features. However, most current image captioning metrics focus only on specific linguistic facets, such as the lexical or the semantic, and fail to reach a satisfactory level of correlation with human judgements at the sentence level. We propose a learning-based framework that incorporates the scores of a set of lexical and semantic metrics as features, to capture the adequacy and fluency of captions at different linguistic levels. Our experimental results demonstrate that composite metrics draw upon the strengths of standalone measures to yield improved correlation and accuracy.
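The idea of feeding standalone metric scores into a learned model can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration, not taken from the paper: the abstract does not say which learning algorithm is used, so plain logistic regression stands in for it; the two features (one lexical score, one semantic score), the toy values, and the function names `train_composite` and `composite_score` are all hypothetical.

```python
import math


def sigmoid(z):
    """Logistic function mapping a real number to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def train_composite(features, labels, lr=0.5, epochs=2000):
    """Fit logistic-regression weights over per-caption metric scores.

    features: list of [lexical_score, semantic_score] rows (hypothetical).
    labels:   1 if the caption was judged good, 0 otherwise (hypothetical).
    """
    n_samples = len(features)
    n_features = len(features[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(features, labels):
            pred = sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)
            err = pred - y
            for j, xi in enumerate(x):
                grad_w[j] += err * xi
            grad_b += err
        # Batch gradient-descent step on the mean log-loss gradient.
        weights = [w - lr * g / n_samples for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n_samples
    return weights, bias


def composite_score(x, weights, bias):
    """Combine standalone metric scores into one learned score in (0, 1)."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


# Made-up training data: each row is [lexical score, semantic score].
train_x = [[0.9, 0.8], [0.7, 0.9], [0.8, 0.7],
           [0.2, 0.3], [0.1, 0.4], [0.3, 0.1]]
train_y = [1, 1, 1, 0, 0, 0]

weights, bias = train_composite(train_x, train_y)
good = composite_score([0.85, 0.80], weights, bias)
poor = composite_score([0.15, 0.20], weights, bias)
print(f"good caption: {good:.3f}, poor caption: {poor:.3f}")
```

In this sketch the learned weights decide how much each standalone metric contributes, which is one simple way a composite could draw on the strengths of several measures at once.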

Details

Title: Learning-based composite metrics for improved caption evaluation
Authors/Creators: N. Sharif; M. Bennamoun; L.R. White; S.A.A. Shah
Conference: 56th Annual Meeting of the Association for Computational Linguistics (Melbourne, Australia, 15/07/2018–20/07/2018)
Identifiers: 991005540351107891
Murdoch Affiliation: Murdoch University
Language: English
Resource Type: Conference paper
