Dual-Phase Framework for Few-Shot Hyperspectral Image Classification with Spatiospectral Masked Autoencoder and Episode Training

Wijayanti Nurul Khotimah; Mohammed Bennamoun; Farid Boussaid; Lian Xu; Ferdous Sohel

doi:10.1109/TGRS.2025.3541263

Back

Dual-Phase Framework for Few-Shot Hyperspectral Image Classification with Spatiospectral Masked Autoencoder and Episode Training

Journal article

Peer reviewed

Dual-Phase Framework for Few-Shot Hyperspectral Image Classification with Spatiospectral Masked Autoencoder and Episode Training

Wijayanti Nurul Khotimah, Mohammed Bennamoun, Farid Boussaid, Lian Xu and Ferdous Sohel

IEEE transactions on geoscience and remote sensing, Vol.63, 4404616

2025

DOI: https://doi.org/10.1109/TGRS.2025.3541263

Abstract

Autoencoders

combination of global and local prototypes

Data mining

Data models

Feature extraction

Few-shot learning

HSI classification

hybrid episode learning

Hyperspectral imaging

Image reconstruction

Phase transformers

Prototypes

prototypical network

Spatiospectral Masked AutoEncoder

Three-dimensional displays

Training

This paper introduces a two-phase learning approach for hyperspectral image (HSI) classification using few-shot learning. For the first phase, we present a novel spatiospectral masked autoencoder (ssMAE) - an advanced self-supervised learner. For the ssMAE backbone network, we designed a transformer encoder-decoder network, where we replaced the linear layer that is used as the initial feature embedding with a 3D convolutional layer to better extract local spectral-spatial features from 3D visible sub-patches. By tapping into vast unlabelled data, the ssMAE learns general HSI features. In the second phase, the ssMAE encoder is fine-tuned to extract discriminative features for classification by using the few-shot labelled training samples. This is achieved through a unique hybrid episode learning method that integrates the ssMAE encoder in a prototypical network. We innovate with a mix of global and local prototypes (CGL prototype) to refine label predictions. This technique maximizes data usage, focuses on specific samples, and mitagates issues from subpar episodes. Tested on three HSI datasets, our approach outperforms alternative few-shot methods. The code will be made publicly available at https://github.com/Weejaa04/SSMAE.

Details

Title: Dual-Phase Framework for Few-Shot Hyperspectral Image Classification with Spatiospectral Masked Autoencoder and Episode Training
Authors/Creators: Wijayanti Nurul Khotimah - Sepuluh Nopember Institute of Technology
Mohammed Bennamoun - The University of Western Australia
Farid Boussaid - The University of Western Australia
Lian Xu - The University of Western Australia
Ferdous Sohel - Murdoch University, Centre for Crop and Food Innovation
Publication Details: IEEE transactions on geoscience and remote sensing, Vol.63, 4404616
Publisher: IEEE; PISCATAWAY
Number of pages: 1
Grant note: 201902220213903 / Lembaga Pengelola Dana Pendidikan (10.13039/501100014538) UWA2002-003RTX / Grains Research and Development Corporation (10.13039/501100000980) DP210101682 / Australian Research Council (10.13039/501100000923)
Identifiers: 991005743938607891
Murdoch Affiliation: School of Information Technology
Language: English
Resource Type: Journal article

UN Sustainable Development Goals (SDGs)

This output has contributed to the advancement of the following goals:

Source: InCites

Metrics

21 Record Views

5 Times Cited - Web of Science

InCites Highlights

These are selected metrics from InCites Benchmarking & Analytics tool, related to this output

Collaboration types: Domestic collaboration; International collaboration
Citation topics: 4 Electrical Engineering, Electronics & Computer Science; 4.169 Remote Sensing; 4.169.91 Vegetation Mapping
Web Of Science research areas: Engineering, Electrical & Electronic; Geochemistry & Geophysics; Imaging Science & Photographic Technology; Remote Sensing
ESI research areas: Geosciences