Abstract
Hyperspectral image (HSI) classification is vital for environmental monitoring, land cover mapping, and precision agriculture, but its effectiveness is often constrained by the scarcity of labeled samples and high spectral similarity among classes. To address these challenges, we propose CrossCapsViT, a hybrid classification framework that integrates Capsule Networks (CapsNets) and Vision Transformers (ViTs) through a CrossAttention Fusion (CAF) mechanism and a cross-layer adaptive fusion module, enabling richer and more discriminative spectral–spatial feature learning. To further improve efficiency in data scarce scenarios, we embed an Actor–Critic reinforcement learning based active learning (RAL) strategy that jointly leverages accuracy, uncertainty, and diversity in its reward design, guiding the selection of the most informative samples while reducing labeling effort. Experiments conducted on four benchmark datasets (KSC, PU, HU2013, and Salinas) and a custom UAV based saltmarsh dataset (Derrymore, collected with a Pika-L sensor) demonstrate that CrossCapsViT with RAL consistently outperforms CapsViT and other baseline models in terms of classification accuracy, robustness, and generalizability. The proposed framework achieves up to 25% improvement in class-level accuracy on challenging vegetation classes, while reducing dependence on large annotated datasets, highlighting its potential for practical deployment in real-world ecological monitoring and remote sensing applications.
| Original language | English |
|---|---|
| Pages (from-to) | 16314-16332 |
| Number of pages | 19 |
| Journal | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
| Volume | 19 |
| Early online date | 27 Apr 2026 |
| DOIs | |
| Publication status | Published - 2026 |
Keywords
- active learning
- actor-critic model
- capsule networks (CapsNets)
- hyperspectral imaging
- photogrammetry
- reinforcement learning
- remote sensing
- visual transformer (ViT)
Fingerprint
Dive into the research topics of 'Enhancing hyperspectral image classification through reinforcement learning guided active learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver