Abstract
Rotation invariance is essential for precise object level segmentation in UAV aerial imagery, where targets can have arbitrary orientations and exhibit fine scale details. Conventional segmentation architectures like UNet rely on convolution operators that are not rotation-invariant, leading to degraded segmentation accuracy across varying viewpoints. Rotation invariance can be achieved by expanding the filter bank across multiple orientations; however, this significantly increases computational cost and memory requirement. In this paper, we introduce a GPU-optimized rotation-invariant convolution framework that eliminates the traditional data lowering (im2col) step required for matrix multiplication based convolution. By exploiting structured data sharing among symmetrically rotated filters, our method achieves multi-orientation convolution with greatly reduced memory requirements and computational redundancy. We further generalize the approach to accelerate convolution with arbitrary (non-symmetric) rotation angles. Integrated into a UNet segmentation model, the framework yields up to a 5.7% improvement in accuracy over the non-rotation-aware baseline. Across extensive benchmarks, the proposed convolution achieves 20–57% faster training and 15–45% lower energy consumption than cuDNN, while maintaining accuracy comparable to state of-the-art rotation-invariant methods. Because the scatter-based operator greatly reduces intermediate feature dimensionality, the efficiency of our design also enables practical sixteen-o rientation convolution and pooling, yielding further accuracy gains that areinfeasible for conventional rotation-invariant implementations. Our sixteen-orientation approach achieves competitive accuracy on multiple datasets compared with state-of-the-art UAV segmentation networks. These results demonstrate that the proposed method provides an effective and efficient alternative to existing rotation-invariant convolution frameworks.
| Original language | English |
|---|---|
| Number of pages | 18 |
| Journal | IEEE Transactions on Geoscience & Remote Sensing |
| Early online date | 16 Feb 2026 |
| DOIs | |
| Publication status | E-pub ahead of print - 16 Feb 2026 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- dense convolution
- scatter operation
- rotation invariant
- acceleration
- GPU
Fingerprint
Dive into the research topics of 'Accelerated rotation-invariant convolution for UAV image segmentation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver