Common Practices and Taxonomy in Deep Multiview Fusion for Remote Sensing Applications

Mena, Francisco; Arenas, Diego; Nuske, Marlon; Dengel, Andreas

doi:10.1109/jstars.2024.3361556

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing

2024

DOI: 10.1109/jstars.2024.3361556

|View full text |Cite

Common Practices and Taxonomy in Deep Multiview Fusion for Remote Sensing Applications

Francisco Mena,

Diego Arenas,

Marlon Nuske

et al.

Abstract: The advances in remote sensing technologies have boosted applications for Earth observation. These technologies provide multiple observations or views with different levels of information. They might contain static or temporary views with different levels of resolution, in addition to having different types and amounts of noise due to sensor calibration or deterioration. A great variety of deep learning models have been applied to fuse the information from these multiple views, known as deep multiview or multi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

Publication Types

Select...

Article3

Relationship

Self Cite0

Independent3

Authors

Journals

Cited by 3 publications

References 235 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

An Adaptive Multiview SAR Automatic Target Recognition Network Based on Image Attention

Zhang,

Duan,

Zhang

et al. 2024

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing

View full text Add to dashboard Cite

An Adaptive Multiview SAR Automatic Target Recognition Network Based on Image Attention

Zhang,

Duan,

Zhang

et al. 2024

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing

View full text Add to dashboard Cite

Explainable Multimodal Learning in Remote Sensing: Challenges and Future Directions

Günther,

Najjar,

Dengel

2024

IEEE Geosci. Remote Sensing Lett.

View full text Add to dashboard Cite

Enhancing Apple Cultivar Classification Using Multiview Images

Krug,

Hutschenreuther

2024

J. Imaging

View full text Add to dashboard Cite

Apple cultivar classification is challenging due to the inter-class similarity and high intra-class variations. Human experts do not rely on single-view features but rather study each viewpoint of the apple to identify a cultivar, paying close attention to various details. Following our previous work, we try to establish a similar multiview approach for machine-learning (ML)-based apple classification in this paper. In our previous work, we studied apple classification using one single view. While these results were promising, it also became clear that one view alone might not contain enough information in the case of many classes or cultivars. Therefore, exploring multiview classification for this task is the next logical step. Multiview classification is nothing new, and we use state-of-the-art approaches as a base. Our goal is to find the best approach for the specific apple classification task and study what is achievable with the given methods towards our future goal of applying this on a mobile device without the need for internet connectivity. In this study, we compare an ensemble model with two cases where we use single networks: one without view specialization trained on all available images without view assignment and one where we combine the separate views into a single image of one specific instance. The two latter options reflect dataset organization and preprocessing to allow the use of smaller models in terms of stored weights and number of operations than an ensemble model. We compare the different approaches based on our custom apple cultivar dataset. The results show that the state-of-the-art ensemble provides the best result. However, using images with combined views shows a decrease in accuracy by 3% while requiring only 60% of the memory for weights. Thus, simpler approaches with enhanced preprocessing can open a trade-off for classification tasks on mobile devices.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Common Practices and Taxonomy in Deep Multiview Fusion for Remote Sensing Applications

Cited by 3 publications

References 235 publications

An Adaptive Multiview SAR Automatic Target Recognition Network Based on Image Attention

An Adaptive Multiview SAR Automatic Target Recognition Network Based on Image Attention

Explainable Multimodal Learning in Remote Sensing: Challenges and Future Directions

Enhancing Apple Cultivar Classification Using Multiview Images

Contact Info

Product

Resources

About