Causal Reasoning Meets Visual Representation Learning: A Prospective Study

Liu, Yang; Wei, Yu-Shen; Yan, Hong; Li, Guanbin; Lin, Liang

doi:10.1007/s11633-022-1362-z

Cited by 27 publications

(11 citation statements)

References 216 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The ability to judge the goodness of received information enables us to select positive information while eliminating negative ones. This concept has been applied to various computer vision tasks [28][29][30].…”

Section: Label Enhancement Methodsmentioning

confidence: 99%

A progressive segmentation with weight contrast label enhancement for weakly supervised video salient object detection

Liang

et al. 2023

IET Image Processing

View full text Add to dashboard Cite

Scribble labels have gained increasing attention in the field of weakly supervised video salient object detection (VSOD). Based on scribble labels, latest methods can spread labeled pixels to unlabeled regions using local coherence loss, but predicted objects often lose detail and boundary information. In this work, a novel method based on back‐foreground weight contrast is proposed that adds label enhancement points to facilitate the model to learn the edge, detail and location of salient object. Additionally, a new VSOD framework based on global structural localization is introduced. Enhanced scribble labels are used to assist the model for global localization, and then the located regions are finely segmented by the trained model. Extensive experiments demonstrate that the method achieves the state‐of‐the‐art performance on common VSOD datasets, with an improvement of 3.75%, 4.68%, and 0.88% in S‐measure, F‐measure, and MAE, respectively.

show abstract

Section: Label Enhancement Methodsmentioning

confidence: 99%

A progressive segmentation with weight contrast label enhancement for weakly supervised video salient object detection

Liang

et al. 2023

IET Image Processing

View full text Add to dashboard Cite

show abstract

“…For explaining black-box visual classifiers, [56] formulated a causal extension to the paradigm of instance-wise features selection and obtain a subset of input features that has the greatest causal effect on the model's output. [57] conducted a comprehensive review of existing causal reasoning methods for visual representation learning and pointed out the importance of causal reasoning in visual representation learning.…”

Section: B Causal Learningmentioning

confidence: 99%

Unsupervised Domain Adaptation Semantic Segmentation of High-Resolution Remote Sensing Imagery With Invariant Domain-Level Prototype Memory

Zhu

Sun

Yang

et al. 2023

IEEE Trans. Geosci. Remote Sensing

View full text Add to dashboard Cite

Semantic segmentation of high-resolution remote sensing imagery (HRSI) suffers from the domain shift, resulting in poor performance of the model in another unseen domain. Unsupervised domain adaptive (UDA) semantic segmentation aims to adapt the semantic segmentation model trained on the labeled source domain to an unlabeled target domain. However, the existing UDA semantic segmentation models tend to align pixels or features based on statistical information related to labels in source and target domain data, and make predictions accordingly, which leads to uncertainty and fragility of prediction results. In this paper, we propose a causal prototype-inspired contrast adaptation (CPCA) method to explore the invariant causal mechanisms between different HRSIs domains and their semantic labels. It firstly disentangles causal features and bias features from the source and target domain images through a causal feature disentanglement module. Then, a causal prototypical contrast module is used to learn domain invariant causal features. To further de-correlate causal and bias features, a causal intervention module is introduced to intervene on the bias features to generate counterfactual unbiased samples. By forcing the causal features to meet the principles of separability, invariance and intervention, CPCA can simulate the causal factors of source and target domains, and make decisions on the target domain based on the causal features, which can observe improved generalization ability. Extensive experiments under three crossdomain tasks indicate that CPCA is remarkably superior to the state-of-the-art methods.

show abstract

Section: Causal Mechanismsmentioning

confidence: 99%

“…Traditional feature learning methods are prone to learning pseudo-correlation properties introduced by confounding factors, which is not conducive to model generalisation across domains 11 . Causal inference can eliminate this pseudocorrelation by replacing the conditional distribution with an intervening distribution 11 . In the image task, Wang 12 proposed a causal attention module to help deep models learn causal features with robustness by annotating contextual information obfuscation factors in an unsupervised manner.…”

Section: Causal Mechanismsmentioning

confidence: 99%

Pedestrian re-identification domain generalization algorithm based on causal strong and weak alignment

Mo,

Hu,

Yuan

et al. 2023

Fifth International Conference on Artificial Intelligence and Computer Science (AICS 2023)

View full text Add to dashboard Cite

The domain generalisation pedestrian re-identification problem aims to generalise features learned in a known pedestrian data domain to an unknown pedestrian target domain. Most traditional domain generalization methods assume that the statistical properties between features and categories remain consistent across data domains. However, actual pedestrian data is often susceptible to domain factors such as lighting, colour shifts, and camera differences. This results in the data distribution of pedestrians under different domains not being identical and hinders the generalisation of the model. To address the above issues, this paper proposes a representation learning algorithm based on causal strong and weak alignment by constructing a structural causal model for the pedestrian domain generalization problem from the perspective of causal inference. The algorithm first performs causal intervention on the input pedestrian data to obtain causally enhanced images, then the image features are fed into the strong alignment module to achieve feature alignment in each dimension and obtain a preliminary invariant representation, finally, the features are then subjected to the constraints of the weak alignment module contrastive loss to further optimise the causal features under different cameras and improve the stability of the model's cross-domain causal prediction. The method was compared and ablation experiments were carried out on the Market-1501 and DukeMTMC-reID datasets, demonstrating the effectiveness of the proposed method.

show abstract

Causal Reasoning Meets Visual Representation Learning: A Prospective Study

Cited by 27 publications

References 216 publications

A progressive segmentation with weight contrast label enhancement for weakly supervised video salient object detection

A progressive segmentation with weight contrast label enhancement for weakly supervised video salient object detection

Unsupervised Domain Adaptation Semantic Segmentation of High-Resolution Remote Sensing Imagery With Invariant Domain-Level Prototype Memory

Pedestrian re-identification domain generalization algorithm based on causal strong and weak alignment

Contact Info

Product

Resources

About