CDDSA: Contrastive domain disentanglement and style augmentation for generalizable medical image segmentation

Medical Image Analysis - Tập 89 - Trang 102904 - 2023
Ran Gu1, Guotai Wang1,2, Jiangshan Lu1, Jingyang Zhang3,4, Wenhui Lei5,2, Yinan Chen6,7, Wenjun Liao8, Shichuan Zhang8, Kang Li7, Dimitris N. Metaxas9, Shaoting Zhang1,6,2
1School of Mechanical and Electrical Engineering, University of Electronic Science and Technology of China, Chengdu, China
2Shanghai AI Lab, Shanghai, China
3School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
4School of Biomedical Engineering, ShanghaiTech University, Shanghai, China
5School of Electronic, Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai, China
6SenseTime Research, Shanghai, China
7West China Hospital-SenseTime Joint Lab, West China Biomedical Big Data Center, Sichuan University, Chengdu, China
8Department of Radiation Oncology, Sichuan Cancer Hospital and Institute, University of Electronic Science and Technology of China, Chengdu, China
9Department of Computer Science, Rutgers University, Piscataway, NJ, 08854, USA

Tài liệu tham khảo

Bengio, 2013, Representation learning: A review and new perspectives, IEEE Trans. Pattern Anal. Mach. Intell., 35, 1798, 10.1109/TPAMI.2013.50 Chaitanya, 2020, Contrastive learning of global and local features for medical image segmentation with limited annotations, Adv. Neural Inf. Process. Syst., 33 Chartsias, 2019, Disentangled representation learning in cardiac image analysis, Med. Image Anal., 58, 10.1016/j.media.2019.101535 Chartsias, 2020, Disentangle, align and fuse for multimodal and semi-supervised image segmentation, IEEE Trans. Med. Imaging, 40, 781, 10.1109/TMI.2020.3036584 Chen, 2018, Semantic-aware generative adversarial nets for unsupervised domain adaptation in chest X-Ray segmentation, 143 Chen, 2020, A simple framework for contrastive learning of visual representations, 1597 Dou, 2019, Domain generalization via model-agnostic learning of semantic features, Adv. Neural Inf. Process. Syst., 32 Duan, 2020 Fick, 2021, Domain-specific cycle-GAN augmentation improves domain generalizability for mitosis detection, 40 Fumero, 2011, RIM-ONE: An open retinal image database for optic nerve evaluation, 1 Ganin, 2016, Domain-adversarial training of neural networks, J. Mach. Learn. Res., 17, 189 Gatys, L.A., Ecker, A.S., Bethge, M., 2016. Image style transfer using convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 2414–2423. Gu, 2021, CA-Net: Comprehensive attention convolutional neural networks for explainable medical image segmentation, IEEE Trans. Med. Imaging, 40, 699, 10.1109/TMI.2020.3035253 Gu, 2021, Domain composition and attention for unseen-domain generalizable medical image segmentation, 241 Gu, 2022, Contrastive semi-supervised learning for domain adaptive segmentation across similar anatomical structures, IEEE Trans. Med. Imaging, 1 Guan, 2021, Domain adaptation for medical image analysis: a survey, IEEE Trans. Biomed. Eng., 69, 1173, 10.1109/TBME.2021.3117407 Hadsell, 2006, Dimensionality reduction by learning an invariant mapping, 1735 He, K., Fan, H., Wu, Y., Xie, S., Girshick, R., 2020. Momentum contrast for unsupervised visual representation learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 9729–9738. Hu, 2022, Domain and content adaptive convolution based multi-source domain generalization for medical image segmentation, IEEE Trans. Med. Imaging, 42, 233, 10.1109/TMI.2022.3210133 Hu, 2022, Domain and content adaptive convolution for domain generalization in medical image segmentation, IEEE Trans. Med. Imaging, 1 Huang, X., Liu, M.-Y., Belongie, S., Kautz, J., 2018. Multimodal unsupervised image-to-image translation. In: Proceedings of the European Conference on Computer Vision (ECCV). pp. 172–189. Jang, 2017, Categorical reparameterization with gumbel-softmax Kamnitsas, 2017, Unsupervised domain adaptation in brain lesion segmentation with adversarial networks, 597 Kang, G., Jiang, L., Yang, Y., Hauptmann, A.G., 2019. Contrastive adaptation network for unsupervised domain adaptation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 4893–4902. Kingma, 2014, Auto-encoding variational bayes Lei, 2021, Contrastive learning of relative position regression for one-shot object localization in 3D medical images, 155 Li, Y., Gong, M., Tian, X., Liu, T., Tao, D., 2018c. Domain generalization via conditional invariant representations. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32. Li, 2022, Domain generalization on medical imaging classification using episodic training with task augmentation, Comput. Biol. Med., 141, 10.1016/j.compbiomed.2021.105144 Li, H., Pan, S.J., Wang, S., Kot, A.C., 2018b. Domain generalization with adversarial feature learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 5400–5409. Li, D., Yang, Y., Song, Y.-Z., Hospedales, T., 2018a. Learning to generalize: Meta-learning for domain generalization. In: Proceedings of the AAAI Conference on Artificial Intelligence, Vol.32. Liao, 2022, Automatic delineation of gross tumor volume based on magnetic resonance imaging by performing a novel semi-supervised learning framework in nasopharyngeal carcinoma, Int. J. Radiat. Oncol. Biol. Phys. Liu, Q., Chen, C., Qin, J., Dou, Q., Heng, P.-A., 2021. Feddg: Federated domain generalization on medical image segmentation via episodic learning in continuous frequency space. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 1013–1023. Liu, 2020, Shape-aware meta-learning for generalizing prostate mri segmentation to unseen domains, 475 Meng, 2019, Representation disentanglement for multi-task learning with application to fetal ultrasound, 47 Muandet, 2013, Domain generalization via invariant feature representation, 10 Ning, 2021, A new bidirectional unsupervised domain adaptation segmentation framework, 492 Oord, 2018 Orlando, 2020, Refuge challenge: A unified framework for evaluating automated methods for glaucoma assessment from fundus photographs, Med. Image Anal., 59, 10.1016/j.media.2019.101570 Pei, 2021, Disentangle domain features for cross-modality cardiac image segmentation, Med. Image Anal., 71, 10.1016/j.media.2021.102078 Ronneberger, 2015, U-net: Convolutional networks for biomedical image segmentation, 234 Shen, 2017, Deep learning in medical image analysis, Annu. Rev. Biomed. Eng., 19, 221, 10.1146/annurev-bioeng-071516-044442 Sivaswamy, 2015, A comprehensive retinal image dataset for the assessment of glaucoma from the optic nerve head analysis, JSM Biomed. Imaging Data Pap., 2, 1004 Tran, L., Yin, X., Liu, X., 2017. Disentangled representation learning gan for pose-invariant face recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 1415–1424. Tzeng, E., Hoffman, J., Saenko, K., Darrell, T., 2017. Adversarial discriminative domain adaptation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 7167–7176. Wang, 2022, Generalizing to unseen domains: A survey on domain generalization, IEEE Trans. Knowl. Data Eng., 10.1109/TKDE.2022.3178128 Wang, 2020, DoFE: Domain-oriented feature embedding for generalizable fundus image segmentation on unseen datasets, IEEE Trans. Med. Imaging, 10.1109/TMI.2020.3015224 Wang, 2023, Editorial for special issue on explainable and generalizable deep learning methods for medical image computing, Med. Image Anal., 84, 10.1016/j.media.2022.102727 Wang, X., Zhang, R., Shen, C., Kong, T., Li, L., 2021. Dense contrastive learning for self-supervised visual pre-training. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 3024–3033. Wu, 2022, FPL-UDA: Filtered pseudo label-based unsupervised cross-modality adaptation for vestibular schwannoma segmentation, 1 Xie, 2020, MI2GAN: Generative adversarial network for medical image domain adaptation using mutual information constraint, 516 Xie, 2022, Unsupervised domain adaptation for medical image segmentation by disentanglement learning and self-training, IEEE Trans. Med. Imaging, 10.1109/TMI.2022.3192303 Yang, 2019, Unsupervised domain adaptation via disentangled representations: Application to cross-modality liver segmentation, 255 You, 2022, Momentum contrastive voxel-wise representation learning for semi-supervised volumetric medical image segmentation, 639 Zhang, 2020, Generalizing deep learning for medical image segmentation to unseen domains via deep stacked transformation, IEEE Trans. Med. Imaging, 39, 2531, 10.1109/TMI.2020.2973595 Zhou, Z., Qi, L., Shi, Y., 2022a. Generalizable Medical Image Segmentation via Random Amplitude Mixup and Domain Specific Image Restoration. In: Proceedings of the European Conference on Computer Vision (ECCV). Zhou, 2022, Generalizable medical image segmentation via random amplitude mixup and domain-specific image restoration, 420 Zhou, Z., Qi, L., Yang, X., Ni, D., Shi, Y., 2022c. Generalizable cross-modality medical image segmentation via style augmentation and dual normalization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 20856–20865. Zhu, J.-Y., Park, T., Isola, P., Efros, A.A., 2017. Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE International Conference on Computer Vision. pp. 2223–2232.