A dual-branch model with inter- and intra-branch contrastive loss for long-tailed recognition

Neural Networks - Tập 168 - Trang 214-222 - 2023
Qiong Chen1, Tianlin Huang1,2, Geren Zhu1, Enlu Lin1
1School of Computer Science and Engineering, South China University of Technology, China
2Guangdong Provincial Key Laboratory of Artificial Intelligence in Medical Image Analysis and Application, China

Tài liệu tham khảo

Alshammari, S., Wang, Y. X., Ramanan, D., & Kong, S. (2022). Long-tailed recognition via weight balancing. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 6897–6907). Buda, 2018, A systematic study of the class imbalance problem in convolutional neural networks, Neural Networks, 106, 249, 10.1016/j.neunet.2018.07.011 Cai, J., Wang, Y., & Hwang, J. N. (2021). Ace: Ally complementary experts for solving long-tailed recognition in one-shot. In Proceedings of the IEEE/CVF international conference on computer vision (pp. 112–121). Cao, 2019, Learning imbalanced datasets with label-distribution-aware margin loss, Advances in Neural Information Processing Systems, 32 Chawla, 2002, SMOTE: synthetic minority over-sampling technique, Journal of Artificial Intelligence Research, 16, 321, 10.1613/jair.953 Chen, 2020, A simple framework for contrastive learning of visual representations, 1597 Chou, 2020, Remix: rebalanced mixup, 95 Cui, Y., Jia, M., Lin, T. Y., Song, Y., & Belongie, S. (2019). Class-balanced loss based on effective number of samples. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 9268–9277). Cui, 2022, Reslt: Residual learning for long-tailed recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, 10.1109/TPAMI.2022.3174892 Cui, J., Zhong, Z., Liu, S., Yu, B., & Jia, J. (2021). Parametric contrastive learning. In Proceedings of the IEEE/CVF international conference on computer vision (pp. 715–724). Deng, 2009, Imagenet: A large-scale hierarchical image database, 248 Drummond, C., Holte, R. C., et al. (2003). C4. 5, class imbalance, and cost sensitivity: why under-sampling beats over-sampling. In Workshop on learning from imbalanced datasets II, vol. 11 (pp. 1–8). He, K., Fan, H., Wu, Y., Xie, S., & Girshick, R. (2020). Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 9729–9738). He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770–778). Hong, Y., Han, S., Choi, K., Seo, S., Kim, B., & Chang, B. (2021). Disentangling label distribution for long-tailed visual recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 6626–6636). Islam, 2021 Kang, B., Li, Y., Xie, S., Yuan, Z., & Feng, J. (2020). Exploring balanced feature spaces for representation learning. In International conference on learning representations. Kang, 2020, Decoupling representation and classifier for long-tailed recognition Li, T., Cao, P., Yuan, Y., Fan, L., Yang, Y., Feris, R. S., et al. (2022). Targeted supervised contrastive learning for long-tailed recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 6918–6928). Li, S., Gong, K., Liu, C. H., Wang, Y., Qiao, F., & Cheng, X. (2021). Metasaug: Meta semantic augmentation for long-tailed visual recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 5212–5221). Liang, 2020, Paying deep attention to both neighbors and multiple tasks, 140 Lin, T.-Y., Goyal, P., Girshick, R., He, K., & Dollár, P. (2017). Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision (pp. 2980–2988). Liu, Z., Miao, Z., Zhan, X., Wang, J., Gong, B., & Yu, S. X. (2019). Large-scale long-tailed recognition in an open world. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 2537–2546). Loshchilov, 2016 Van der Maaten, 2008, Visualizing data using t-sne, Journal of Machine Learning Research, 9 Menon, 2021, Long-tail learning via logit adjustment More, 2016 Park, S., Lim, J., Jeon, Y., & Choi, J. Y. (2021). Influence-balanced loss for imbalanced visual classification. In Proceedings of the IEEE/CVF international conference on computer vision (pp. 735–744). Pouyanfar, 2018, Dynamic sampling in convolutional neural networks for imbalanced data classification, 112 Shu, 2019, Meta-weight-net: Learning an explicit mapping for sample weighting, Advances in Neural Information Processing Systems, 32 Smith, 2022 Snell, 2017, Prototypical networks for few-shot learning, Advances in Neural Information Processing Systems, 30 Wang, P., Han, K., Wei, X. S., Zhang, L., & Wang, L. (2021). Contrastive learning based hybrid networks for long-tailed image classification. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 943–952). Wang, 2020 Wang, J., Lukasiewicz, T., Hu, X., Cai, J., & Xu, Z. (2021). Rsg: A simple but effective module for learning imbalanced datasets. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 3784–3793). Wang, 2019, Implicit semantic data augmentation for deep networks, Advances in Neural Information Processing Systems, 32 Wang, 2017, Learning to model the tail, Advances in Neural Information Processing Systems, 30 Wang, 2022, Dynamic auxiliary soft labels for decoupled learning, Neural Networks, 151, 132, 10.1016/j.neunet.2022.03.027 Xiang, 2020, Learning from multiple experts: Self-paced knowledge distillation for long-tailed classification, 247 Yang, 2022, A survey on long-tailed visual recognition, International Journal of Computer Vision, 1 Zhang, 2017 Zhang, S., Li, Z., Yan, S., He, X., & Sun, J. (2021). Distribution alignment: A unified framework for long-tail visual recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 2361–2370). Zhang, 2018, Cross-domain attribute representation based on convolutional neural network, 134 Zhang, Y., Wei, X.-S., Zhou, B., & Wu, J. (2021). Bag of tricks for long-tailed visual recognition with deep convolutional neural networks. In Proceedings of the AAAI conference on artificial intelligence, vol. 35, no. 4 (pp. 3447–3455). Zhong, Z., Cui, J., Liu, S., & Jia, J. (2021). Improving calibration for long-tailed recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 16489–16498). Zhou, B., Cui, Q., Wei, X.-S., & Chen, Z. M. (2020). Bbn: Bilateral-branch network with cumulative learning for long-tailed visual recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 9719–9728). Zhu, J., Wang, Z., Chen, J., Chen, Y. P. P., & Jiang, Y.-G. (2022). Balanced contrastive learning for long-tailed visual recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 6908–6917).