Generative adversarial networks via a composite annealing of noise and diffusion

Pattern Recognition - Tập 146 - Trang 110034 - 2024
Kensuke Nakamura1, Simon Korman2, Byung-Woo Hong1
1Computer Science Department, Chung-Ang University, Seoul, Republic of Korea
2Department of Computer Science, University of Haifa, Haifa, Israel

Tài liệu tham khảo

Goodfellow, 2014, Generative adversarial nets, 2672 Bond-Taylor, 2021, Deep generative modelling: A comparative review of vaes, gans, normalizing flows, energy-based and autoregressive models, IEEE Trans. Pattern Anal. Mach. Intell. L.D. Tran, S.M. Nguyen, M. Arai, GAN-based noise model for denoising real images, in: Proceedings of the Asian Conference on Computer Vision, 2020. Jin, 2020, AI-GAN: Asynchronous interactive generative adversarial network for single image rain removal, Pattern Recognit., 100, 10.1016/j.patcog.2019.107143 Souibgui, 2020, De-gan: A conditional generative adversarial network for document enhancement, IEEE Trans. Pattern Anal. Mach. Intell., 44, 1180, 10.1109/TPAMI.2020.3022406 Liu, 2023, WSDS-GAN: A weak-strong dual supervised learning method for underwater image enhancement, Pattern Recognit. Yan, 2021, IsGAN: Identity-sensitive generative adversarial network for face photo-sketch synthesis, Pattern Recognit., 119, 10.1016/j.patcog.2021.108077 Wang, 2023, GGD-GAN: Gradient-guided dual-branch adversarial networks for relic sketch generation, Pattern Recognit., 141, 10.1016/j.patcog.2023.109586 de Rosa, 2021, A survey on text generation using generative adversarial networks, Pattern Recognit., 119, 10.1016/j.patcog.2021.108098 Lai, 2017, Semi-supervised learning for optical flow with generative adversarial networks Tian, 2022 Yi, 2019, Generative adversarial network in medical imaging: A review, Med. Image Anal., 58, 10.1016/j.media.2019.101552 Xu, 2023, A comprehensive survey of image augmentation techniques for deep learning, Pattern Recognit., 10.1016/j.patcog.2023.109347 Bosquet, 2023, A full data augmentation pipeline for small object detection based on generative adversarial networks, Pattern Recognit., 133, 10.1016/j.patcog.2022.108998 Nagarajan, 2017, Gradient descent GAN optimization is locally stable Mescheder, 2018, Which training methods for GANs do actually converge?, 3481 H. Berard, G. Gidel, A. Almahairi, P. Vincent, S. Lacoste-Julien, A Closer Look at the Optimization Landscapes of Generative Adversarial Networks, in: 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020. Kurach, 2019, A large-scale study on regularization and normalization in GANs, 3581 Wang, 2020, Stabilizing training of generative adversarial nets via langevin stein variational gradient descent, IEEE Trans. Neural Netw. Learn. Syst., 1 M. Arjovsky, L. Bottou, Towards Principled Methods for Training Generative Adversarial Networks, in: 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24-26, 2017, Conference Track Proceedings, 2017. Krogh, 1992, A simple weight decay can improve generalization, 950 R. Sutton, Two problems with back propagation and other steepest descent learning procedures for networks, in: Proceedings of the Eighth Annual Conference of the Cognitive Science Society, 1986, pp. 823–832. X. Mao, Q. Li, H. Xie, R.Y. Lau, Z. Wang, S. Paul Smolley, Least squares generative adversarial networks, in: Proceedings of the IEEE International Conference on Computer Vision, 2017, pp. 2794–2802. Nowozin, 2016, F-GAN: Training generative neural samplers using variational divergence minimization, 271 Cai, 2020, Utilizing amari-alpha divergence to stabilize the training of generative adversarial networks, Entropy, 22, 410, 10.3390/e22040410 Arjovsky, 2017, Wasserstein generative adversarial networks, 214 S. Jenni, P. Favaro, On stabilizing generative adversarial training with noise, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 12145–12153. Feng, 2021, Understanding noise injection in gans, 3284 Chen, 2021, NM-GAN: Noise-modulated generative adversarial network for video anomaly detection, Pattern Recognit., 116, 10.1016/j.patcog.2021.107969 Biemond, 1979, An edge-preserving recursive noise-smoothing algorithm for image data, IEEE Trans. Syst. Man Cybern., 9, 622, 10.1109/TSMC.1979.4310091 Qiu, 1996, An improved recursive median filtering scheme for image processing, IEEE Trans. Image Process., 5, 646, 10.1109/83.491340 Albu, 2009, One scan shadow compensation and visual enhancement of color images, 3133 Townsend, 2001, Walsh spectrum computations using Cayley graphs, 110 Radford, 2015 Karras, 2020, Training generative adversarial networks with limited data, Adv. Neural Inf. Process. Syst., 33, 12104 Salimans, 2016, Improved techniques for training GANs, 2234 T. Miyato, T. Kataoka, M. Koyama, Y. Yoshida, Spectral Normalization for Generative Adversarial Networks, in: 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings, 2018. Kodali, 2017 Roth, 2017, Stabilizing training of generative adversarial networks through regularization, 2018 Heusel, 2017, GANs trained by a two time-scale update rule converge to a local Nash equilibrium Schaefer, 2020, Implicit competitive regularization in GANs, vol. 119, 8533 Zhang, 2019, Progressive augmentation of GANs, 6249 Zhang, 2016 Hazan, 2017, Adversarial perturbations of deep neural networks, 311 Sajjadi, 2018, Tempered adversarial networks, 4451 Karras, 2017 A. Karnewar, O. Wang, MSG-GAN: Multi-Scale Gradients for Generative Adversarial Networks, in: IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR, 2020. Zhao, 2020, Differentiable augmentation for data-efficient gan training, Adv. Neural Inf. Process. Syst., 33, 7559 Tran, 2021, On data augmentation for gan training, IEEE Trans. Image Process., 30, 1882, 10.1109/TIP.2021.3049346 Shi, 2023, Cutout with patch-loss augmentation for improving generative adversarial networks against instability, Comput. Vis. Image Underst., 10.1016/j.cviu.2023.103761 Jun, 2020, Distribution augmentation for generative modeling, 5006 T. Chen, X. Zhai, M. Ritter, M. Lucic, N. Houlsby, Self-supervised gans via auxiliary rotation loss, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 12154–12163. C. Sønderby, J. Caballero, L. Theis, W. Shi, F. Huszár, Amortised MAP Inference for Image Super-resolution, in: International Conference on Learning Representations, 2017, pp. 1–17. Wang, 2023, Diffusion-GAN: Training GANs with diffusion Witkin, 1987, Scale-space filtering, 329 Lindeberg, 2013 A. Bora, E. Price, A.G. Dimakis, AmbientGAN: Generative models from lossy measurements, in: International Conference on Learning Representations, 2018, pp. 1–22. Kingma, 2014 Z. Liu, P. Luo, X. Wang, X. Tang, Deep Learning Face Attributes in the Wild, in: Proceedings of International Conference on Computer Vision, ICCV, 2015. Yu, 2015 M.-E. Nilsback, A. Zisserman, Automated Flower Classification over a Large Number of Classes, in: Indian Conference on Computer Vision, Graphics and Image Processing, 2008. Chen, 2016, Infogan: Interpretable representation learning by information maximizing generative adversarial nets, 1 D. Pathak, P. Krahenbuhl, J. Donahue, T. Darrell, A.A. Efros, Context encoders: Feature learning by inpainting, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 2536–2544. Odena, 2017, Conditional image synthesis with auxiliary classifier gans, 2642 H. Zhang, T. Xu, H. Li, S. Zhang, X. Wang, X. Huang, D.N. Metaxas, Stackgan: Text to photo-realistic image synthesis with stacked generative adversarial networks, in: Proceedings of the IEEE International Conference on Computer Vision, 2017, pp. 5907–5915. J.-Y. Zhu, T. Park, P. Isola, A.A. Efros, Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks, in: Computer Vision (ICCV), 2017 IEEE International Conference on, 2017. A. Brock, J. Donahue, K. Simonyan, Large Scale GAN Training for High Fidelity Natural Image Synthesis, in: International Conference on Learning Representations, 2019. T. Karras, S. Laine, T. Aila, A style-based generator architecture for generative adversarial networks, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 4401–4410. X. Mao, Q. Li, H. Xie, R.Y. Lau, Z. Wang, S. Paul Smolley, Least squares generative adversarial networks, in: Proceedings of the IEEE International Conference on Computer Vision, 2017, pp. 2794–2802. LeCun, 1998, Gradient-based learning applied to document recognition, Proc. IEEE, 86, 2278, 10.1109/5.726791