Nội dung được dịch bởi AI, chỉ mang tính chất tham khảo

Giải Mờ Ảnh với Học Tập Từ Điển Kết Hợp

Springer Science and Business Media LLC - Tập 114 - Trang 248-271 - 2014

Shiming Xiang¹, Gaofeng Meng¹, Ying Wang¹, Chunhong Pan¹, Changshui Zhang²

¹National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China

²Tsinghua National Laboratory for Information Science and Technology (TNList), Department of Automation, Tsinghua University, Beijing, China

Tóm tắt

Giải mờ ảnh là một bài toán thách thức trong lĩnh vực tính toán thị giác. Truyền thống, nhiệm vụ này được giải quyết như một bài toán nghịch mà đóng trong chính hình ảnh. Bài báo này giới thiệu một khung học tập dựa trên việc khai thác và sử dụng kiến thức ẩn trong một khối lượng lớn dữ liệu có sẵn để giải mờ ảnh. Để đạt được điều này, thuật toán của chúng tôi được phát triển dưới khung khái niệm của học tập từ điển kết hợp. Cụ thể, given các cặp mẫu ảnh bị mờ và ảnh tương ứng rõ nét của chúng, một mô hình học tập được xây dựng để học một cặp từ điển. Trong số đó, một từ điển chịu trách nhiệm cho việc biểu diễn các ảnh rõ nét, trong khi từ điển kia phụ trách việc biểu diễn các ảnh bị mờ. Về lý thuyết, mô hình học tập được phân tích với các biểu diễn thưa kết hợp cho các mẫu huấn luyện. Khi các nguyên tử trong các từ điển này được liên kết với nhau từng cặp một, thông tin tái cấu trúc có thể được truyền giữa các ảnh rõ nét và bị mờ. Trong giai đoạn ứng dụng, từ điển bị mờ được sử dụng để tái cấu trúc tuyến tính ảnh bị mờ cần phục hồi. Sau đó, các hệ số tái cấu trúc được giữ nguyên cùng với từ điển rõ nét để phục hồi kết quả cuối cùng. Điểm mạnh chính của phương pháp của chúng tôi nằm ở việc nó hoạt động trong trường hợp không biết các kernel mờ. Các thí nghiệm so sánh cho thấy tính hợp lệ của phương pháp của chúng tôi.

Từ khóa

#giải mờ ảnh #học tập từ điển #tính toán thị giác #mô hình học tập #biểu diễn thưa

Tài liệu tham khảo

Afonso, M. V., Bioucas-Dias, J. M., & Figueiredo, M. (2010). Fast image recovery using variable splitting and constrained optimization. IEEE Transactions on Image Processing, 19(9), 2345–2356. Aharon, M., Elad, M., & Bruckstein, A. M. (2006). K-svd: An algorithm for designing of over-complete dictionaries for sparse representationn. IEEE Transactions on Signal Processing, 54(11), 4311–4322. Bar, L., Sochen, N., & Kiryati, N. (2007). Restoration of images with piecewise space-variant blur. In International conference on scale space and variational methods in computer vision (pp. 533–544). Ischia, Italy. Barbero, A., & Sra, S. (2011). Fast newton-type methods for total variation regularization. In International conference on machine learning (pp. 313–320). Bellevue, Washington, USA. Batra, D., Kowdle, A., Parikh, D., Luo, J., & Chen, T. (2010). icoseg: Interactive co-segmentation with intelligent scribble guidance. In IEEE conference on computer vision and pattern recognition (pp. 3169–3176). San Francisco, CA, USA. Batra, D., Kowdle, A., Parikh, D., Luo, J., & Chen, T. (2011). Interactively co-segmentating topically related images with intelligent scribble guidance. International Journal of Computer Vision, 93(3), 273–292. Beck, A., & Teboulle, M. (2009a). Fast gradient-based algorithms for constrained total variation image denoising and deblurring problems. IEEE Transactions on Image Processing, 18(11), 2419–2434. Beck, A., & Teboulle, M. (2009b). A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM Journal on Imaging Sciences, 2(1), 183–202. Bioucas-Dias, J. M., & Figueiredo, M. (2007). A new twist: Two-step iterative shrinkage/thresholding algorithms for image restoration. IEEE Transactions on Image Processing, 16(12), 2992–3004. Bronstein, M. M., Bronstein, A. M., Zibulevsky, M., & Zeevi, Y. Y. (2005). Blind deconvolution of images using optimal sparse representations. IEEE Transactions on Image Processing, 14(6), 726–736. Chantas, G. K., Galatsanos, N. P., Molina, R., & Katsaggelos, A. K. (2010). Variational bayesian image restoration with a product of spatially weighted total variation image priors. IEEE Transactions on Image Processing, 19(2), 351–362. Couzinie-Devy, F., Mairal, J., Bach, F., & Ponce, J. (2011). Dictionary learning for deblurring and digital zoom. CoRR. arXiv:1110.0957. Dabov, K., Foi, A., Katkovnik, V., & Egiazarian, K. (2007). Image denoising by sparse 3d transform-domain collaborative filtering. IEEE Transactions on Image Processing, 16(8), 2080–2095. Danielyan, A., Katkovnik, V., & Egiazarian, K. (2012). Bm3d frames and variational image deblurring. IEEE Transactions on Image Processing, 21(4), 1715–1728. Dong, W., Zhang, L., Shi, G., & Li, X. (2013). Nonlocally centralized sparse representation for image restoration. IEEE Transactions on Image Processing, 22(4), 1620–1630. Dong, W., Zhang, L., Shi, G., & Wu, X. (2011). Image deblurring and super-resolution by adaptive sparse domain selection and adaptive regularization. IEEE Transactions on Image Processing, 20(7), 1838–1857. Elad, M., & Aharon, M. (2006). Image denoising via sparse and redundant representations over learned dictionaries. IEEE Transactions on Image Processing, 15(12), 3736–3745. Elad, M., Figueiredo, M., & Ma, Y. (2010). On the role of sparse and redundant representations in image processing. Proceedings of the IEEE, 98(6), 972–982. Engan, K., Aase, S. O., & Husoy, J. H. (2000). Multi-frame compression: Theory and design. EURASIP Signal Processing, 80(10), 2121–2140. Fergus, R., Singh, B., Hertzmann, A., Roweis, S. T., & Freeman, W. T. (2006). Removing camera shake from a single photograph. ACM Transactions on Graphics (SIGGRAPH), 25(3), 787–794. Goldluecke, B., & Cremers, D. (2010). An approach to vectorial total variation based on geometric measure theory. In IEEE international conference on computer vision and pattern recognition (pp. 327–333). San Francisco, CA, USA. Gorodnitsky, I. F., & Rao, B. D. (1997). Sparse signal reconstruction from limited data using focuss: A re-weighted norm minimization algorithm. IEEE Transactions on Signal Processing, 45(3), 600–616. He, L., Qi, H., & Zaretzki, R. (2013). Beta process joint dictionary learning for coupled feature spaces with application to single image super-resolution. In IEEE conference on computer vision and pattern recognition (pp. 345–352). Portland, Oregon, USA. Hu, Z., Huang, J. B., & Yang, M. H. (2010). Single image deblurring with adaptive dictionary learning. In IEEE international conference on image processing (pp. 1169–1172). Hong Kong, China. Jenatton, R., Mairal, J., Obozinski, G., & Bach, F. (2011). Proximal methods for hierarchical sparse coding. Journal of Machine Learning Research, 12, 2297–2334. Jia, K., Wang, X., & Tang, X. (2013). Image transformation based on learning dictionaries across image spaces. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(2), 367–380. Joshi, N., Kang, S. B., Zitnick, C. L., & Szeliski, R. (2010). Image deblurring using inertial measurement sensors. In SIGGRAPH (pp. 1–8). Los Angeles, CA, USA. Joshi, N., Zitnick, C. L., Szeliski, R., & Kriegman, D. J. (2009). Image deblurring and denoising using color priors. In International conference on computer vision and pattern recognition (pp. 1–8). Miami, FL, USA. Kamilov, U., Bostan, E., & Unser, M. (2012). Wavelet shrinkage with consistent cycle spinning generalizes total variation denoising. IEEE Signal Processing Letters, 19(4), 187–190. Kenig, T., Kam, Z., & Feuer, A. (2010). Blind image deconvolution using machine learning for three-dimensional microscopy. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(12), 2191–2204. Levin, A., Fergus, R., Durand, F., & Freeman, W. T. (2007). Coded exposure photography: Motion deblurring using fluttered shutter. ACM Transactions on Graphics (SIGGRAPH), 26(3), 70:1–70:9. Levin, A., Weiss, Y., Durand, F., & Freeman, W. T. (2009). Understanding and evaluating blind deconvolution algorithms. In International conference on computer vision and pattern recognition (pp. 1–8). Miami, FL, USA. Li, D., Mersereau, R. M., & Simske, S. (2007). Blind image deconvolution through support vector regression. IEEE Transactions on Neural Networks, 18(3), 931–935. Lou, Y., Bertozzi, A. L., & Soatto, S. (2011). Direct sparse deblurring. Journal of Mathematical Imaging and Vision, 39(1), 1–12. Lucy, L. (1974). An iterative technique for the rectification of observed distributions. The Astronomical Journal, 79(6), 745–754. Ma, L., Moisan, L., Yu, J., & Zeng, T. (2013). A dictionary learning approach for poisson image deblurring. IEEE Transactions on Image Processing, 32(7), 1277–1289. Ma, S., Yin, W., Zhang, Y., & Chakraborty, A. (2008). An efficient algorithm for compressed MR imaging using total variation and wavelets. In IEEE international conference on computer vision and pattern recognition (pp. 1–8). Anchorage, Alaska, USA. Mairal, J., Bach, F., & Ponce, J. (2012). Task-driven dictionary learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(4), 791–804. Mairal, J., Bach, F., Ponce, J., & Sapiro, G. (2010). Online learning for matrix factorization and sparse coding. Journal of Machine Learning Research, 11, 19–60. Mairal, J., Bach, F., Ponce, J., Sapiro, G., & Zisserman, A. (2009). Non-local sparse models for image restoration. In IEEE international conference on computer vision (pp. 2272–2279). Kyoto, Japan. Mairal, J., Elad, M., & Sapiro, G. (2008). Sparse representation for color image restoration. IEEE Transactions on Image Processing, 17(1), 53–69. Mairal, J., Jenatton, R., Obozinski, G., & Bach, F. (2011). Convex and network flow optimization for structured sparsity. Journal of Machine Learning Research, 12, 2681–2720. Peleg, T., Eldar, Y., & Elad, M. (2013). Exploiting statistical dependencies in sparse representations for signal recovery. IEEE Transactions on Signal Processing, 60(5), 2286–2303. Portilla, J. (2009). Image restoration through l0 analysis-based sparse optimization in tight frames. In IEEE international conference on image processing (pp. 3909–3912). Cairo, Egypt. Richardson, W. H. (1972). Bayesian-based iterative method of image restoration. Journal of the Optical Society of America, 62(1), 55–59. Rodriguez, P., & Wohlberg, B. (2009). Efficient minimization method for a generalized total variation functional. IEEE Transactions on Image Processing, 18(2), 322–332. Rubinstein, R., Peleg, T., & Elad, M. (2013). Analysis k-svd: A dictionary-learning algorithm for the analysis sparse model. IEEE Transactions on Signal Processing, 61(3), 661–677. Rubinstein, R., Zibulevsky, M., & Elad, M. (2010). Double sparsity: Learning sparse dictionaries for sparse signal approximation. IEEE Transactions on Signal Processing, 58(3), 1553–1564. Rudin, L. I., Osher, S., & Fatemi, E. (1992). Nonlinear total variation based noise removal algorithms. Physica D, 60(1), 259–268. Shan, Q., Jia, J., & Agarwala, A. (2008). High-quality motion deblurring from a single image. ACM Transactions on Graphics (SIGGRAPH), 27(3), 73:1–73:10. Su, M., & Basu, M. (2002). A hybrid learning system for image deblurring. Pattern Recognition, 35(12), 2881–2894. Takeda, H., Farsiu, S., & Milanfar, P. (2008). Deblurring using regularized locally adaptive kernel regression. IEEE Transactions on Image Processing, 17(4), 550–563. Tropp, J. A. (2004). Greed is good: Algorithmic results for sparse approximation. IEEE Transactions on Information Theory, 50(10), 2231–2242. Wang, S., Zhang, L., Liang, Y., & Pan, Q. (2012). Semi-coupled dictionary learning with applications to image super-resolution and photo-sketch synthesis. In IEEE conference on computer vision and pattern recognition (pp. 2216–2223). Providence, Rhode Island, USA. Wang, Z., & Li, Q. (2011). Information content weighting for perceptual image quality assessment. IEEE Transactions on Image Processing, 20(5), 1185–1198. Wen, Y. W., Chan, R. H., & Yip, A. M. (2012). A primal-dual method for total-variation-based wavelet domain inpainting. IEEE Transactions on Image Processing, 21(1), 106–114. Whyte, O., Sivic, J., Zisserman, A., & Ponce, J. (2012). Non-uniform deblurring for shaken images. International Journal of Computer Vision, 98(2), 168–186. Xiang, S., Nie, F., & Zhang, C. (2010). Semi-supervised classification via local spline regression. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(11), 2039–2053. Xu, L., Zheng, S., & Jia, J. (2013). Unnatural l0 sparse representation for natural image deblurring. In IEEE conference on computer vision and pattern recognition (pp. 1107–1114). Portland, Oregon, USA. Yang, J., Wang, Z., Lin, Z., Cohen, S., & Huang, T. S. (2012). Coupled dictionary training for image super-resolution. IEEE Transactions on Image Processing, 21(8), 3467–3478. Yang, J., Wright, J., Huang, T. S., & Ma, Y. (2010). Image super-resolution via sparse representation. IEEE Transactions on Signal Processing, 19(11), 2861–2873. Yuan, L., Sun, J., Quan, L., & Shum, H. Y. (2007). Image deblurring with blurred/noisy image pairs. ACM Transactions on Graphics (SIGGRAPH), 26(3), 1–10. Zeyde, R., Elad, M., & Protter, M. (2010). On single image scale-up using sparse-representations. In Proceedings of the 7th international conference on curves and surfaces (pp. 711–730). Avignon, France. Zhang, H., Wipf, D. P., & Zhang, Y. (2013). Multi-image blind deblurring using a coupled adaptive sparse prior. In IEEE conference on computer vision and pattern recognition (pp. 1051–1058). Portland, Oregon, USA. Zhang, H., Yang, J., Zhang, Y., & Huang, T. S. (2011). Sparse representation based blind image deblurring. In IEEE international conference on multimedia and expo (pp. 1–6). Barcelona, Spain.

Scholar Hub - Công cụ hỗ trợ trích dẫn và phân tích khoa học Việt Nam

Scholar Hub là công cụ hỗ trợ trích dẫn và phân tích ảnh hưởng của các bài báo, công bố khoa học Việt Nam và Quốc tế.
ScholarHub KHÔNG đăng thông tin tổng hợp, KHÔNG đăng lại nội dung từ các trang báo chí Việt Nam hoặc trang thông tin điện tử khác tại Việt Nam.

Thông tin, cập nhật

Đăng ký Tạp chí tham gia Scholar Hub

Phản hồi ý kiến về Scholar Hub

Bài viết, nội dung cập nhật

Chủ đề khoa học

Website liên kết

Hệ thống CSDL Khoa học & Công nghệ SciBase

Phần mềm kiểm tra trùng lặp Kiểm Tra Tài Liệu

Phần mềm xuất bản tạp chí điện tử VOJS

Hệ thống hội thảo khoa học Việt Nam

Nền tảng trắc nghiệm và đề thi đa lĩnh vực LetQA

Thông tin liên hệ & hỗ trợ

Đơn vị chủ quản, phát triển và vận hành: Công ty Cổ phần Metis

Địa chỉ liên hệ: 26A Lê Đức Thọ, Phường Từ Liêm, Thành phố Hà Nội

Số giấy chứng nhận ĐKKD: 0109293202 cấp ngày 03/08/2020 tại Sở Kế hoạch và Đầu tư thành phố Hà Nội

Người quản lý và chịu trách nhiệm nội dung: Nguyễn Ngọc Sơn

Hotline: 0566.685.688

Email: [email protected]