Nội dung được dịch bởi AI, chỉ mang tính chất tham khảo

Mạng Mặt Dày: Một Trình Phát Hiện Mặt Dày Dựa Trên Ngữ Cảnh Toàn Cục Và Cơ Chế Chú Ý Thị Giác

Springer Science and Business Media LLC - Tập 19 - Trang 247-256 - 2022

Lin Song¹, Jin-Fu Yang^1,2, Qing-Zhen Shang¹, Ming-Ai Li¹

¹Faculty of Information Technology, Beijing University of Technology, Beijing, China

²Beijing Key Laboratory of Computational Intelligence and Intelligent Systems, Beijing, China

Tóm tắt

Phát hiện khuôn mặt đã đạt được nhiều bước tiến lớn nhờ vào các mạng nơ-ron tích chập. Tuy nhiên, phát hiện khuôn mặt dày vẫn là một thách thức đang mở do sự biến đổi quy mô khuôn mặt lớn, khuôn mặt nhỏ và sự che khuất nghiêm trọng. Bài báo này trình bày một trình phát hiện khuôn mặt dày mạnh mẽ sử dụng ngữ cảnh toàn cục và các cơ chế chú ý thị giác có thể cải thiện đáng kể độ chính xác của việc phát hiện. Cụ thể, một mô-đun kết hợp ngữ cảnh toàn cục với phản hồi từ trên xuống được đề xuất nhằm cải thiện khả năng nhận diện các khuôn mặt nhỏ. Hơn nữa, một cơ chế chú ý thị giác được áp dụng để giải quyết vấn đề che khuất. Kết quả thực nghiệm trên các tập dữ liệu khuôn mặt công khai WIDER FACE và FDDB cho thấy hiệu quả của phương pháp được đề xuất.

Từ khóa

#Phát hiện khuôn mặt #mạng nơ-ron tích chập #ngữ cảnh toàn cục #chú ý thị giác #che khuất

Tài liệu tham khảo

F. Schroff, D. Kalenichenko, J. Philbin. FaceNet: A unified embedding for face recognition and clustering. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Boston, USA, pp. 815–823, 2015. DOI: https://doi.org/10.1109/CVPR.2015.7298682. F. F. Zhang, T. Z. Zhang, Q. R. Mao, C. S. Xu. Joint pose and expression modeling for facial expression recognition. In Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, IEEE, Salt Lake City, USA, pp. 3359–3368, 2018. DOI: https://doi.org/10.1109/CVPR.2018.00354. K. P. Zhang, Z. P. Zhang, Z. F. Li, Y. Qiao. Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Processing Letters, vol. 23, no. 10, pp. 1499–1503, 2016. DOI: https://doi.org/10.1109/LSP.2016.2603342. P. Y. Hu, D. Ramanan. Finding tiny faces. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Honolulu, USA, pp. 1522–1530, 2017. DOI: https://doi.org/10.1109/CVPR.2017.166. M. Najibi, P. Samangouei, R. Chellappa, L. S. Davis. SSH: Single stage headless face detector. In Proceedings of IEEE International Conference on Computer Vision, IEEE, Venice, Italy, pp. 4885–4894, 2017. DOI: https://doi.org/10.1109/ICCV.2017.522. J. K. Deng, J. Guo, Y. X. Zhou, J. K. Yu, I. Kotsia, S. Zafeiriou. RetinaFace: Single-stage dense face localisation in the wild. [Online], Available: https://arxiv.org/abs/1905.00641, 2019. T. Y. Lin, P. Goyal, R. Girshick, K. M. He, P. Dollár. Focal loss for dense object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 2, pp. 318–327, 2020. DOI: https://doi.org/10.1109/TPAMI.2018.2858826. V. Jain, E. Learned-Miller. FDDB: A Benchmark for Face Detection in Unconstrained Settings, Technical Report UM-CS-2010–009, University of Massachusetts, USA, 2010. S. Yang, P. Luo, C. C. Loy, X. O. Tang. WIDER FACE: A face detection benchmark. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Las Vegas, USA, pp. 5525–5533, 2016. DOI: https://doi.org/10.1109/CV-PR.2016.596. C. C. Zhu, Y. T. Zheng, K. Luu, M. Savvides. CMS-RCNN: Contextual multi-scale region-based CNN for unconstrained face detection. Deep Learning for Biometrics, B. Bhanu, A. Kumar, Eds., Cham, Germany: Springer, pp. 57–79, 2017. DOI: https://doi.org/10.1007/978-3-319-61657-5_3. S. Q. Ren, K. M. He, R. Girshick, J. Sun. Faster R-CNN: Towards real-time object detection with region proposal networks. In Proceedings of the 28th International Conference on Neural Information Processing Systems, Montreal, Canada, pp. 91–99, 2015. DOI: https://doi.org/10.5555/2969239.2969250. T. Xu, D. K. Du, Z. Q. He, J. T. Liu. PyramidBox: A context-assisted single shot face detector. In Proceedings of the 15th European Conference on Computer Vision, Springer, Munich, Germany, pp. 812–828, 2018. DOI: https://doi.org/10.1007/978-3-030-01240-3_49. S. F. Zhang, X. Y. Zhu, Z. Lei, H. L. Shi, X. B. Wang, S. Z. Li. S.3FD: Single shot scale-invariant face detector. In Proceedings of IEEE International Conference on Computer Vision, IEEE, Venice, Italy, pp. 192–201, 2017. DOI: https://doi.org/10.1109/ICCV.2017.30. M. Jaderberg, K. Simonyan, A. Zisserman, K. Kavukcuoglu. Spatial transformer networks. In Proceedings of the 28th International Conference on Neural Information Processing Systems, Montreal, Canada, pp. 2017–2025, 2015. DOI: https://doi.org/10.5555/2969442.2969465. J. Hu, L. Shen, S. Albanie, G. Sun, E. H. Wu. Squeeze-and-excitation networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 8, pp. 2011–2023, 2020. DOI: https://doi.org/10.1109/TPAMI.2019.2913372. S. Woo, J. Park, J. Y. Lee, I. S. Kweon. CBAM: Convolutional block attention module. In Proceedings of the 15th European Conference on Computer Vision, Springer, Munich, Germany, pp. 3–19, 2018. DOI: https://doi.org/10.1007/978-3-030-01234-21. J. F. Wang, Y. Yuan, G. Yu. Face attention network: An effective face detector for the occluded faces. [Online], Available: https://arxiv.org/abs/1711.07246, 2017. A. G. Howard, M. L. Zhu, B. Chen, D. Kalenichenko, W. J. Wang, T. Weyand, M. Andreetto, H. Adam. MobileNets: Efficient convolutional neural networks for mobile vision applications. [Online], Available: https://arxiv.org/abs/1704.04861, 2017. K. M. He, X. Y. Zhang, S. Q. Ren, J. Sun. Deep residual learning for image recognition. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Las Vegas, USA, pp. 770–778, 2016. DOI: https://doi.org/10.1109/CVPR.2016.90. S. S. Farfade, M. J. Saberian, L. J. Li. Multi-view face detection using deep convolutional neural networks. In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, ACM, Shanghai, China, pp. 643–650, 2015. DOI: https://doi.org/10.1145/2671188.2749408. H. X. Li, Z. Lin, X. H. Shen, J. Brandt, G. Hua. A convolutional neural network cascade for face detection. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Boston, USA, pp. 5325–5334, 2015. DOI: https://doi.org/10.1109/CVPR.2015.7299170. R. Girshick. Fast R-CNN. In Proceedings of IEEE International Conference on Computer Vision, IEEE, Santiago, Chile, pp. 1440–1448, 2015. DOI: https://doi.org/10.1109/ICCV.2015.169. R. Ranjan, V. M. Patel, R. Chellappa. A deep pyramid deformable part model for face detection. In Proceedings of the 7th IEEE International Conference on Biometrics Theory, Applications and Systems, IEEE, Arlington, USA, pp. 1–8, 2015. DOI: https://doi.org/10.1109/BTAS.2015.7358755. J. H. Yu, Y. N. Jiang, Z. Y. Wang, Z. M. Cao, T. Huang. UnitBox: An advanced object detection network. In Proceedings of the 24th ACM International Conference on Multimedia, ACM, Amsterdam, The Netherlands, pp. 516–520, 2016. DOI: https://doi.org/10.1145/2964284.2967274.

Scholar Hub - Công cụ hỗ trợ trích dẫn và phân tích khoa học Việt Nam

Scholar Hub là công cụ hỗ trợ trích dẫn và phân tích ảnh hưởng của các bài báo, công bố khoa học Việt Nam và Quốc tế.
ScholarHub KHÔNG đăng thông tin tổng hợp, KHÔNG đăng lại nội dung từ các trang báo chí Việt Nam hoặc trang thông tin điện tử khác tại Việt Nam.

Thông tin, cập nhật

Đăng ký Tạp chí tham gia Scholar Hub

Phản hồi ý kiến về Scholar Hub

Bài viết, nội dung cập nhật

Chủ đề khoa học

Website liên kết

Hệ thống CSDL Khoa học & Công nghệ SciBase

Phần mềm kiểm tra trùng lặp Kiểm Tra Tài Liệu

Phần mềm xuất bản tạp chí điện tử VOJS

Hệ thống hội thảo khoa học Việt Nam

Nền tảng trắc nghiệm và đề thi đa lĩnh vực LetQA

Thông tin liên hệ & hỗ trợ

Đơn vị chủ quản, phát triển và vận hành: Công ty Cổ phần Metis

Địa chỉ liên hệ: 26A Lê Đức Thọ, Phường Từ Liêm, Thành phố Hà Nội

Số giấy chứng nhận ĐKKD: 0109293202 cấp ngày 03/08/2020 tại Sở Kế hoạch và Đầu tư thành phố Hà Nội

Người quản lý và chịu trách nhiệm nội dung: Nguyễn Ngọc Sơn

Hotline: 0566.685.688

Email: [email protected]