Nội dung được dịch bởi AI, chỉ mang tính chất tham khảo

Thuật toán tối thiểu bình phương mở rộng cho nhận diện khuôn mặt mạnh mẽ thông qua mẫu gương phụ

Soft Computing - Tập 20 - Trang 3177-3187 - 2015

Changbin Shao^1,2, Xiaoning Song^1,2, Xibei Yang², Xiaojun Wu¹

¹School of Internet of Things Engineering, Jiangnan University, Wuxi, People’s Republic of China

²School of Computer Science and Engineering, Jiangsu University of Science and Technology, Zhenjiang, People’s Republic of China

Tóm tắt

Sự thay đổi của hình ảnh khuôn mặt với các góc nhìn và ánh sáng phân cực làm tăng độ không chắc chắn trong việc nhận diện khuôn mặt. Trên thực tế, những mẫu gương tổng hợp có thể được công nhận như là đại diện cho sự lệch trái-phải của các tư thế hoặc ánh sáng của khuôn mặt. Hình ảnh khuôn mặt đối xứng được tạo ra từ hình ảnh khuôn mặt gốc cũng cung cấp thêm nhiều quan sát về khuôn mặt, điều này hữu ích để cải thiện độ chính xác của việc nhận diện khuôn mặt. Trong bài báo này, theo hiểu biết của chúng tôi, đây là lần đầu tiên thuật toán phân loại tối thiểu bình phương (MSEC) nổi tiếng được sử dụng để thực hiện nhận diện khuôn mặt trên một cơ sở dữ liệu khuôn mặt mở rộng bằng các mẫu gương tổng hợp, được gọi là phân loại tối thiểu bình phương mở rộng (EMSEC). Bằng cách điều chỉnh quy tắc phân loại MSE, chúng tôi đã thêm các mẫu gương vào tập huấn luyện để đạt được hiệu suất phân loại tốt hơn. Đầu tiên, chúng tôi kết hợp các mẫu huấn luyện gốc và các mẫu gương được tổng hợp từ các mẫu huấn luyện gốc theo từng người để tạo ra các mẫu huấn luyện hỗn hợp. Thứ hai, thuật toán EMSEC khai thác các mẫu huấn luyện hỗn hợp để đạt được ma trận chiếu có thể biến đổi tốt nhất các mẫu huấn luyện hỗn hợp thành các nhãn lớp được xác định trước. Thứ ba, ma trận chiếu được khai thác để đồng thời lấy kết quả biến đổi của mẫu thử và người hàng xóm gần nhất từ tập hợp mẫu huấn luyện hỗn hợp. Cuối cùng, chúng tôi cuối cùng phân loại mẫu thử bằng cách kết hợp các kết quả biến đổi của mẫu thử và người hàng xóm gần nhất. Như một sự mở rộng của MSEC, EMSEC giảm độ không chắc chắn của quan sát khuôn mặt bằng các mẫu gương phụ, do đó nó có hiệu suất phân loại mạnh mẽ hơn so với MSEC truyền thống. Kết quả thử nghiệm trên các cơ sở dữ liệu ORL, GT và FERET cho thấy EMSEC có khả năng tổng quát tốt hơn so với MSEC truyền thống.

Từ khóa

#nhận diện khuôn mặt #thuật toán tối thiểu bình phương #mẫu gương #độ chính xác #phân loại mạnh mẽ

Tài liệu tham khảo

Belhumeour PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces versus fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720 Belloni A, Chernozhukov V (2011) \(l_1 \)-penalized quantile regression in high-dimensional sparse models. Ann Stat 39(1):82–130 Beymer D, Poggio T (1995) Face recognition from one example view. In: Proceedings of the Fifth International Conference on Computer Vision, pp 500–507 Bickel PJ, Ritov Y, Tsybakov AB (2009) Simultaneous analysis of Lasso and Dantzig selector. Ann Stat 37:1705–1732 Billings SA, Lee KL (2002) Nonlinear Fisher discriminant analysis using a minimum squared error cost function and the orthogonal least squares algorithm. Neural Netw 15(1):263–270 Chen S, Hong X, Luk BL, Harris CJ (2009) Orthogonal-least-squares regression: a unified approach for data modelling. Neurocomputing 72(10–12):2670–2681 Dalton LA, Dougherty ER (2011) Bayesian minimum mean-square error estimation for classification error—Part I: definition and the Bayesian MMSE error estimator for discrete classification. IEEE Trans Signal Process 59(1):115–129 Dalton LA, Dougherty ER (2012) Exact sample conditioned MSE performance of the Bayesian MMSE estimator for classification error—Part I: representation. IEEE Trans Signal Process 60(5):2575–2587 Dalton LA, Dougherty ER (2012) Optimal mean-square-error calibration of classifier error estimators under Bayesian models. Pattern Recognit 45(6):2308–2320 Deng WH, Hu JN, Guo J, Cai WD, Feng DG (2010) Robust, accurate and efficient face recognition from a single training image: a uniform pursuit approach. Pattern Recognit 43:1748–1762 Deng W, Hu J, Guo J (2012) Extended SRC: undersampled face recognition via intraclass variant dictionary. IEEE Trans Pattern Anal Mach Intell 34(9):1864–1870 Duda RO, Hart PE, Stork DG (2001) Pattern classification, 2nd edn. Wiley, New York Hastie T, Park MY (2007) \(l_1 \)-Regularization path algorithm for generalized linear models. J R Stat Soc B 69:659–677 Hastie T, Tibshirani R, Friedman J et al (2005) The elements of statistical learning: data mining, inference and prediction. Math Intell 27(2):83–85 Jian M, Lam KM, Dong J (2011) Illumination compensation and enhancement for face recognition. In: Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit Conference (APSIPA ASC’2011), paper Wed-AM.RS6 Liu C, Wechsler H (2002) Gabor feature based classification using the enhanced fisher linear discriminant model for face recognition. IEEE Trans Image Process 11(4):467–476 Liu XM, Chen T, Kumar BVKV (2003) Face authentication for multiple subjects using eigenflow[J]. Pattern Recognit 36(2):313–328 Markovsky I, Huffel SV (2007) Overview of total least-squares methods. Signal Process 87(10):2283–2302 Martinez AM, Benavente R (1998) The AR face database. CVC Technical Report #24 Naseem I, Togneri R, Bennamoun M (2010) Linear regression for face recognition. IEEE Trans Pattern Anal Mach Intell 32(11):2106–2112 Saber E, Murat A (1998) Tekalp, Frontal-view face detection and facial feature extraction using color, shape and symmetry based cost functions. Pattern Recognit Lett 19(8):669–680 Sharma A, Dubey A, Tripathi P, Kumar V (2010) Pose invariant virtual classifiers from single training image using novel hybrid-eigenfaces. Neurocomputing 73(10–12):1868–1880 Song XN, Zheng YJ, Wu XJ, Yang XB, Yang JY (2010) A complete fuzzy discriminant analysis approach for face recognition. Appl Soft Comput 10:208–214 Swets DL, Weng JJ (1996) Using discriminant eigenfeatures for image retrieval. IEEE Trans Pattern Anal Mach Intell 18(8):831–836 Tan X, Chen S, Zhou Z-H, Zhang F (2006) Face recognition from a single image per person: a survey. Pattern Recognit 39(9):1725–1745 Tang D, Zhu N, Yu F, Chen W, Tang T (2014) A novel sparse representation method based on virtual samples for face recognition. Neural Comput Appl 24(3–4):513–519 Turk M, Pentland A (1991) Eigenfaces for recognition. J Cogn Neurosci 3(1):71–86 Vetter T (1998) Synthesis of novel views from a single face image. Int J Comput Vis 28(2):102–116 Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227 Wright J, Ma Y, Mairal J, Sapiro G (2010) Sparse representation for computer vision and pattern recognition. Proc IEEE 98(6):1031–1044 Xu X, He H, Hu D (2002) Efficient reinforcement learning using recursive least-squares methods. J Artif Intell Res 16:259–292 Xu Y, Yang J-Y, Yang J (2004) A reformative kernel Fisher discriminant analysis. Pattern Recognit 37(6):1299–1302 Xu Y, Zhang D, Jin Z, Li M, Yang JY (2006) A fast kernel-based nonlinear discriminant analysis for multi-class problems. Pattern Recognit 39(6):1026–1033 Xu Y, Zhang D, Yang J, Yang JY (2011) A two-phase test sample sparse representation method for use with face recognition. IEEE Trans Circuits Syst Video Technol 21(9):1255–1262 Xu Y, Zhu X, Li Z, Liu G et al (2013) Using the original and ‘symmetrical face’ training samples to perform representation based two-step face recognition. Pattern Recognit 46(4):1151–1158 Xu Y, Fang XZ, Zhu Q, Chen Y, You J, Liu H (2014) Modified minimum squared error algorithm for robust classification and face recognition experiments. Neurocomputing 135:253–261 Xu Y, Fang XZ, Li XL, Yang J, You J, Liu H, Teng SH (2014) Data uncertainty in face recognition. IEEE Trans Cybern 44(10):1950–1961 Xu Y, Li XL, Yang J, Zhang D (2014) Integrate the original face image and its mirror image for face recognition. Neurocomputing 131:191–199 Xu Y, Jin Z (2008) Down-sampling face images and low-resolution face recognition. In: Proc. 3rd Int. Conf. Innovative Comput. Inform. Control, pp 392–395 Xu J, Zhang X, Li Y (2001) Kernel MSEC algorithm: a unified framework for KFD, LS-SVM and KRR. In: Proceedings of the International Joint Conference on Neural Networks, pp 1486–14 91 Yang J, Yang JY (2003) Why can LDA be performed in PCA transformed space. Pattern Recognit 36(2):563–566 Yang J, Zhang D, Frangi AF, Yang JY (2004) Two-dimensional PCA: a new approach to appearance-based face representation and recognition. IEEE Trans Pattern Anal Mach Intell 26(1):131–137 Yao G, Ding R (2012) Two-stage least squares based iterative identification algorithm for controlled autoregressive moving average (CARMA) systems. Comput Math Appl 63(5):975–984 Zhang L, et al (2011) Sparse representation or collaborative representation: which helps face recognition? In: Proceedings of the ICCV Zhao Y-P, Du Z-H, Zhang Z-A, Zhang H-B (2011) A fast method of feature extraction for kernel MSE. Neurocomputing 74(10):1654–1663 Zhu Q (2010) Reformative nonlinear feature extraction using kernel MSE. Neurocomputing 73(16–18):3334–3337

Scholar Hub - Công cụ hỗ trợ trích dẫn và phân tích khoa học Việt Nam

Về chúng tôi

Scholar Hub là công cụ hỗ trợ trích dẫn và phân tích các bài báo, công bố khoa học Việt Nam. Công cụ trợ giúp người nghiên cứu, tạp chí, đơn vị nghiên cứu tra cứu, phân tích và thống kê dữ liệu nghiên cứu khoa học tại Việt Nam và quốc tế.
ScholarHub KHÔNG đăng thông tin tổng hợp, KHÔNG đăng lại nội dung từ các trang báo chí Việt Nam hoặc trang thông tin điện tử khác tại Việt Nam.

Thông tin, cập nhật

Đăng ký Tạp chí tham gia vào Scholar Hub

Phản hồi ý kiến về Scholar Hub

Bài viết, nội dung cập nhật

Chủ đề khoa học

Website liên kết

Hệ thống CSDL Khoa học & Công nghệ

Phần mềm kiểm tra trùng lặp Kiểm Tra Tài Liệu

Phần mềm xuất bản tạp chí điện tử VOJS

Nền tảng trắc nghiệm và đề thi đa lĩnh vực LetQA