Parametric classification with soft labels using the evidential EM algorithm: linear discriminant analysis versus logistic regression

Advances in Data Analysis and Classification - Tập 11 Số 4 - Trang 659-690 - 2017
Benjamin Quost1, Thierry Denœux1, Shoumei Li2
1CNRS, Heudiasyc UMR 7253, Sorbonne Universités, Université de Technologie de Compiègne, Compiègne, France
2College of Applied Sciences, Beijing University of Technology, Beijing, China

Tóm tắt

Từ khóa


Tài liệu tham khảo

Abassi L, Boukhris I (2016) Crowd label aggregation under a belief function framework. In: Lehner F, Fteimi N (eds) Proceedings of 9th international conference on knowledge science, engineering and management, KSEM 2016, Passau, Germany, 5–7 Oct 2016. Springer, Cham, pp 185–196

Bishop CM (2006) Pattern recognition and machine learning. Springer, Berlin

Chapelle O, Schölkopf B, Zien A (2006) Semi-supervised learning. MIT Press, Cambridge

Cherfi ZL, Oukhellou L, Côme E, Denœux T, Aknin P (2012) Partially supervised independent factor analysis using soft labels elicited from multiple experts: application to railway track circuit diagnosis. Soft Comput 16(5):741–754

Côme E, Oukhellou L, Denœux T, Aknin P (2009) Learning from partially supervised data using mixture models and belief functions. Patt Recognit 42(3):334–348

Cour T, Sapp B, Taskar B (2011) Learning from partial labels. J Mach Learn Res 12:1225–1261

Couso I, Dubois D (2017) Maximum likelihood under incomplete information: toward a comparison of criteria. In: Ferraro MB, Giordani P, Vantaggi B, Gagolewski M, Gil M Ángeles, Grzegorzewski P, Hryniewicz O (eds) Soft methods for data science. Springer, Cham, pp 141–148

Dempster AP (1967) Upper and lower probabilities induced by a multivalued mapping. Ann Math Stat 38:325–339

Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc B 39:1–38

Denœux T (1995) A $$k$$ k -nearest neighbor classification rule based on Dempster–Shafer theory. IEEE Trans Syst Man Cybern 25(05):804–813

Denœux T (2013) Maximum likelihood estimation from uncertain data in the belief function framework. IEEE Trans Knowl Data Eng 25(1):119–130

Denœux T (2014) Likelihood-based belief function: justification and some extensions to low-quality data. Int J Approx Reason 55(7):1535–1547

Denoeux T, Kanjanatarakul O (2016) Beyond fuzzy, possibilistic and rough: an investigation of belief functions in clustering. In: Proceedings of the 8th international conference on soft methods in probability and statistics SMPS 2016, soft methods for data science, advances in intelligent and soft computing, AISC, vol 456. Springer, Rome, Italy, pp 157–164

Denœux T, Masson MH (2004) EVCLUS: evidential clustering of proximity data. IEEE Trans Syst Man Cybern B 34(1):95–109

Denœux T, Skarstein-Bjanger M (2000) Induction of decision trees for partially classified data. In: Proceedings of SMC’2000. IEEE, Nashville, TN, pp 2923–2928

Denœux T, Zouhal LM (2001) Handling possibilistic labels in pattern classification using evidential reasoning. Fuzzy Sets Syst 122(3):47–62

Denœux T, Sriboonchitta S, Kanjanatarakul O (2016) Evidential clustering of large dissimilarity data. Knowl Based Syst 106:179–195

Dubuisson S, Davoine F, Masson MH (2002) A solution for facial expression representation and recognition. Signal Process Image Commun 17(9):657–673

Elouedi Z, Mellouli K, Smets P (2001) Belief decision trees: theoretical foundations. Int J Approx Reason 28:91–124

Hasan A, Wang Z, Mahani A (2016) Fast estimation of multinomial logit models: R package mnlogit. J Stat Softw 75(1):1–24

Heitjan DF, Rubin DB (1991) Ignorability and coarse data. Ann Stat 19(4):2244–2253

Hüllermeier E (2014) Learning from imprecise and fuzzy observations: data disambiguation through generalized loss minimization. Int J Approx Reason 55(7):1519–1534

Hüllermeier E, Beringer J (2005) Learning from ambiguously labeled examples. In: Proceedings of the 6th international symposium on intelligent data analysis (IDA-05), Madrid, Spain

Jaffray JY (1989) Linear utility theory for belief functions. Oper Res Lett 8(2):107–112

Kanade T, Cohn J, Tian Y (2000) Comprehensive database for facial expression analysis. In: Proceedings of the fourth international conference of face and gesture recognition, Grenoble, France, pp 46–53

Li J (2013) Logistic regression. Course notes. http://sites.stat.psu.edu/~jiali/course/stat597e/notes2/logit.pdf

Liu ZG, Pan Q, Dezert J, Mercier G (2015) Credal c-means clustering method based on belief functions. Knowl Based Syst 74:119–132

Liu ZG, Pan Q, Dezert J, Mercier G (2017) Hybrid classification system for uncertain data. IEEE Trans Syst Man Cybern Syst (in press). https://doi.org/10.1109/TSMC.2016.2622247

Ma L, Destercke S, Wang Y (2016) Online active learning of decision trees with evidential data. Patt Recognit 52:33–45

Mardia KV (1970) Measures of multivariate skewness and kurtosis with applications. Biometrika 57(3):519–530

McLachlan GJ, Krishnan T (1997) The EM algorithm and extensions. Wiley, New York

McLachlan GJ, Peel D (2000) Finite mixture models. Wiley, New York

Nguyen N, Caruana R (2008) Classification with partial labels. In: Proceedings of the 14th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’08. ACM, New York, NY, USA, pp 551–559

Peters G, Crespo F, Lingras P, Weber R (2013) Soft clustering: fuzzy and rough approaches and their extensions and derivatives. Int J Approx Reason 54(2):307–322

Press SJ, Wilson S (1978) Choosing between logistic regression and discriminant analysis. J Am Stat Assoc 73(364):699–705

Quost B (2014) Logistic regression of soft labeled instances via the evidential EM algorithm. In: Cuzzolin F (ed) Proceedings of the third international conference on belief functions: theory and applications, BELIEF 2014. Oxford, UK, 26–28 Sept 2014. Springer, Cham, pp 77–86

Quost B, Denoeux T (2016) Clustering and classification of fuzzy data using the fuzzy EM algorithm. Fuzzy Sets Syst 286:134–156

Ramasso E, Denœux T (2013) Making use of partial knowledge about hidden states in HMMs: an approach based on belief functions. IEEE Trans Fuzzy Syst 21(6):1–11

Richard C (1998) Une méthodologie pour la détection à structure imposée. applications au plan temps-fréquence. Ph.D. thesis, Université de Technologie de Compiègne

Richard C, Lengellé R (1999) Data driven design and complexity control of time-frequency detectors. Sig Process 77:37–48

Rjab AB, Kharoune M, Miklos Z, Martin A (2016) Characterization of experts in crowdsourcing platforms. In: Vejnarová J, Kratochvíl V (eds) Proceedings of 4th international conference on belief functions: theory and applications, BELIEF 2016, Prague, Czech Republic, 21–23 Sept 2016. Springer, Cham, pp 97–104

Shafer G (1976) A mathematical theory of evidence. Princeton University Press, Princeton

Strat TM (1990) Decision analysis using belief functions. Int J Approx Reason 4(5–6):391–417

Sutton-Charani N, Destercke S, Denoeux T (2013) Learning decision trees from uncertain data with an evidential EM approach. In: 12th international conference on machine learning and applications, 2013, vol 1, pp 111–116

Sutton-Charani N, Destercke S, Denœux T (2014) Training and evaluating classifiers from evidential data: application to E2M decision tree pruning. In: Cuzzolin F (ed) Proceedings of the third international conference on belief functions: theory and applications, BELIEF 2014. Oxford, UK, 26–28 Sept 2014. Springer, Cham, pp 87–94

Trabelsi S, Elouedi Z, Mellouli K (2007) Pruning belief decision tree methods in averaging and conjunctive approaches. Int J Approx Reason 46(3):568–595

Zadeh LA (1978) Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets Syst 1:3–28

Zhou K, Martin A, Pan Q (2014) Evidential-EM algorithm applied to progressively censored observations. In: Laurent A, Strauss O, Bouchon-Meunier B, Yager RR (eds) Proceedings of 15th international conference on information processing and management of uncertainty in knowledge-based systems, IPMU 2014, Montpellier, France, Part III, 15–19 July 2014. Springer, Cham, pp 180–189