Feature Selection
Tóm tắt
Feature selection, as a data preprocessing strategy, has been proven to be effective and efficient in preparing data (especially high-dimensional data) for various data-mining and machine-learning problems. The objectives of feature selection include building simpler and more comprehensible models, improving data-mining performance, and preparing clean, understandable data. The recent proliferation of big data has presented some substantial challenges and opportunities to feature selection. In this survey, we provide a comprehensive and structured overview of recent advances in feature selection research. Motivated by current challenges and opportunities in the era of big data, we revisit feature selection research from a data perspective and review representative feature selection algorithms for conventional data, structured data, heterogeneous data and streaming data. Methodologically, to emphasize the differences and similarities of most existing feature selection algorithms for conventional data, we categorize them into four main groups: similarity-based, information-theoretical-based, sparse-learning-based, and statistical-based methods. To facilitate and promote the research in this community, we also present an open source feature selection repository that consists of most of the popular feature selection algorithms (http://featureselection.asu.edu/). Also, we use it as an example to show how to evaluate feature selection algorithms. At the end of the survey, we present a discussion about some open problems and challenges that require more attention in future research.
Từ khóa
Tài liệu tham khảo
Edoardo M. Airoldi , David M. Blei , Stephen E. Fienberg , and Eric P . Xing . 2009 . Mixed membership stochastic blockmodels. In NIPS. 33--40. Edoardo M. Airoldi, David M. Blei, Stephen E. Fienberg, and Eric P. Xing. 2009. Mixed membership stochastic blockmodels. In NIPS. 33--40.
Salem Alelyani , Jiliang Tang , and Huan Liu . 2013. Feature selection for clustering: A review. Data Clustering: Algorithms and Applications 29 ( 2013 ). Salem Alelyani, Jiliang Tang, and Huan Liu. 2013. Feature selection for clustering: A review. Data Clustering: Algorithms and Applications 29 (2013).
Hiromasa Arai Crystal Maung Ke Xu and Haim Schweitzer. 2016. Unsupervised feature selection by heuristic search with provable bounds on suboptimality. In AAAI. 666--672. Hiromasa Arai Crystal Maung Ke Xu and Haim Schweitzer. 2016. Unsupervised feature selection by heuristic search with provable bounds on suboptimality. In AAAI. 666--672.
Mustafa Bilgic Lilyana Mihalkova and Lise Getoor. 2010. Active learning for networked data. In ICML. 79--86. Mustafa Bilgic Lilyana Mihalkova and Lise Getoor. 2010. Active learning for networked data. In ICML. 79--86.
Stephen Boyd and Lieven Vandenberghe . 2004. Convex Optimization . Cambridge University Press . Stephen Boyd and Lieven Vandenberghe. 2004. Convex Optimization. Cambridge University Press.
Gavin Brown , Adam Pocock , Ming-Jie Zhao , and Mikel Luján . 2012 . Conditional likelihood maximisation: A unifying framework for information-theoretic feature selection . J. Mach. Learn. Res. 13 , 1 (2012), 27 -- 66 . Gavin Brown, Adam Pocock, Ming-Jie Zhao, and Mikel Luján. 2012. Conditional likelihood maximisation: A unifying framework for information-theoretic feature selection. J. Mach. Learn. Res. 13, 1 (2012), 27--66.
Xiao Cai Feiping Nie and Heng Huang. 2013. Exact top-k feature selection via ℓ<sub>2 0</sub>-norm constraint. In IJCAI. 1240--1246. Xiao Cai Feiping Nie and Heng Huang. 2013. Exact top-k feature selection via ℓ<sub>2 0</sub>-norm constraint. In IJCAI. 1240--1246.
Xiaojun Chang Feiping Nie Yi Yang and Heng Huang. 2014. A convex formulation for semi-supervised multi-label feature selection. In AAAI. 1171--1177. Xiaojun Chang Feiping Nie Yi Yang and Heng Huang. 2014. A convex formulation for semi-supervised multi-label feature selection. In AAAI. 1171--1177.
John C. Davis and Robert J . Sampson . 1986 . Statistics and Data Analysis in Geology. Vol. 646 . Wiley . New York. John C. Davis and Robert J. Sampson. 1986. Statistics and Data Analysis in Geology. Vol. 646. Wiley. New York.
Liang Du Zhiyong Shen Xuan Li Peng Zhou and Yi-Dong Shen. 2013. Local and global discriminative learning for unsupervised feature selection. In ICDM. 131--140. Liang Du Zhiyong Shen Xuan Li Peng Zhou and Yi-Dong Shen. 2013. Local and global discriminative learning for unsupervised feature selection. In ICDM. 131--140.
Richard O. Duda , Peter E. Hart , and David G . Stork . 2012 . Pattern Classification. John Wiley 8 Sons. Richard O. Duda, Peter E. Hart, and David G. Stork. 2012. Pattern Classification. John Wiley 8 Sons.
Ali El Akadi , Abdeljalil El Ouardighi , and Driss Aboutajdine . 2008 . A powerful feature selection approach based on mutual information . Int. J. Comput. Sci. Netw. Secur. 8 , 4 (2008), 116 . Ali El Akadi, Abdeljalil El Ouardighi, and Driss Aboutajdine. 2008. A powerful feature selection approach based on mutual information. Int. J. Comput. Sci. Netw. Secur. 8, 4 (2008), 116.
Christiane Fellbaum. 1998. WordNet. Wiley Online Library. Christiane Fellbaum. 1998. WordNet. Wiley Online Library.
Jerome Friedman , Trevor Hastie , and Robert Tibshirani . 2010. A note on the group lasso and a sparse group lasso. arXiv preprint arXiv:1001.0736 ( 2010 ). Jerome Friedman, Trevor Hastie, and Robert Tibshirani. 2010. A note on the group lasso and a sparse group lasso. arXiv preprint arXiv:1001.0736 (2010).
Keinosuke Fukunaga . 2013. Introduction to Statistical Pattern Recognition . Academic Press . Keinosuke Fukunaga. 2013. Introduction to Statistical Pattern Recognition. Academic Press.
Shuyang Gao , Greg Ver Steeg, and Aram Galstyan . 2016 . Variational information maximization for feature selection. In NIPS. 487--495. Shuyang Gao, Greg Ver Steeg, and Aram Galstyan. 2016. Variational information maximization for feature selection. In NIPS. 487--495.
C. W. Gini . 1912. Variability and mutability, contribution to the study of statistical distribution and relaitons. Studi Economico-Giuricici Della R ( 1912 ). C. W. Gini. 1912. Variability and mutability, contribution to the study of statistical distribution and relaitons. Studi Economico-Giuricici Della R (1912).
David E. Golberg . 1989. Genetic algorithms in search, optimization, and machine learning . Addison-Wesley . David E. Golberg. 1989. Genetic algorithms in search, optimization, and machine learning. Addison-Wesley.
Quanquan Gu Marina Danilevsky Zhenhui Li and Jiawei Han. 2012. Locality preserving feature learning. In AISTATS. 477--485. Quanquan Gu Marina Danilevsky Zhenhui Li and Jiawei Han. 2012. Locality preserving feature learning. In AISTATS. 477--485.
Quanquan Gu Zhenhui Li and Jiawei Han. 2011b. Generalized fisher score for feature selection. In UAI. 266--273. Quanquan Gu Zhenhui Li and Jiawei Han. 2011b. Generalized fisher score for feature selection. In UAI. 266--273.
Quanquan Gu Zhenhui Li and Jiawei Han. 2011c. Joint feature selection and subspace learning. In IJCAI. 1294--1299. Quanquan Gu Zhenhui Li and Jiawei Han. 2011c. Joint feature selection and subspace learning. In IJCAI. 1294--1299.
Isabelle Guyon , Steve Gunn , Masoud Nikravesh , and Lofti A Zadeh . 2008 . Feature Extraction: Foundations and Applications . Springer . Isabelle Guyon, Steve Gunn, Masoud Nikravesh, and Lofti A Zadeh. 2008. Feature Extraction: Foundations and Applications. Springer.
Mark A. Hall and Lloyd A . Smith . 1999 . Feature selection for machine learning: Comparing a correlation-based filter approach to the wrapper. In FLAIRS. 235--239. Mark A. Hall and Lloyd A. Smith. 1999. Feature selection for machine learning: Comparing a correlation-based filter approach to the wrapper. In FLAIRS. 235--239.
Satoshi Hara and Takanori Maehara. 2017. Enumerate lasso solutions for feature selection. In AAAI. 1985--1991. Satoshi Hara and Takanori Maehara. 2017. Enumerate lasso solutions for feature selection. In AAAI. 1985--1991.
Xiaofei He Deng Cai and Partha Niyogi. 2005. Laplacian score for feature selection. In NIPS. 507--514. Xiaofei He Deng Cai and Partha Niyogi. 2005. Laplacian score for feature selection. In NIPS. 507--514.
Chenping Hou Feiping Nie Dongyun Yi and Yi Wu. 2011. Feature selection via joint embedding learning and sparse regression. In IJCAI. 1324--1329. Chenping Hou Feiping Nie Dongyun Yi and Yi Wu. 2011. Feature selection via joint embedding learning and sparse regression. In IJCAI. 1324--1329.
Xia Hu Jiliang Tang Huiji Gao and Huan Liu. 2013. ActNeT: Active learning for networked texts in microblogging. In SDM. 306--314. Xia Hu Jiliang Tang Huiji Gao and Huan Liu. 2013. ActNeT: Active learning for networked texts in microblogging. In SDM. 306--314.
Rodolphe Jenatton , Julien Mairal , Francis R. Bach , and Guillaume R . Obozinski . 2010 . Proximal methods for sparse hierarchical dictionary learning. In ICML. 487--494. Rodolphe Jenatton, Julien Mairal, Francis R. Bach, and Guillaume R. Obozinski. 2010. Proximal methods for sparse hierarchical dictionary learning. In ICML. 487--494.
Ling Jian Jundong Li Kai Shu and Huan Liu. 2016. Multi-label informed feature selection. In IJCAI. 1627--1633. Ling Jian Jundong Li Kai Shu and Huan Liu. 2016. Multi-label informed feature selection. In IJCAI. 1627--1633.
Yi Jiang and Jiangtao Ren. 2011. Eigenvalue sensitive feature selection. In ICML. 89--96. Yi Jiang and Jiangtao Ren. 2011. Eigenvalue sensitive feature selection. In ICML. 89--96.
Seyoung Kim and Eric P Xing . 2009. Statistical estimation of correlated genome associations to a quantitative trait network. PLoS Genet. 5, 8 ( 2009 ). Seyoung Kim and Eric P Xing. 2009. Statistical estimation of correlated genome associations to a quantitative trait network. PLoS Genet. 5, 8 (2009).
Seyoung Kim and Eric P Xing. 2010. Tree-guided group lasso for multi-task regression with structured sparsity. In ICML. 543--550. Seyoung Kim and Eric P Xing. 2010. Tree-guided group lasso for multi-task regression with structured sparsity. In ICML. 543--550.
Kenji Kira and Larry A. Rendell . 1992. A practical approach to feature selection . In ICML Workshop. 249--256 . Kenji Kira and Larry A. Rendell. 1992. A practical approach to feature selection. In ICML Workshop. 249--256.
Daphne Koller and Mehran Sahami. 1995. Toward optimal feature selection. In ICML. 284--292. Daphne Koller and Mehran Sahami. 1995. Toward optimal feature selection. In ICML. 284--292.
Jundong Li , Harsh Dani , Xia Hu , and Huan Liu . 2017 . Radar: Residual analysis for anomaly detection in attributed networks. In IJCAI. 2152--2158. Jundong Li, Harsh Dani, Xia Hu, and Huan Liu. 2017. Radar: Residual analysis for anomaly detection in attributed networks. In IJCAI. 2152--2158.
Jundong Li Xia Hu Ling Jian and Huan Liu. 2016. Toward time-evolving feature selection on dynamic networks. In ICDM. 1003--1008. Jundong Li Xia Hu Ling Jian and Huan Liu. 2016. Toward time-evolving feature selection on dynamic networks. In ICDM. 1003--1008.
Jundong Li Xia Hu Liang Wu and Huan Liu. 2016. Robust unsupervised feature selection on networked data. In SDM. 387--395. Jundong Li Xia Hu Liang Wu and Huan Liu. 2016. Robust unsupervised feature selection on networked data. In SDM. 387--395.
Jundong Li Jiliang Tang and Huan Liu. 2017a. Reconstruction-based unsupervised feature selection: An embedded approach. In IJCAI. 2159--2165. Jundong Li Jiliang Tang and Huan Liu. 2017a. Reconstruction-based unsupervised feature selection: An embedded approach. In IJCAI. 2159--2165.
Jundong Li Liang Wu Osmar R. Zaïane and Huan Liu. 2017b. Toward personalized relational learning. In SDM. 444--452. Jundong Li Liang Wu Osmar R. Zaïane and Huan Liu. 2017b. Toward personalized relational learning. In SDM. 444--452.
Yifeng Li , Chih-Yu Chen , and Wyeth W . Wasserman . 2015 . Deep feature selection: Theory and application to identify enhancers and promoters. In RECOMB. 205--217. Yifeng Li, Chih-Yu Chen, and Wyeth W. Wasserman. 2015. Deep feature selection: Theory and application to identify enhancers and promoters. In RECOMB. 205--217.
Zechao Li Yi Yang Jing Liu Xiaofang Zhou and Hanqing Lu. 2012. Unsupervised feature selection using nonnegative spectral analysis. In AAAI. 1026--1032. Zechao Li Yi Yang Jing Liu Xiaofang Zhou and Hanqing Lu. 2012. Unsupervised feature selection using nonnegative spectral analysis. In AAAI. 1026--1032.
Hongfu Liu Haiyi Mao and Yun Fu. 2016a. Robust multi-view feature selection. In ICDM. 281--290. Hongfu Liu Haiyi Mao and Yun Fu. 2016a. Robust multi-view feature selection. In ICDM. 281--290.
Huan Liu and Hiroshi Motoda . 2007. Computational Methods of Feature Selection . CRC Press . Huan Liu and Hiroshi Motoda. 2007. Computational Methods of Feature Selection. CRC Press.
Huan Liu and Rudy Setiono. 1995. Chi2: Feature selection and discretization of numeric attributes. In ICTAI. 388--391. Huan Liu and Rudy Setiono. 1995. Chi2: Feature selection and discretization of numeric attributes. In ICTAI. 388--391.
Hongfu Liu Ming Shao and Yun Fu. 2016b. Consensus guided unsupervised feature selection. In AAAI. 1874--1880. Hongfu Liu Ming Shao and Yun Fu. 2016b. Consensus guided unsupervised feature selection. In AAAI. 1874--1880.
Jun Liu Shuiwang Ji and Jieping Ye. 2009a. Multi-task feature learning via efficient ℓ<sub>2 0</sub>-norm minimization. In UAI. 339--348. Jun Liu Shuiwang Ji and Jieping Ye. 2009a. Multi-task feature learning via efficient ℓ<sub>2 0</sub>-norm minimization. In UAI. 339--348.
Jun Liu , Shuiwang Ji , and Jieping Ye . 2009 b. SLEP: Sparse Learning with Efficient Projections . Arizona State University . Retrieved from http://www.public.asu.edu/∼jye02/Software/SLEP. Jun Liu, Shuiwang Ji, and Jieping Ye. 2009b. SLEP: Sparse Learning with Efficient Projections. Arizona State University. Retrieved from http://www.public.asu.edu/∼jye02/Software/SLEP.
Jun Liu and Jieping Ye. 2010. Moreau-Yosida regularization for grouped tree structure learning. In NIPS. 1459--1467. Jun Liu and Jieping Ye. 2010. Moreau-Yosida regularization for grouped tree structure learning. In NIPS. 1459--1467.
Mahdokht Masaeli , Yan Yan , Ying Cui , Glenn Fung , and Jennifer G . Dy . 2010 . Convex principal feature selection. In SDM. 619--628. Mahdokht Masaeli, Yan Yan, Ying Cui, Glenn Fung, and Jennifer G. Dy. 2010. Convex principal feature selection. In SDM. 619--628.
Crystal Maung and Haim Schweitzer. 2013. Pass-efficient unsupervised feature selection. In NIPS. 1628--1636. Crystal Maung and Haim Schweitzer. 2013. Pass-efficient unsupervised feature selection. In NIPS. 1628--1636.
Miller McPherson Lynn Smith-Lovin and James M Cook. 2001. Birds of a feather: Homophily in social networks. Ann. Rev. Sociol. (2001) 415--444. Miller McPherson Lynn Smith-Lovin and James M Cook. 2001. Birds of a feather: Homophily in social networks. Ann. Rev. Sociol. (2001) 415--444.
Feiping Nie Heng Huang Xiao Cai and Chris H Ding. 2010. Efficient and robust feature selection via joint -norms minimization. In NIPS. 1813--1821. Feiping Nie Heng Huang Xiao Cai and Chris H Ding. 2010. Efficient and robust feature selection via joint -norms minimization. In NIPS. 1813--1821.
Feiping Nie Shiming Xiang Yangqing Jia Changshui Zhang and Shuicheng Yan. 2008. Trace ratio criterion for feature selection. In AAAI. 671--676. Feiping Nie Shiming Xiang Yangqing Jia Changshui Zhang and Shuicheng Yan. 2008. Trace ratio criterion for feature selection. In AAAI. 671--676.
Feiping Nie Wei Zhu Xuelong Li and others. 2016. Unsupervised feature selection with structured graph optimization. In AAAI. 1302--1308. Feiping Nie Wei Zhu Xuelong Li and others. 2016. Unsupervised feature selection with structured graph optimization. In AAAI. 1302--1308.
Fabian Pedregosa , Gaël Varoquaux , Alexandre Gramfort , Vincent Michel , Bertrand Thirion , Olivier Grisel , Mathieu Blondel , Peter Prettenhofer , Ron Weiss , Vincent Dubourg , and others. 2011 . Scikit-learn: Machine learning in python . J. Mach. Learn. Res. 12 , Oct (2011), 2825 -- 2830 . Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, and others. 2011. Scikit-learn: Machine learning in python. J. Mach. Learn. Res. 12, Oct (2011), 2825--2830.
Hanyang Peng and Yong Fan. 2016. Direct sparsity optimization based feature selection for multi-class classification. In IJCAI. 1918--1924. Hanyang Peng and Yong Fan. 2016. Direct sparsity optimization based feature selection for multi-class classification. In IJCAI. 1918--1924.
Hanyang Peng and Yong Fan. 2017. A general framework for sparsity regularized feature selection via iteratively reweighted least square minimization. In AAAI. 2471--2477. Hanyang Peng and Yong Fan. 2017. A general framework for sparsity regularized feature selection via iteratively reweighted least square minimization. In AAAI. 2471--2477.
Simon Perkins and James Theiler. 2003. Online feature selection using grafting. In ICML. 592--599. Simon Perkins and James Theiler. 2003. Online feature selection using grafting. In ICML. 592--599.
Mingjie Qian and Chengxiang Zhai. 2013. Robust unsupervised feature selection. In IJCAI. 1621--1627. Mingjie Qian and Chengxiang Zhai. 2013. Robust unsupervised feature selection. In IJCAI. 1621--1627.
Debaditya Roy , K Sri Rama Murty, and C Krishna Mohan . 2015 . Feature selection using deep neural networks. In IJCNN. 1--6. Debaditya Roy, K Sri Rama Murty, and C Krishna Mohan. 2015. Feature selection using deep neural networks. In IJCNN. 1--6.
Ted Sandler , John Blitzer , Partha P. Talukdar , and Lyle H . Ungar . 2009 . Regularized learning with networks of features. In NIPS. 1401--1408. Ted Sandler, John Blitzer, Partha P. Talukdar, and Lyle H. Ungar. 2009. Regularized learning with networks of features. In NIPS. 1401--1408.
Qiang Shen , Ren Diao , and Pan Su. 2012. Feature selection ensemble. Turing-100 10 ( 2012 ), 289--306. Qiang Shen, Ren Diao, and Pan Su. 2012. Feature selection ensemble. Turing-100 10 (2012), 289--306.
Alexander Shishkin Anastasia Bezzubtseva Alexey Drutsa Ilia Shishkov Ekaterina Gladkikh Gleb Gusev and Pavel Serdyukov. 2016. Efficient high-order interaction-aware feature selection based on conditional mutual information. In NIPS. 4637--4645. Alexander Shishkin Anastasia Bezzubtseva Alexey Drutsa Ilia Shishkov Ekaterina Gladkikh Gleb Gusev and Pavel Serdyukov. 2016. Efficient high-order interaction-aware feature selection based on conditional mutual information. In NIPS. 4637--4645.
Sameer Singh Jeremy Kubica Scott Larsen and Daria Sorokina. 2009. Parallel large scale feature selection for logistic regression. In SDM. 1172--1183. Sameer Singh Jeremy Kubica Scott Larsen and Daria Sorokina. 2009. Parallel large scale feature selection for logistic regression. In SDM. 1172--1183.
Jiliang Tang , Salem Alelyani , and Huan Liu . 2014. Feature selection for classification: A review . Data Classification : Algorithms and Applications ( 2014 ), 37. Jiliang Tang, Salem Alelyani, and Huan Liu. 2014. Feature selection for classification: A review. Data Classification: Algorithms and Applications (2014), 37.
Jiliang Tang Xia Hu Huiji Gao and Huan Liu. 2013. Unsupervised feature selection for multi-view data in social media. In SDM. 270--278. Jiliang Tang Xia Hu Huiji Gao and Huan Liu. 2013. Unsupervised feature selection for multi-view data in social media. In SDM. 270--278.
Jiliang Tang Xia Hu Huiji Gao and Huan Liu. 2014. Discriminant analysis for unsupervised feature selection. In SDM. 938--946. Jiliang Tang Xia Hu Huiji Gao and Huan Liu. 2014. Discriminant analysis for unsupervised feature selection. In SDM. 938--946.
Jiliang Tang and Huan Liu. 2012a. Feature selection with linked data in social media. In SDM. 118--128. Jiliang Tang and Huan Liu. 2012a. Feature selection with linked data in social media. In SDM. 118--128.
Jiliang Tang and Huan Liu . 2013 . Coselect: Feature selection with instance selection for social media data. In SDM. 695--703. Jiliang Tang and Huan Liu. 2013. Coselect: Feature selection with instance selection for social media data. In SDM. 695--703.
Robert Tibshirani . 1996. Regression shrinkage and selection via the lasso. J. Roy. Stat. Soc. B ( 1996 ), 267--288. Robert Tibshirani. 1996. Regression shrinkage and selection via the lasso. J. Roy. Stat. Soc. B (1996), 267--288.
William T. Vetterling , Saul A. Teukolsky , and William H . Press . 1992 . Numerical Recipes : Example Book (C). Press Syndicate of the University of Cambridge . William T. Vetterling, Saul A. Teukolsky, and William H. Press. 1992. Numerical Recipes: Example Book (C). Press Syndicate of the University of Cambridge.
Michel Vidal-Naquet and Shimon Ullman. 2003. Object recognition with informative features and linear classification. In ICCV. 281--288. Michel Vidal-Naquet and Shimon Ullman. 2003. Object recognition with informative features and linear classification. In ICCV. 281--288.
Hua Wang Feiping Nie and Heng Huang. 2013. Multi-view clustering and feature learning via structured sparsity. In ICML. 352--360. Hua Wang Feiping Nie and Heng Huang. 2013. Multi-view clustering and feature learning via structured sparsity. In ICML. 352--360.
Huan Wang Shuicheng Yan Dong Xu Xiaoou Tang and Thomas Huang. 2007. Trace ratio vs. ratio trace for dimensionality reduction. In CVPR. 1--8. Huan Wang Shuicheng Yan Dong Xu Xiaoou Tang and Thomas Huang. 2007. Trace ratio vs. ratio trace for dimensionality reduction. In CVPR. 1--8.
Jie Wang and Jieping Ye. 2015. Multi-layer feature reduction for tree structured group lasso via hierarchical projection. In NIPS. 1279--1287. Jie Wang and Jieping Ye. 2015. Multi-layer feature reduction for tree structured group lasso via hierarchical projection. In NIPS. 1279--1287.
Qian Wang Jiaxing Zhang Sen Song and Zheng Zhang. 2014a. Attentional neural network: Feature selection using cognitive feedback. In NIPS. 2033--2041. Qian Wang Jiaxing Zhang Sen Song and Zheng Zhang. 2014a. Attentional neural network: Feature selection using cognitive feedback. In NIPS. 2033--2041.
Xiaokai Wei , Bokai Cao , and Philip S . Yu . 2016 a. Nonlinear joint unsupervised feature selection. In SDM. 414--422. Xiaokai Wei, Bokai Cao, and Philip S. Yu. 2016a. Nonlinear joint unsupervised feature selection. In SDM. 414--422.
Xiaokai Wei , Bokai Cao , and Philip S . Yu . 2016 b. Unsupervised feature selection on networks: A generative view. In AAAI. 2215--2221. Xiaokai Wei, Bokai Cao, and Philip S. Yu. 2016b. Unsupervised feature selection on networks: A generative view. In AAAI. 2215--2221.
Xiaokai Wei , Sihong Xie , and Philip S . Yu . 2015 . Efficient partial order preserving unsupervised feature selection on networks. In SDM. 82--90. Xiaokai Wei, Sihong Xie, and Philip S. Yu. 2015. Efficient partial order preserving unsupervised feature selection on networks. In SDM. 82--90.
Xiaokai Wei and Philip S . Yu . 2016 . Unsupervised feature selection by preserving stochastic neighbors. In AISTATS. 995--1003. Xiaokai Wei and Philip S. Yu. 2016. Unsupervised feature selection by preserving stochastic neighbors. In AISTATS. 995--1003.
Liang Wu , Jundong Li , Xia Hu , and Huan Liu . 2017. Gleaning wisdom from the past: Early detection of emerging rumors in social media . In SDM. SIAM , 99--107. Liang Wu, Jundong Li, Xia Hu, and Huan Liu. 2017. Gleaning wisdom from the past: Early detection of emerging rumors in social media. In SDM. SIAM, 99--107.
Xindong Wu Kui Yu Hao Wang and Wei Ding. 2010. Online streaming feature selection. In ICML. 1159--1166. Xindong Wu Kui Yu Hao Wang and Wei Ding. 2010. Online streaming feature selection. In ICML. 1159--1166.
Makoto Yamada , Avishek Saha , Hua Ouyang , Dawei Yin , and Yi Chang . 2014. N3LARS: Minimum redundancy maximum relevance feature selection for large and high-dimensional data. arXiv preprint arXiv:1411.2331 ( 2014 ). Makoto Yamada, Avishek Saha, Hua Ouyang, Dawei Yin, and Yi Chang. 2014. N3LARS: Minimum redundancy maximum relevance feature selection for large and high-dimensional data. arXiv preprint arXiv:1411.2331 (2014).
Howard Hua Yang and John E . Moody . 1999 . Data visualization and feature selection: New algorithms for nongaussian data. In NIPS. 687--693. Howard Hua Yang and John E. Moody. 1999. Data visualization and feature selection: New algorithms for nongaussian data. In NIPS. 687--693.
Yi Yang , Heng Tao Shen , Zhigang Ma, Zi Huang, and Xiaofang Zhou. 2011 . ℓ<sub>2,0</sub>-norm regularized discriminative feature selection for unsupervised learning. In IJCAI. 1589--1594. Yi Yang, Heng Tao Shen, Zhigang Ma, Zi Huang, and Xiaofang Zhou. 2011. ℓ<sub>2,0</sub>-norm regularized discriminative feature selection for unsupervised learning. In IJCAI. 1589--1594.
Lei Yu and Huan Liu. 2003. Feature selection for high-dimensional data: A fast correlation-based filter solution. In ICML. 856--863. Lei Yu and Huan Liu. 2003. Feature selection for high-dimensional data: A fast correlation-based filter solution. In ICML. 856--863.
Stella X. Yu and Jianbo Shi . 2003 . Multiclass spectral clustering. In ICCV. 313--319. Stella X. Yu and Jianbo Shi. 2003. Multiclass spectral clustering. In ICCV. 313--319.
Lei Yuan Jun Liu and Jieping Ye. 2011. Efficient methods for overlapping group lasso. In NIPS. 352--360. Lei Yuan Jun Liu and Jieping Ye. 2011. Efficient methods for overlapping group lasso. In NIPS. 352--360.
Sepehr Abbasi Zadeh Mehrdad Ghadiri Vahab S. Mirrokni and Morteza Zadimoghaddam. 2017. Scalable feature selection via distributed diversity maximization. In AAAI. 2876--2883. Sepehr Abbasi Zadeh Mehrdad Ghadiri Vahab S. Mirrokni and Morteza Zadimoghaddam. 2017. Scalable feature selection via distributed diversity maximization. In AAAI. 2876--2883.
Miao Zhang Chris H. Q. Ding Ya Zhang and Feiping Nie. 2014. Feature selection at the discrete limit. In AAAI. 1355--1361. Miao Zhang Chris H. Q. Ding Ya Zhang and Feiping Nie. 2014. Feature selection at the discrete limit. In AAAI. 1355--1361.
Peng Zhao , Guilherme Rocha , and Bin Yu. 2009. The composite absolute penalties family for grouped and hierarchical variable selection. The Annals of Statistics ( 2009 ), 3468--3497. Peng Zhao, Guilherme Rocha, and Bin Yu. 2009. The composite absolute penalties family for grouped and hierarchical variable selection. The Annals of Statistics (2009), 3468--3497.
Zheng Zhao and Huan Liu. 2008. Multi-source feature selection via geometry-dependent covariance analysis. In FSDM. 36--47. Zheng Zhao and Huan Liu. 2008. Multi-source feature selection via geometry-dependent covariance analysis. In FSDM. 36--47.
Zheng Zhao Lei Wang Huan Liu and others. 2010. Efficient spectral feature selection with minimum redundancy. In AAAI. 673--678. Zheng Zhao Lei Wang Huan Liu and others. 2010. Efficient spectral feature selection with minimum redundancy. In AAAI. 673--678.
Yao Zhou and Jingrui He. 2017. A randomized approach for crowdsourcing in the presence of multiple views. In ICDM. Yao Zhou and Jingrui He. 2017. A randomized approach for crowdsourcing in the presence of multiple views. In ICDM.
Ji Zhu , Saharon Rosset , Robert Tibshirani , and Trevor J . Hastie . 2004 . 1-norm support vector machines. In NIPS. 49--56. Ji Zhu, Saharon Rosset, Robert Tibshirani, and Trevor J. Hastie. 2004. 1-norm support vector machines. In NIPS. 49--56.
Pengfei Zhu Qinghua Hu Changqing Zhang and Wangmeng Zuo. 2016. Coupled dictionary learning for unsupervised feature selection. In AAAI. 2422--2428. Pengfei Zhu Qinghua Hu Changqing Zhang and Wangmeng Zuo. 2016. Coupled dictionary learning for unsupervised feature selection. In AAAI. 2422--2428.