A fully distributed framework for cost-sensitive data mining

Wei Fan1, Haixun Wang1, P.S. Yu1, S.J. Stolfo2
1IBM Thomas J. Watson Research Center, Hawthorne, NY, USA
2Department of Computer Science, Columbia University, New York, NY, USA

Abstract

We propose a fully distributed system (as opposed to centralized and partially distributed systems) for cost-sensitive data mining. Experimental results show that this approach achieves higher accuracy than both centralized and partially distributed learning methods, while requiring much less training time and incurring neither communication nor computation overhead.

Keywords

#Data mining #Credit cards #Distributed computing #Machine learning #Switches #Milling machines #Rivers #Computer science #Learning systems #Relational databases
