Combining instance selection and self-training to improve data stream quantification

André Gustavo Maletzke1, Denis Moreira dos Reis1, Gustavo E. A. P. A. Batista1
1Laboratório de Inteligência Computacional (LABIC), Instituto de Ciências Matemáticas e de Computação (ICMC), Universidade de São Paulo, São Carlos, Brazil

Tóm tắt

Từ khóa


Tài liệu tham khảo

Hill DJ, Minsker BS (2010) Anomaly detection in streaming environmental sensor data: a data-driven modeling approach. Environ Model Softw 25(9):1014–1022.

Dyer KB, Capo R, Polikar R (2014) Compose: a semisupervised learning framework for initially labeled nonstationary streaming data. IEEE Trans Neural Netw Learn Syst 25(1):12–26. https://doi.org/10.1109/TNNLS.2013.2277712 .

Fanaee-T H, Gama J (2014) Event labeling combining ensemble detectors and background knowledge. Prog Artif Intell 2(2):113–127. https://doi.org/10.1007/s13748-013-0040-3 .

Nguyen HL, Woon YK, Ng WK (2015) A survey on data stream clustering and classification. KAIS 45:535–569. https://doi.org/10.1007/s10115-014-0808-1 .

Forman G (2005) Counting positives accurately despite inaccurate classification In: ECML, 564–575.. Springer.

Chen Y, Why A, Batista GEAPA, Mafra-Neto A, Keogh E (2014) Flying insect classification with inexpensive sensors. J Insect Behav 27(5):657–677. https://doi.org/10.1007/s10905-014-9454-4 .

Silva DF, Souza VMA, Ellis D, Keogh E, Batista GEAPA (2015) Exploring low cost laser sensors to identify flying insect species. J Intell Robot Syst 80(1):313–330. https://doi.org/10.1007/s10846-014-0168-9 .

Gao W, Sebastiani F (2016) From classification to quantification in tweet sentiment analysis. Soc Netw Anal Min 6(1). https://doi.org/10.1007/s13278-016-0327-z .

Vuk M, Curk T (2006) Roc curve, lift chart and calibration plot. Metodoloski zvezki 3(1):89.

Bella A, Ferri C, Hernández-Orallo J, Ramírez-Quintana MJ (2010) Quantification via probability estimators In: ICDM, 737–742, Sidney. https://doi.org/10.1109/ICDM.2010.75 .

Milli L, Monreale A, Rossetti G, Giannotti F, Pedreschi D, Sebastiani F (2013) Quantification trees In: ICDM, 528–536, Dallas. https://doi.org/10.1109/ICDM.2013.122 .

Maletzke A, Reis D, Batista G (2017) Quantification in data streams: initial results In: BRACIS, 43–48, Uberlândia. https://doi.org/10.1109/BRACIS.2017.74 .

Gama J, Medas P, Castillo G, Rodrigues P (2004) Learning with drift detection(Bazzan ALC, Labidi S, eds.). Springer, Berlin, Heidelberg.

Masud M, Gao J, Khan L, Han J, Thuraisingham BM (2011) Classification and novel class detection in concept-drifting data streams under time constraints. IEEE Trans Knowl Data Eng 23(6):859–874. https://doi.org/10.1109/TKDE.2010.61 .

dos Reis DM, Flach P, Matwin S, Batista GEAPA (2016) Fast unsupervised online drift detection using incremental Kolmogorov-Smirnov test In: ACM SIGKDD, 1545–1554, San Francisco. https://doi.org/10.1145/2939672.2939836 .

Souza VMA, Silva DF, Gama J, Batista GEAPA (2015) Data stream classification guided by clustering on nonstationary environments and extreme verification latency In: SDM, 873–881, Vancouver. https://doi.org/10.1137/1.9781611974010.98 .

Settles B (2010) Active learning literature survey. Univ Wis Madison 52(55-66):11.

Souza VMA, Rossi RG, Batista GEAPA, Rezende SO (2017) Unsupervised active learning techniques for labeling training sets: an experimental evaluation on sequential data. Intell Data Anal 21(5):1061–1095. https://doi.org/10.3233/IDA-163075 .

Zliobaite I, Bifet A, Pfahringer B, Holmes G (2014) Active learning with drifting streaming data. IEEE Trans Neural Netw Learn Syst 25(1):27–39.

Zhu X, Goldberg AB (2009) Introduction to semi-supervised learning. Synth Lect Artif Intell Mach Learn 3(1):1–130.

Taylor L (1963) Analysis of the effect of temperature on insects in flight. J Anim Ecol 32(1):99–117.

Mellanby K (1936) Humidity and insect metabolism. Nature 138:124–125.

Hammami N, Bedda M (2010) Improved tree model for arabic speech recognition In: ICCSIT, 521–526.. IEEE, Chengdu.

dos Reis D, Maletzke A, Batista G (2018) Unsupervised context switch for classification tasks on data streams with recurrent concepts In: ACM/SIGAPP.. ACM, Pau, France.

Maletzke A, dos Reis D, Batista G (2018) Combining instance selection and self-training to improve data stream quantification, Online Supplementary Material. https://sites.google.com/site/andregustavom/research/sqsi-is . Accessed 04 June 2018.