On-demand time-decaying bloom filters for telemarketer detection

Computer Communication Review - Volume 41, Issue 5 - Pages 5-12 - 2011
Giuseppe Bianchi (1), Nico d’Heureuse (2), Saverio Niccolini (2)
(1) CNIT / University of Roma Tor Vergata, Roma, Italy
(2) NEC Laboratories Europe, NEC Europe Ltd., Heidelberg, Germany

Abstract

Several traffic monitoring applications may benefit from efficient mechanisms for approximately tracking smoothed time averages rather than raw counts. This paper provides two contributions in this direction. First, our analysis of Time-decaying Bloom filters, previously proposed data structures devised to compute approximate Exponentially Weighted Moving Averages on streaming data, reveals two major shortcomings: biased estimation when measurements are read at arbitrary time instants, and slow operation resulting from the need to periodically update all the filter's counters at once. We thus propose a new construction, called On-demand Time-decaying Bloom filter, which relies on continuous-time operation to overcome the accuracy and performance limitations of the original window-based approach. Second, we show how this new technique can be exploited in the design of high-performance stream-based monitoring applications by developing VoIPSTREAM, a proof-of-concept real-time analysis version of a previously proposed system for telemarketing call detection. Our validation results, obtained on real telephony data, show that VoIPSTREAM closely mimics the feature extraction process and traffic analysis techniques implemented in the offline system, at a significantly higher processing speed, and without requiring any storage of per-user call detail records.
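
The abstract does not spell out the construction, so the following Python sketch is only one plausible reading of the "on-demand" idea, not the authors' implementation. It assumes that each cell stores a value together with the time it was last touched, and that the exponential decay is applied lazily whenever a cell is written or read, instead of sweeping all counters periodically. The class name `OnDemandDecayingFilter`, the parameters `rows`, `width`, and `tau`, and the use of salted BLAKE2b hashes are all illustrative assumptions.

```python
import hashlib
import math


class OnDemandDecayingFilter:
    """Counting-Bloom-style sketch whose cells decay lazily, on access.

    Hypothetical illustration: names and layout are assumptions,
    not taken from the paper.
    """

    def __init__(self, rows=4, width=1024, tau=60.0):
        self.rows = rows      # number of hash functions (one bank of cells each)
        self.width = width    # cells per bank
        self.tau = tau        # decay time constant, same unit as the timestamps
        self.values = [[0.0] * width for _ in range(rows)]
        self.stamps = [[0.0] * width for _ in range(rows)]

    def _cells(self, key):
        # Derive one hash per bank by salting BLAKE2b with the bank index.
        data = key.encode("utf-8")
        for row in range(self.rows):
            digest = hashlib.blake2b(data, digest_size=8, salt=bytes([row])).digest()
            yield row, int.from_bytes(digest, "big") % self.width

    def _decayed(self, row, col, now):
        # Value of a cell as seen at time `now`; assumes `now` never goes backwards.
        elapsed = now - self.stamps[row][col]
        return self.values[row][col] * math.exp(-elapsed / self.tau)

    def add(self, key, now, weight=1.0):
        # Decay only the cells this key touches, then add the new event.
        for row, col in self._cells(key):
            self.values[row][col] = self._decayed(row, col, now) + weight
            self.stamps[row][col] = now

    def estimate(self, key, now):
        # Minimum over the key's cells limits the effect of hash collisions.
        return min(self._decayed(row, col, now) for row, col in self._cells(key))
```

In this sketch an update costs one pass over the `rows` cells of a single key, and a read at any instant applies the decay up to exactly that instant, which is the kind of behaviour the abstract contrasts with the window-based approach. A caller counting calls per source might run `f.add(caller_id, timestamp)` on each call and compare `f.estimate(caller_id, timestamp)` against a threshold; both the identifier and the threshold are assumptions here.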
