Learning to classify parallel input/output access patterns

IEEE Transactions on Parallel and Distributed Systems - Tập 13 Số 8 - Trang 802-813 - 2002
T.M. Madhyastha1, D.A. Reed2
1Department of Computer Engineering, University of California, Santa Cruz, CA, USA
2Department of Computer science, University of Illinois, Urbana, IL, USA

Tóm tắt

Input/output performance on current parallel file systems is sensitive to a good match of application access patterns to file system capabilities. Automatic input/output access pattern classification can determine application access patterns at execution time, guiding adaptive file system policies. In this paper, we examine and compare two novel input/output access pattern classification methods based on learning algorithms. The first approach uses a feedforward neural network previously trained on access pattern benchmarks to generate qualitative classifications. The second approach uses hidden Markov models trained on access patterns from previous executions to create a probabilistic model of input/output accesses. In a parallel application, access patterns can be recognized at the level of each local thread or as the global interleaving of all application threads. Classification of patterns at both levels is important for parallel file system performance; we propose a method for forming global classifications from local classifications. We present results from parallel and sequential benchmarks and applications that demonstrate the viability of this approach.

Từ khóa

#File systems #Pattern classification #Hidden Markov models #Pattern matching #Adaptive systems #Neural networks #Feedforward neural networks #Pattern recognition #Interleaved codes

Tài liệu tham khảo

lei, 1997, An Analytical Approach to File Prefetching, Proc USENIX 1997 Ann Technical Conf, 275288 10.1007/BF01113694 10.1016/0009-2614(95)00655-N kroeger, 1996, Predicting File-System Actions from Prior Events, Proc Usenix 1996 Ann Technical Conf, 319328 10.1007/BF01277519 10.1109/5.18626 10.1109/ICDCS.1990.89275 kleinrock, 1975, Queueing Systems, 1 10.1145/258612.258680 10.1109/HPDC.1996.546173 1991, Paragon XP/S Product Overview 10.1145/224538.224638 10.1016/0004-3702(89)90049-0 madhyastha, 1992, Porsonify: A Portable System for Data Sonification 10.1006/jcph.1995.1208 palmer, 1991, Fido: A Cache that Learns to Fetch, Proc Int Conf On Very Large Data Bases, 255262 henderson, 1994, Unstructured Spectral Element Methods: Parallel Algorithms and Simulations 10.1145/224056.224064 10.1109/PDIS.1991.183096 pool, 1996, Scalable I/O Initiative griffioen, 1994, Reducing File System Latency Using a Predictive Approach, Proc USENIX Summer Technical Conf, 197207 crandall, 1995, Characterization of a Suite of Input/Output Intensive Applications, Proc Supercomputing '95 charniak, 1993, Statistical Language Learning