Learning to classify parallel input/output access patterns
Tóm tắt
Input/output performance on current parallel file systems is sensitive to a good match of application access patterns to file system capabilities. Automatic input/output access pattern classification can determine application access patterns at execution time, guiding adaptive file system policies. In this paper, we examine and compare two novel input/output access pattern classification methods based on learning algorithms. The first approach uses a feedforward neural network previously trained on access pattern benchmarks to generate qualitative classifications. The second approach uses hidden Markov models trained on access patterns from previous executions to create a probabilistic model of input/output accesses. In a parallel application, access patterns can be recognized at the level of each local thread or as the global interleaving of all application threads. Classification of patterns at both levels is important for parallel file system performance; we propose a method for forming global classifications from local classifications. We present results from parallel and sequential benchmarks and applications that demonstrate the viability of this approach.
Từ khóa
#File systems #Pattern classification #Hidden Markov models #Pattern matching #Adaptive systems #Neural networks #Feedforward neural networks #Pattern recognition #Interleaved codesTài liệu tham khảo
lei, 1997, An Analytical Approach to File Prefetching, Proc USENIX 1997 Ann Technical Conf, 275288
10.1007/BF01113694
10.1016/0009-2614(95)00655-N
kroeger, 1996, Predicting File-System Actions from Prior Events, Proc Usenix 1996 Ann Technical Conf, 319328
10.1007/BF01277519
10.1109/5.18626
10.1109/ICDCS.1990.89275
kleinrock, 1975, Queueing Systems, 1
10.1145/258612.258680
10.1109/HPDC.1996.546173
1991, Paragon XP/S Product Overview
10.1145/224538.224638
10.1016/0004-3702(89)90049-0
madhyastha, 1992, Porsonify: A Portable System for Data Sonification
10.1006/jcph.1995.1208
palmer, 1991, Fido: A Cache that Learns to Fetch, Proc Int Conf On Very Large Data Bases, 255262
henderson, 1994, Unstructured Spectral Element Methods: Parallel Algorithms and Simulations
10.1145/224056.224064
10.1109/PDIS.1991.183096
pool, 1996, Scalable I/O Initiative
griffioen, 1994, Reducing File System Latency Using a Predictive Approach, Proc USENIX Summer Technical Conf, 197207
crandall, 1995, Characterization of a Suite of Input/Output Intensive Applications, Proc Supercomputing '95
charniak, 1993, Statistical Language Learning