Dynamic threshold for imbalance assessment on load balancing for multicore systems

Computers & Electrical Engineering - Volume 39 - Pages 338-348 - 2013
Ian K.T. Tan1, Ian Chai2, Poo Kuan Hoong1
1Faculty of Computing and Informatics, Multimedia University, Jalan Multimedia, Cyberjaya, 63100 Selangor, Malaysia
2Faculty of Engineering, Multimedia University, Jalan Multimedia, Cyberjaya, 63100 Selangor, Malaysia

References

Betts B. HPC cloud: supercomputing to go, February 2012 [cited 26 June 2012] [online].
Burt J. HP, IBM lead a growing HPC server market: IDC, June 2012 [cited 26 June 2012] [online].
Borkar, 2007, Thousand core chips: a technology perspective, 746
Nitta S. AMD confirmed to release new CPUs at CeBIT, probably bulldozer chips, February 2011 [online].
Dagum, 1998, OpenMP: an industry standard API for shared-memory programming, IEEE Comput Sci Eng, 5, 46, 10.1109/99.660313
Hughes P, Shurtleff G. LJ interviews Linus Torvalds, Linux J 29. <http://www.linuxjournal.com/article/146>.
Russinovich, 2008
Mauerer, 2008
Dobson M. LXR, 2002 [online].
Stallings, 2008
Tanenbaum, 2005
Silberschatz, 2006
Nutt, 2003
Flynn, 2001
Molnar I. Modular scheduler core and completely fair scheduler [CFS], April 2007 [online].
Torvalds L. Modular scheduler core and completely fair scheduler CFS, 2007 [online].
Groves T, Knockel J, Schulte E. BFS vs. CFS – scheduler comparison, December 2009 [online].
Wong, 2008, Towards achieving fairness in the Linux scheduler, SIGOPS Oper Syst Rev, 42, 34, 10.1145/1400097.1400102
Feitelson, 1992, Gang scheduling performance benefits for fine-grain synchronization, J Parallel Distrib Comput, 16, 306, 10.1016/0743-7315(92)90014-E
Squillante, 1993, Using processor-cache affinity information in shared-memory multiprocessor scheduling, IEEE Trans Parallel Distrib Syst, 4, 131, 10.1109/71.207589
Snavely, 2000, Symbiotic job scheduling for a simultaneous multithreaded processor, 234
Tam, 2007, Thread clustering: sharing-aware scheduling on SMP-CMP-SMT multiprocessors, SIGOPS Oper Syst Rev, 41, 47, 10.1145/1272998.1273004
Li, 2007, Efficient operating system scheduling for performance-asymmetric multi-core architectures, 53:1
Lam J, Tan I, Ewe H, Ong B, Tan C. System’s perspective towards last level-cache architecture of CMP microprocessors. In: Proceedings of the HPC Asia 2009, 2009. p. 343–50.
Boneti, 2008, A dynamic scheduler for balancing HPC applications, 41:1
Kavi, 2012, A comparative analysis of performance improvement schemes for cache memories, Comput Electr Eng, 38, 243, 10.1016/j.compeleceng.2011.12.008
Mahafzah, 2010, The hybrid dynamic parallel scheduling algorithm for load balancing on chained-cubic tree interconnection networks, J Supercomput, 52, 224, 10.1007/s11227-009-0288-3
Zheng, 2011, Periodic hierarchical load balancing for large supercomputers, Int J High Performance Comput Appl, 25, 371, 10.1177/1094342010394383
Lim J, Hoong PK, Yeoh E-T. Heuristic neighbor selection algorithm for decentralized load balancing in clustered heterogeneous computational environment. In: 14th International conference on advanced communication technology (ICACT), 2012. p. 1215–9.
Tan, 2010, Pthreads performance characteristics on shared cache CMP, private cache CMP and SMP, 186
Lam JW, Tan I, Ong BL, Tan CK. Effective operating system scheduling domain hierarchy for core-cache awareness. In: TENCON 2009 – 2009 IEEE region 10 conference, 2009. p. 1–7. http://dx.doi.org/10.1109/TENCON.2009.5395800.
Gu, 2012, An efficient scheduler of RTOS for multi/many-core system, Comput Electr Eng, 38, 785, 10.1016/j.compeleceng.2011.09.009
Calandrino JM, Hill C, Baumberger DP, Li T, Young JC, Hahn S. LinSched: the Linux scheduler simulator. In: Proceedings of the 21st international conference on parallel and distributed computing and communications systems, 2008. p. 171–6.
Jones MT. Linux scheduler simulation: simulating the Linux scheduler in user space with LinSched, IBM DeveloperWorks. <http://www.ibm.com/developerworks/linux/library/l-linux-scheduler-simulator/index.html>.
Zhang Y. Improved Hackbench, February 2008 [online].
Manomohan R. LinSched for 2.6.35 released, October 2010 [online].
Bilski, 2010, Multi-core implementation of the symmetric cryptography algorithms in the measurement system, Measurement, 43, 1049, 10.1016/j.measurement.2010.03.002
Chen, 2011, A comparison of 3D cone-beam computed tomography (CT) image reconstruction performance on homogeneous multi-core processor and on other processors, Measurement, 44, 2035, 10.1016/j.measurement.2011.08.012
Jain, 1991
Staelin C, Hewlett-Packard Laboratories. lmbench: portable tools for performance analysis. In: USENIX annual technical conference, 1996. p. 279–94.
Kopytov A. SysBench: a system performance benchmark, 2012 [online].
Bailey, 1991, The NAS parallel benchmarks, SC Conf, 0, 158
Dunbar J, editor. NAS parallel benchmarks, 2012 [online].
Jin H, Frumkin M, Yan J. The OpenMP implementation of NAS parallel benchmarks and its performance. Tech. rep., NASA Ames Research Center, October 1999.
Hofmeyr, 2010, Load balancing on speed, 147
Hofmeyr, 2011, Juggle: proactive load balancing on multicore computers, 3