Simulation and optimization of HPC job allocation for jointly reducing communication and cooling costs

Sustainable Computing: Informatics and Systems - Tập 6 - Trang 48-57 - 2015
Jie Meng1, Samuel McCauley2, Fulya Kaplan1, Vitus J. Leung3, Ayse K. Coskun1
1Department of Electrical and Computer Engineering, Boston University, 8 St. Mary's Street, Boston, MA 02215, United States
2Department of Computer Science, Stony Brook University, Stony Brook, NY 11794-4400, United States
3Sandia National Laboratories, P.O. Box 5800, Albuquerque, NM 87185, United States

Tài liệu tham khảo

Leung, 2002, Processor allocation on Cplant: achieving general processor locality using one-dimensional allocation strategies, 296 Mache, 1997, Minimizing message-passing contention in fragmentation-free processor allocation, 120 Bender, 2008, Communication-aware processor allocation for supercomputers: finding point sets of small average distance, Algorithmica, 50, 279, 10.1007/s00453-007-9037-2 Bhattacharya, 1994, Lookahead processor allocation in mesh-connected massively parallel multicomputer, 868 Walker, 2010, Faster high-quality processor allocation Subramani, 2002, Selective buddy allocation for scheduling parallel jobs on clusters, 107 Sansottera, 2011, Cooling-aware workload placement with performance constraints, Perform. Eval., 68, 1232, 10.1016/j.peva.2011.07.018 J. Koomey, “Growth in data center electricity use 2005 to 2010”, http://www.analyticspress.com/datacenters.html (accessed 1.08.11). UC San Diego's greenlight project to improve energy efficiency of computing”, http://ucsdnews.ucsd.edu/newsrel/science/07-08GreenLightProj.asp. Sawyer, 2004, Calculating total power requirements for data centers J. Rajic, “Evolving toward the green data center”, http://http://stack.nil.si, 2009. Brown, 2010, Toward energy-efficient computing, Commun. ACM, 53, 50, 10.1145/1666420.1666438 Moore, 2005, Making scheduling “cool”: temperature-aware workload placement in data centers, 5 Tang, 2008, Energy-efficient thermal-aware task scheduling for homogeneous high-performance computing data centers: a cyber-physical approach, IEEE Trans. Parallel Distrib. Syst., 19, 1458, 10.1109/TPDS.2008.111 Pakbaznia, 2009, Minimizing data center cooling and server power costs, 145 Ayoub, 2010, Gentlecool: cooling aware proactive workload scheduling in multi-machine systems, 295 Heath, 2006, Mercury and freon: temperature emulation and management for server systems, 106 Kaplan, 2013, Optimizing communication and cooling costs in HPC data centers via intelligent job allocation Chuang, 1991, An efficient submesh allocation strategy for mesh computer systems, 256 Kim, 2012, Free cooling-aware dynamic power management for green datacenters, 140 Wang, 2009, Towards thermal aware workload scheduling in a data center, 116 “Parallel workloads archive: raw workload logs from various machines around the world”, http://www.cs.huji.ac.il/labs/parallel/workload/. Kumar, 2008, Optimization of all-to-all communication on the blue gene/l supercomputer, 320 Barrett, 2011, MiniGhost: a miniapp for exploring boundary exchange strategies using stencil computations scientific parallel computing Hertel, 1993, “CTH: a software family for multi-dimensional shock physics analysis,” L. A. N. Laboratory, “High-performance computing: Cielo supercomputer”, http://www.lanl.gov/orgs/hps/cielo/index.html. “Top 500 list – November 2013,” http://www.top500.org/lists/2013/11/. Lively, 2011, “Energy and performance characteristics of different parallel implementations of scientific applications on multicore systems,”, J. High Perform. Comput. Appl., 25 Sayeed, 2008, Measuring high-performance computing with real applications, Comput. Sci. Eng., 10, 60, 10.1109/MCSE.2008.98 Crovella, 1992, Using communication-to-computation ratio in parallel program design and performance prediction, 238 Tang, 2006, Sensor-based fast thermal evaluation model for energy efficient high-performance datacenters, 203 Brayton, 1979, A new algorithm for statistical circuit design based on quasi-newton methods and function splitting, 784 Rodrigues, 2012, Improvements to the structural simulation toolkit, 190 Binkert, 2006, The M5 simulator: Modeling networked systems, IEEE Micro., 26, 52, 10.1109/MM.2006.82 Hsieh, 2012, Sst + gem5 = a scalable simulation infrastructure for high performance computing, 648 Lifka, 1995, The anl/ibm sp scheduling system, 295 Mu’alem, 2001, Utilization, predictability, workloads, and user runtime estimates in scheduling the IBM SP2 with backfilling, IEEE Trans. Parallel Distrib. Syst., 12, 529, 10.1109/71.932708 Lo, 1997, Noncontiguous processor allocation algorithms for mesh-connected multicomputers, IEEE Trans. Parallel Distrib. Syst., 8, 712, 10.1109/71.598346 Krumke, 1997, Compact location problems, Theor. Comput. Sci., 181, 379, 10.1016/S0304-3975(96)00304-0