Small scale to extreme: Methods for characterizing energy efficiency in supercomputing applications

Sustainable Computing: Informatics and Systems - Tập 21 - Trang 90-102 - 2019
Andrew J. Younge1, Ryan E. Grant1, James H. Laros1, Michael Levenhagen1, Stephen L. Olivier1, Kevin Pedretti1, Lee Ward1
1Center for Computing Research, Sandia National Laboratories, P.O. Box 5800, MS-1319, Albuquerque, NM 87185-1319, United States

Tài liệu tham khảo

AMD, 2013. BIOS and kernel developer's guide (BKDG) for AMD family 15h models 00h-0Fh processors. Anderson, 2003 Barry, 2013, Resource utilization reporting, Proc. Cray Users’ Group Technical Conference (CUG) Bertran, 2013, Application-level power and performance characterization and optimization on IBM Blue Gene/Q systems, IBM J. Res. Develop., 57, 1, 10.1147/JRD.2012.2227580 Bircher, 2012, Complete system power estimation using processor performance events, IEEE Trans. Comput., 61, 563, 10.1109/TC.2011.47 Bogdan, P., Marculescu, R., Jain, S., 2013. Dynamic power management for multidomain system-on-chip platforms: An optimal control approach. ACM Trans. Des. Autom. Electron. Syst. 18, 46:1-46:20. URL: http://doi.acm.org/10.1145/2504904, doi: 10.1145/2504904. Brooks, 2000 Browne, 2000, A scalable cross-platform infrastructure for application performance tuning using hardware counters David, 2010, RAPL: memory power estimation and capping, 189 David, 2012, Dynamic power management for multicores: Case study using the intel scc, 147 Dongarra, 2009, Energy profiling and analysis of the HPC challenge benchmarks, The Intl. Journal of High Performance Computing Applications, 23, 265, 10.1177/1094342009106193 Eastep, 2017, Global extensible open power manager: A vehicle for hpc community collaboration toward co-designed energy management solutions, in: Intl. Conference on Supercomputing (ICS) Electronic Educational Devices, 2009. Watts up PRO. Ferreira, 2008, Characterizing application sensitivity to os interference using kernel-level noise injection, 1 Ge, 2005, Performance-constrained distributed DVS scheduling for scientific applications on power-aware clusters, 34 Ge, 2007, CPU miser: A performance-directed, run-time system for power-aware clusters Gholkar, 2016, Power tuning hpc jobs on power-constrained systems, 179 Grant, 2006, Power-performance efficiency of asymmetric multiprocessors for multi-threaded scientific applications Grant, 2017, Evaluating Energy and Power Profiling Techniques for HPC Workloads Grant, 2016, Overcoming challenges in scalable power monitoring with the power api Grant, 2016, Standardizing power monitoring and control at exascale, Computer, 49, 38, 10.1109/MC.2016.308 Grant, 2014, Metrics for evaluating energy saving techniques for resilient hpc systems, 790 Groves, T., Grant, R., 2015. Power aware, dynamic provisioning of hpc networks. Sandia National Labs report 21. Hahnel, 2012, Measuring energy consumption for short code paths using RAPL, ACM SIGMETRICS Performance Evaluation Review, 40, 13, 10.1145/2425248.2425252 Hammond, 2016 Hart, 2014, User-level power monitoring and application performance on Cray XC30 supercomputers., Proceedings of the Cray User Group (CUG) Hewlett Packard Enterprise, 2016 Hsieh, 2012, SST + gem5 = a scalable simulation infrastructure for high performance computing, 196 Hsu, 2005, A power-aware run-time system for high-performance computing, 1 Huang, 2009, Energy-efficient cluster computing via accurate workload characterization, 68 Joseph, 2001, Run-time power estimation in high performance microprocessors, 135 Kappiah, 2005, Just in time dynamic voltage scaling: Exploiting inter-node slack to save energy in MPI programs, 33 Karlin, 2013, Exploring traditional and emerging parallel programming models using a proxy application, 919 Kaufmann, 2003, Craypat-cray x1 performance analysis tool., Cray User Group (May 2003) Knobloch, 2014, Mapping fine-grained power measurements to HPC application runtime characteristics on IBM POWER7, Comput. Sci. Res. Develop., 29, 211, 10.1007/s00450-013-0245-5 Laros, J.H., Grant, R.E., Levenhagen, M., Olivier, S., Pedretti, K.T., Ward, L., Younge, A., 2017. High performance computing - power application programming interface specification version 2.0. URL: http://powerapi.sandia.gov. Laros, 2009, Topics on measuring real power usage on high performance computing platforms, 1 Laros, 2013, PowerInsight - a commodity power measurement capability, 1 Laros, 2012, Energy based performance tuning for large scale high performance computing systems, 6 León, 2015, Optimizing explicit hydrodynamics for power, energy, and performance, 11 León, 2016, Program optimizations: The interplay between power, performance, and energy, Parallel Comput., 58, 56, 10.1016/j.parco.2016.05.004 Lim, 2006, Adaptive, transparent frequency and voltage scaling of communication phases in MPI programs Lucas, 2014 Martin, 2014, Cray XC30 power monitoring and management, Proceedings of CUG Martin, 2015, Cray advanced platform monitoring and control (CAPMC), in: Proc. Cray Users’ Group Technical Conference (CUG) Martin, 2015, Cray advanced platform monitoring and control (CAPMC)., Proceedings of CUG Mazouz, 2014, Statistical validation methodology of CPU power probes, 487 Meuer, 2017 Mills, 2013, Evaluating energy savings for checkpoint/restart, 6 Mills, 2014, Energy consumption of resilience mechanisms in large scale systems, 528 Mittal, M., Valentine, R., 1998. Performance throttling to reduce IC power consumption. US Patent 5,719,800. Forum, 2015, MPI: A Message-Passing Interface Standard, Version, 3, 1 Pande, 2011, Sustainability through massively integrated computing: Are we ready to break the energy efficiency wall for single-chip platforms?, 1 Patki, 2013, Exploring hardware overprovisioning in power-constrained, high performance computing, 173 Pering, 1998, Dynamic voltage scaling and the design of a low-power microprocessor system, in: Power Driven Microarchitecture Workshop, attached to ISCA98, 96 Petrini, 2003, The case of the missing supercomputer performance: Achieving optimal performance on the 8,192 processors of ASCI Q, in: Proc. of the 2003 ACM/IEEE Conference on Supercomputing, 55 Plimpton, 2007 Rashti, 2015, WattProf: A flexible platform for fine-grained HPC power profiling, 698 Reinders, 2005 Rountree, 2009, Adagio: making DVS practical for complex HPC applications, 460 Sarood, 2013, Optimizing power allocation to CPU and memory subsystems in overprovisioned HPC systems, 1 Sprunt, 2002, The basics of performance-monitoring hardware, IEEE Micro, 22, 64, 10.1109/MM.2002.1028477 Tiwari, 2012, Green queue: Customized large-scale clock frequency scaling, 260 Tiwari, 2012, Modeling power and energy usage of HPC kernels, 990 Witkowski, 2013, Practical power consumption estimation for real life HPC applications, Future Generation Computer Systems, 29, 208, 10.1016/j.future.2012.06.003 Xue, 2016, Scalable and realistic benchmark synthesis for efficient noc performance evaluation: A complex network analysis approach, in: 2016 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS), 1 Zhang, 2015