LWPTool: A Lightweight Profiler to Guide Data Layout Optimization

IEEE Transactions on Parallel and Distributed Systems, Vol. 29, No. 11, pp. 2489-2502, 2018
Chao Yu¹, Probir Roy², Yuebin Bai¹, Hailong Yang¹, Xu Liu²
¹ Beihang University, Beijing, CN
² College of William and Mary, Williamsburg, VA, US
