A set of level 3 basic linear algebra subprograms

ACM Transactions on Mathematical Software - Tập 16 Số 1 - Trang 1-17 - 1990
Jack Dongarra1, Jeremy Du Croz2, Sven Hammarling2, Iain Duff3
1Univ. of Tennessee, Knoxville
2Numerical Algorithms Group, Ltd., Oxford, UK
3Harwell Lab, Oxfordshire, UK

Tóm tắt

This paper describes an extension to the set of Basic Linear Algebra Subprograms. The extensions are targeted at matrix-vector operations that should provide for efficient and portable implementations of algorithms for high-performance computers

Từ khóa


Tài liệu tham khảo

BARRON D. W., 1960, Solution of simultaneous linear equations using a magnetic-tape store, Comput. J., 3, 28, 10.1093/comjnl/3.1.28

10.1137/0908009

BRONLUND O. E., 1974, QR-factorization of partitioned matrices, Comput. Meth. Appl. Mech. Eng., 3, 153, 10.1016/0045-7825(74)90023-1

BUCHER I., 1984, Eds. IMACS, 546

CALAHAN D.A., 1986, Proceedings International Conference on Parallel Processing (Aug. 1986

CARNEVALI P., 1987, G., ROBERT, Y., AND SGUAZZERO, P. Efficient Fortran implementation of the Gaussian elimination and Householder reduction algorithms on the IBM 3090 vector multiprocessor. IBM ECSEC Rep. ICE-0012

DAVE A. K., 1987, Sparse matrix calculations on the CRAY-2, Parallel Comput., 5, 55, 10.1016/0167-8191(87)90006-8

DEMMEL J., 1987, Argonne National Lab. Rep. ANL-MCS-TM-97

DIETRICH G, 1976, A new formulation of the hypermatrix Householder QR-decomposition, Comput. Meth. AppI. Mech. Eng., 9, 273, 10.1016/0045-7825(76)90032-3

10.1145/1057935.1057937

DONGARRA J. J., 1979, LINPACK Users' Guide, 10.1137/1.9781611971811

10.1145/42288.42291

10.1145/42288.42292

10.1145/77626.77627

DONGARRA J. J., 1989, Rep. CS-89-90

DONGARRA J. J., 1984, Implementing linear algebra algorithms for dense matrices on a vector pipeline machine, SIAM Rev., 26, 1, 10.1137/1026003

DONGARRA J. J., 1987, Argonne National Lab. Rep. ANL-MCS-TM-99

DONGARRA J. J., 1989, Implementing dense linear algebra using multitasking on the CRAY X-MP-4, J. Comput. Appl. Math., 27, 215

DONGARRA J. J., 1986, Proceedings Parallel Computing 85, 113

10.1145/355972.355980

DUFF I. S., 1981, Numerical Analysis Proceedings, Dundee

10.1137/0908086

GEORGE A., 1985, Auxiliary storage methods for solving finite element systems, SIAM J. Sci. Star. Comput., 6, 4

IBM, 1986, and scientific subroutine library, Program, 5668

10.1145/355841.355847

10.1145/355841.355848

10.1145/362875.362879

ROBERT Y., 1987, The LU decomposition algorithm and its efficient Fortran implementation on the IBM 3090 vector multiprocessor. IBM ECSEC Rep. ICE-0006

SCHREIBER R., 1986, Module design specification (Version 1.0)

10.1137/0725014