Computational SIMD framework: split-radix SIMD-FFT algorithm, derivation, implementation and performance

P. Rodriguez V1, M.S. Pattichis1, R. Jordan2
1IvPCL Laboratory, Albuquerque, NM, USA
2ISTEC, University of New Mexico, Albuquerque, NM, USA

Tóm tắt

A general framework to develop efficient single instruction multiple data (SIMD) compliant algorithms was previously proposed. In this paper a split-radix SIMD-FFT algorithm is derived under this framework and compared against the radix-2 SIMD-FFT algorithm, proven to have very efficient implementation. Regardless of the intrinsic irregular pattern present in the split-radix algorithm, it is shown that its performance improvement, when compared to the radix-2 algorithm, ranges from 2.5% up to 8.1%.

Từ khóa

#Computer architecture #Computational complexity #Registers #Indexing #Costs #Computer aided manufacturing #Microprocessors #Digital signal processing #Wire

Tài liệu tham khảo

dudgeon, 1984, Multidimensional Digital Signal Processing rodriguez, 0, Computational Framework fo single J nstruction Multiple Data (SIMD) Architectures Applied (0 Digital Signal Processing, IEEE 2002 2001, IA-32 Intel Architecture Software Developer's Manual, 2 10.1145/301618.301661 2001, AltiVec Technology Programming Environment Manual-CT_ALTIVECPEM_Rl 10.1137/1.9781611970999 1999, Split-Radix Fast Fourier Transform Using Streaming SIMD Extensions, Intel Applications Notes, ap 808 10.1109/ICASSP.2001.941115 rodriguez, 0, A Radix-2 Multidimensional Transposition-free FFf algorithm for Modem Single Instruction Multiple Data (SIMD Architectures, EUSIPCO 2002 crandall, 2000, Super-Computing Style FFT Library for Apple G4, Advanced Computation Group Apple Computer rodriguez, 0, A Radix-2 FFf Algorithm for Modem Single Instruction MUltiple Data (SIMD) Architectures, ICASSP 2002