
doi: 10.1002/cpe.4256
handle: 11583/2702004
SummaryIn this paper, we present our implementation of the fast Fourier transforms on graphic processing unit (GPU) using OpenCL. This implementation of the FFT (ToPe‐FFT) is based on the Cooley‐Tukey set of algorithms with support for 1D and higher dimensional transforms using different radices. Factorization for mix‐radices enables our code to target FFTs of near arbitrary length. In systems with multiple graphic cards (GPUs), the library automatically balances the FFT computation thus achieving maximum resource utilization and higher speedup. Based on profiling and micro‐benchmarking of ToPe‐FFT, it is observed that the average speedup of our library for different sizes is 48× faster than the single CPU‐based code using FFTW and 3× faster than NVIDIA's GPU‐based cuFFT library.
fast fourier transform; GPU; high performance computing; OpenCL; Software; Theoretical Computer Science; Computer Science Applications; 1707; Computer Vision and Pattern Recognition; Computer Networks and Communications; Computational Theory and Mathematics
fast fourier transform; GPU; high performance computing; OpenCL; Software; Theoretical Computer Science; Computer Science Applications; 1707; Computer Vision and Pattern Recognition; Computer Networks and Communications; Computational Theory and Mathematics
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
