Cufft

CuFFT is a FFT library for CUDA enabled GPUs

Fftw

FFTW, "The Fastest Fourier Transform in The West", is a C library which implements discrete Fourier Transforms



Experiments discussion batched

Example

"In the experiments and discussion below i find that cufft is slower than fftw for batched 2d ffts"

from question  

Why is cuFFT so slow?

"However for a variety of fft problem sizes i ve found that cufft is slower than fftw with openmp"

from question  

Why is cuFFT so slow?

Others

Example

I am working on a code which needs to be time efficient and thus using cufft for this purpose but when i try to compute fft of a very large data in parallel it is slower than cpu fftw and the reason i find after finding the time for every line of code using high precision timing code is that cudamalloc taking around 0.983 sec while the time for rest of the lines of code is around 0.00xx sec which is expected ..

from question  

Is cudamalloc slower than cudamemcpy?

Back to Home
Data comes from Stack Exchange with CC-BY-SA-4.0