Gabriel Zachmann and Alexander Greis has released an efficient algorithm for sorting in stream architectures, called GPU-ABiSort. Expect (n log n) / p complexity from this fast algorithm.
Currently I’m working to implement this in CUDA.
Download
A. Paper
B. Source Code
Not available yet.