Chrome Extension
WeChat Mini Program
Use on ChatGLM

POSTER: RadiK: Scalable Radix Top-K Selection on GPUs

Yifei Li, Bole Zhou, Jiejing Zhang,Xuechao Wei, Yinghan Li, Yingda Chen

ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming(2024)

Cited 0|Views0
No score
Abstract
By identifying the.. largest or smallest elements in a set of data, top-k selection is critical for modern high-performance databases and machine learning systems, especially with large data volumes. However, previous studies on its GPU implementation are mostly merge-based and rely heavily on the high-speed but size-limited on-chip memory, thereby resulting in a restricted upper bound on... This paper introduces RadiK, a highly optimized GPU-parallel radix top-k selection that is scalable with.., input length, and batch size. With a carefully designed optimization framework targeting high memory bandwidth and resource utilization, RadiK supports far larger.. than the prior art, achieving up to 2.5x speedup for non-batch queries and up to 4.8x speedup for batch queries. We also propose a lightweight refinement that strengthens the robustness of RadiK against skewed distributions by adaptively scaling the input elements.
More
Translated text
Key words
Top-K,Radix Select,GPU-Parallel Algorithm
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined