Implementation of SM4 block cipher on CUDA GPU and its analysis*

2022 International Conference on Platform Technology and Service (PlatCon)(2022)

Cited 1|Views2
No score
Abstract
SM4 is a symmetric key algorithm developed by the China National Cryptographic Authority. In this paper, the parallel implementation of SM4 block cipher commonly used in China was performed on GPU. The SM4 block cipher has an implementation that uses an 8-bit Sbox table and an implementation that uses a 32-bit T-table. Measuring the performance of each of the two table implementations, the T-table implementation performed approximately 0.75× worse than the Sbox table implementation. Additionally, Implemented SM4 to use shared memory for better performance. The result is a performance improvement of approximately 1.06× ~ 1.19× when using shared memory in the Sbox table implementation.
More
Translated text
Key words
SM4,Block cipher,GPU,Shared memory,Parallel implementation,CUDA programming
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined