Publication status | Published |
Title | Fast algorithm for parallel solving inversion of large scale small matrices based on GPU |
Authors | |
Publication date | 2023-11-01 |
Journal | Journal of Supercomputing |
ISSN/eISSN | 0920-8542 |
Volume | 79, Issue: 16, Pages: 18313-18339 |
Abstract | Inverting a matrix is time-consuming, and many works focus on accelerating the inversion of a single large matrix by GPU. However, the problem of parallelizing the inversion of a large number of small matrices has received little attention. These problems are widely applied in computer science, including accelerating cryptographic algorithms and image processing algorithms. In this paper, we propose a Revised In-Place Inversion algorithm for inverting a large number of small matrices on the CUDA platform, which adopts a more refined parallelization scheme and outperforms other algorithms, achieving a speedup of up to 20.9572 times over the batch matrix inverse kernel in CUBLAS. Additionally, we found that there is an upper bound on the input data size for each GPU device, and the performance will degrade if the input data size is too large. Therefore, we propose the Saturation Size Curve based on this finding to divide matrices into batches and improve the algorithm performance. Experimental results show that this strategy increases the algorithm’s performance by 1.75 times and effectively alleviates the problem of performance degradation. |
Keywords | A large number of small matrices ; CUDA ; GPU acceleration ; High performance computing ; Matrix inversion |
DOI | 10.1007/s11227-023-05336-7 |
URL | View source |
Indexed by | SCIE |
Language | English |
WOS research areas | Computer Science ; Engineering |
WOS categories | Computer Science, Hardware & Architecture ; Computer Science, Theory & Methods ; Engineering, Electrical & Electronic |
WOS accession number | WOS:000986431100005 |
Scopus accession number | 2-s2.0-85159283646 |
Document type | Journal article |
Item identifier | https://repository.uic.edu.cn/handle/39GCC9TT/13086 |
Collection | Knowledge output produced outside this institution ; Faculty of Science and Technology |
Corresponding author | Yewang, Chen |
Author affiliations | 1. The College of Computer Science and Technology, Huaqiao University, Xiamen, Jimei Avenue 668, Fujian, 361021, China 2. Fujian Key Laboratory of Big Data Intelligence and Security, Huaqiao University, Xiamen, Jimei Avenue 668, Fujian, 361021, China 3. Xiamen Key Laboratory of Computer Vision and Pattern Recognition, Huaqiao University, Xiamen, Jimei Avenue 668, Fujian, 361021, China 4. College of Mechanical Engineering and Automation, Huaqiao University, Xiamen, Jimei Avenue 668, Fujian, 361021, China |
Recommended citation GB/T 7714 | Xuebin, Jin, Yewang, Chen, Wentao, Fan, et al. Fast algorithm for parallel solving inversion of large scale small matrices based on GPU[J]. Journal of Supercomputing, 2023, 79(16): 18313-18339. |
APA | Xuebin, Jin, Yewang, Chen, Wentao, Fan, Yong, Zhang, & Jixiang, Du. (2023). Fast algorithm for parallel solving inversion of large scale small matrices based on GPU. Journal of Supercomputing, 79(16), 18313-18339. |
MLA | Xuebin, Jin, et al. "Fast algorithm for parallel solving inversion of large scale small matrices based on GPU". Journal of Supercomputing 79.16 (2023): 18313-18339. |
Files in this item | There are no files associated with this item. |