科研成果详情

发表状态已发表Published
题名Fast algorithm for parallel solving inversion of large scale small matrices based on GPU
作者
发表日期2023-11-01
发表期刊Journal of Supercomputing
ISSN/eISSN0920-8542
卷号79期号:16页码:18313-18339
摘要

Inverting a matrix is time-consuming, and many works focus on accelerating the inversion of a single large matrix by GPU. However, the problem of parallelizing the inversion of a large number of small matrices has received little attention. These problems are widely applied in computer science, including accelerating cryptographic algorithms and image processing algorithms. In this paper, we propose a Revised In-Place Inversion algorithm for inverting a large number of small matrices on the CUDA platform, which adopts a more refined parallelization scheme and outperforms other algorithms, achieving a speedup of up to 20.9572 times over the batch matrix inverse kernel in CUBLAS. Additionally, we found that there is an upper bound on the input data size for each GPU device, and the performance will degrade if the input data size is too large. Therefore, we propose the Saturation Size Curve based on this finding to divide matrices into batches and improve the algorithm performance. Experimental results show that this strategy increases the algorithm’s performance by 1.75 times and effectively alleviates the problem of performance degradation.

关键词A large number of small matrices CUDA GPU acceleration High performance computing Matrix inversion
DOI10.1007/s11227-023-05336-7
URL查看来源
收录类别SCIE
语种英语English
WOS研究方向Computer Science ; Engineering
WOS类目Computer Science, Hardware & Architecture ; Computer Science, Theory & Methods ; Engineering, Electrical & Electronic
WOS记录号WOS:000986431100005
Scopus入藏号2-s2.0-85159283646
引用统计
文献类型期刊论文
条目标识符https://repository.uic.edu.cn/handle/39GCC9TT/13086
专题个人在本单位外知识产出
理工科技学院
通讯作者Yewang, Chen
作者单位
1.The College of Computer Science and Technology,Huaqiao University,Xiamen,Jimei Avenu 668, Fujian,361021,China
2.Fujian Key Laboratory of Big Data Intelligence and Security,Huaqiao University,Xiamen,Jimei Avenu 668, Fujian,361021,China
3.Xiamen Key Laboratory of Computer Vision and Pattern Recognition,Huaqiao University,Xiamen,Jimei Avenu 668, Fujian,361021,China
4.College of Mechanical Engineering and Automation,Huaqiao University,Xiamen,Jimei Avenu 668, Fujian,361021,China
推荐引用方式
GB/T 7714
Xuebin, Jin,Yewang, Chen,Wentao, Fanet al. Fast algorithm for parallel solving inversion of large scale small matrices based on GPU[J]. Journal of Supercomputing, 2023, 79(16): 18313-18339.
APA Xuebin, Jin, Yewang, Chen, Wentao, Fan, Yong, Zhang, & Jixiang, Du. (2023). Fast algorithm for parallel solving inversion of large scale small matrices based on GPU. Journal of Supercomputing, 79(16), 18313-18339.
MLA Xuebin, Jin,et al."Fast algorithm for parallel solving inversion of large scale small matrices based on GPU". Journal of Supercomputing 79.16(2023): 18313-18339.
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Xuebin, Jin]的文章
[Yewang, Chen]的文章
[Wentao, Fan]的文章
百度学术
百度学术中相似的文章
[Xuebin, Jin]的文章
[Yewang, Chen]的文章
[Wentao, Fan]的文章
必应学术
必应学术中相似的文章
[Xuebin, Jin]的文章
[Yewang, Chen]的文章
[Wentao, Fan]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。