Details of Research Outputs

TitleImplementation and analysis of block dense matrix decomposition on network-on-chips
Creator
Date Issued2012
Conference Name14th IEEE International Conference on High Performance Computing and Communications
Source PublicationProceedings of the 14th IEEE International Conference on High Performance Computing and Communications, HPCC-2012 - 9th IEEE International Conference on Embedded Software and Systems, ICESS-2012
Pages516-523
Conference DateJune 25-27, 2012
Conference PlaceLiverpool
Abstract

The decomposition of a dense matrix into lower and upper triangular matrices is an important linear algebra kernel that used in scientific and engineering applications. To decompose large matrices efficiently, the matrix is divided into sub-matrices as blocks. The block matrix decomposition is introduced for parallel hardware platforms, e.g. supercomputers, multicore processors and GPUs. Recently, the Network-on-Chip (NoC) paradigm is proposed as a promising multicore architecture for future Chip Multiprocessors (CMPs) with hundreds or even thousands of cores. The communication bottleneck of traditional bus or crossbar based on-chip interconnect is alleviated in the NoC architecture. However, the implementation and analysis of parallel block matrix decomposition in a NoC platform has not been well addressed. We design an NoC platform based on state-of-the-art systems. A block matrix decomposition algorithm is implemented on the NoC platform. Evaluation results are presented using a cycle accurate full system simulator. We achieve parallel efficiency of 74.8% with a 64-node NoC, which outperforms other three multiprocessor systems (30.5%, 67% and 50% respectively). We also analyzed the impact of block size, cache behavior and network pressure of the platform. © 2012 IEEE.

DOI10.1109/HPCC.2012.76
URLView source
Indexed ByCPCI-S
Language英语English
WOS Research AreaComputer Science;Engineering
WOS SubjectComputer Science; Theory & Methods; Engineering;Electrical & Electronic
WOS IDWOS:000310377500067
Scopus ID2-s2.0-84870435948
Citation statistics
Cited Times:3[WOS]   [WOS Record]     [Related Records in WOS]
Document TypeConference paper
Identifierhttp://repository.uic.edu.cn/handle/39GCC9TT/9317
CollectionResearch outside affiliated institution
Corresponding AuthorXu, Thomas Canhao
Affiliation
Department of Information Technology, University of Turku, 20014, Turku, Finland Turku Center for Computer Science (TUCS), Joukahaisenkatu 3-5 B, 20520, Turku, Finland
Recommended Citation
GB/T 7714
Xu, Thomas Canhao,Pahikkala, Tapio,Airola, Anttiet al. Implementation and analysis of block dense matrix decomposition on network-on-chips[C], 2012: 516-523.
Files in This Item:
There are no files associated with this item.
Related Services
Usage statistics
Google Scholar
Similar articles in Google Scholar
[Xu, Thomas Canhao]'s Articles
[Pahikkala, Tapio]'s Articles
[Airola, Antti]'s Articles
Baidu academic
Similar articles in Baidu academic
[Xu, Thomas Canhao]'s Articles
[Pahikkala, Tapio]'s Articles
[Airola, Antti]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Xu, Thomas Canhao]'s Articles
[Pahikkala, Tapio]'s Articles
[Airola, Antti]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.