Details of Research Outputs

Status: Published
Title: IN3: A Framework for In-Network Computation of Neural Networks in the Programmable Data Plane
Creator: Zhang, Xiaoquan; Cui, Lin; Tso, Fung Po; Li, Wenzhi; Jia, Weijia
Date Issued: 2024-04-01
Source Publication: IEEE Communications Magazine
ISSN: 0163-6804
Volume: 62, Issue: 4, Pages: 96-102
Abstract

Neural networks have been widely used in networking applications due to their high accuracy and generalization. However, the traditional approach of collecting network features from switches and transmitting them to the controller introduces high traffic overhead and extra communication latency. In-network computing (INC) mitigates this issue by running computing tasks directly in the network, on the data path, using programmable data planes (PDP). However, it is challenging to embed more sophisticated computing tasks, such as neural networks, in the network because of the limited computation and storage resources of PDP. To address this challenge, we propose IN3, a framework that enables complete neural network inference in PDP. IN3 uses model compression techniques to reduce the memory and computational requirements of a given neural network. Additionally, a purpose-built data plane pipeline for per-flow feature computation and inference is proposed. We implemented a testbed prototype based on the Intel Tofino ASIC, and experimental results demonstrate that IN3 effectively reduces memory usage while significantly decreasing inference time. IN3 demonstrates the feasibility of implementing neural networks in PDP, and we identify potential future research directions for this issue.

DOI: 10.1109/MCOM.001.2300587
Language: English
Scopus ID: 2-s2.0-85190748543
Document Type: Journal article
Identifier: http://repository.uic.edu.cn/handle/39GCC9TT/11461
Collection: Faculty of Science and Technology
Corresponding Author: Cui, Lin
Affiliation
1. Jinan University, China
2. Loughborough University, United Kingdom
3. Beijing Normal University, China
4. BNU-HKBU United International College, China
Recommended Citation
GB/T 7714 Zhang, Xiaoquan, Cui, Lin, Tso, Fung Po, et al. IN3: A Framework for In-Network Computation of Neural Networks in the Programmable Data Plane[J]. IEEE Communications Magazine, 2024, 62(4): 96-102.
APA Zhang, Xiaoquan, Cui, Lin, Tso, Fung Po, Li, Wenzhi, & Jia, Weijia. (2024). IN3: A Framework for In-Network Computation of Neural Networks in the Programmable Data Plane. IEEE Communications Magazine, 62(4), 96-102.
MLA Zhang, Xiaoquan, et al. "IN3: A Framework for In-Network Computation of Neural Networks in the Programmable Data Plane." IEEE Communications Magazine 62.4 (2024): 96-102.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.