科研成果详情

发表状态已发表Published
题名IN3: A Framework for In-Network Computation of Neural Networks in the Programmable Data Plane
作者
发表日期2024-04-01
发表期刊IEEE Communications Magazine
ISSN/eISSN0163-6804
卷号62期号:4页码:96-102
摘要

Neural networks have been widely used in networking applications due to their high accuracy and generalization. However, the traditional approach of collecting network features from switches and transmitting them to the controller introduces high traffic overhead and extra communication latency. In-network computing (INC) mitigates this issue by running computing tasks directly in the networks on the data paths using programmable data planes (PDP). However, it is challenging to embed more sophisticated computing tasks, such as neural networks, in the networks due to the limitations in the computation and storage resources of PDP. To address this challenge, we propose IN3, a framework that enables complete neural network inference in PDP. IN3 uses model compression techniques to reduce the memory and computational requirements of given neural networks. Additionally, a purposely designed data plane pipeline for per-flow features computation and inference is proposed. We implemented a testbed prototype (based on Intel Tofino ASIC), and experimental results demonstrate that IN3 effectively reduces memory usage, while significantly decreasing the inference time. IN3 demonstrates the feasibility of implementing neural networks in PDP, and we identify potential future research directions for this issue.

DOI10.1109/MCOM.001.2300587
URL查看来源
语种英语English
Scopus入藏号2-s2.0-85190748543
引用统计
被引频次:1[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符https://repository.uic.edu.cn/handle/39GCC9TT/11461
专题理工科技学院
通讯作者Cui, Lin
作者单位
1.Jinan University,China
2.Loughborough University,United Kingdom
3.Beijing Normal University,China
4.BNU-HKBU United International College,China
推荐引用方式
GB/T 7714
Zhang, Xiaoquan,Cui, Lin,Tso, Fung Poet al. IN3: A Framework for In-Network Computation of Neural Networks in the Programmable Data Plane[J]. IEEE Communications Magazine, 2024, 62(4): 96-102.
APA Zhang, Xiaoquan, Cui, Lin, Tso, Fung Po, Li, Wenzhi, & Jia, Weijia. (2024). IN3: A Framework for In-Network Computation of Neural Networks in the Programmable Data Plane. IEEE Communications Magazine, 62(4), 96-102.
MLA Zhang, Xiaoquan,et al."IN3: A Framework for In-Network Computation of Neural Networks in the Programmable Data Plane". IEEE Communications Magazine 62.4(2024): 96-102.
条目包含的文件
条目无相关文件。
个性服务
查看访问统计
谷歌学术
谷歌学术中相似的文章
[Zhang, Xiaoquan]的文章
[Cui, Lin]的文章
[Tso, Fung Po]的文章
百度学术
百度学术中相似的文章
[Zhang, Xiaoquan]的文章
[Cui, Lin]的文章
[Tso, Fung Po]的文章
必应学术
必应学术中相似的文章
[Zhang, Xiaoquan]的文章
[Cui, Lin]的文章
[Tso, Fung Po]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。