物联网学报 ›› 2019, Vol. 3 ›› Issue (2): 56-63.doi: 10.11959/j.issn.2096-3750.2019.00097

• 理论与技术 • 上一篇    下一篇

基于深度强化学习的物联网智能路由策略

丁瑞金1,高飞飞1,邢玲2   

  1. 1 清华大学自动化系,北京 100084
    2 河南科技大学,河南 洛阳 471023
  • 修回日期:2019-03-07 出版日期:2019-06-30 发布日期:2019-07-17
  • 作者简介:丁瑞金(1994- ),男,江苏南通人,清华大学博士生,主要研究方向为AI及其在通信中的应用。|高飞飞(1980- ),男,陕西西安人,博士,清华大学副教授、博士生导师,主要研究方向为多天线通信以及智能信号处理技术。|邢玲(1978- ),女,四川成都人,博士,河南科技大学教授、博士生导师,主要研究方向为智能信息传输、计算机网络与多媒体技术。

Intelligent routing strategy in the Internet of things based on deep reinforcement learning

Ruijin DING1,Feifei GAO1,Ling XING2   

  1. 1 Department of Automation,Tsinghua University,Beijing 100084,China
    2 Henan University of Science and Technology,Luoyang 471023,China
  • Revised:2019-03-07 Online:2019-06-30 Published:2019-07-17

摘要:

随着物联网时代的到来,万物互联的传输模式引发数据量爆炸式增长,给传统路由协议带来了严峻挑战。阐述了在数据量急剧增长的情况下,已有路由协议的局限性,并将路由选择问题重新建模为马尔可夫决策过程。在此基础上,采用深度强化学习方法为每项数据传输任务选择下一跳路由器,从而在避免数据堵塞的前提下尽可能缩短数据传输路径长度。仿真结果表明,所提方法能够显著降低数据堵塞概率,增大网络吞吐量。

关键词: 深度强化学习, 路由, 物联网, 网络堵塞

Abstract:

At the era of the Internet of things,networking mode that connects everything would bring tremendous increase in the data volume and challenge the traditional routing protocols.The limitations of the existing routing protocols was analyzed when facing the data explosion and then the routing selection problem was re-modeled as a Markov decision process.On this basis,the deep reinforcement learning technique was utilized to choose the next-hop router for data transmission task in order to shorten the transmission path length while network congestion was avoided.The simulation results demonstrate that the congestion probability can be reduced significantly and the network throughput can be enhanced by the proposed strategy.

Key words: deep reinforcement learning, routing, Internet of things, network congestion

中图分类号: 

No Suggested Reading articles found!