Chinese Journal on Internet of Things ›› 2022, Vol. 6 ›› Issue (4): 1-13.doi: 10.11959/j.issn.2096-3750.2022.00306

• Theory and Technology •     Next Articles

Reinforcement learning-based real-time video streaming control and on-device training research

Huanhuan ZHANG, Anfu ZHOU, Huadong MA   

  1. Beijing University of Posts and Telecommunications, Beijing Key Lab of Intelligent Telecommunication Software and Multimedia, Beijing 100876, China
  • Revised:2022-10-17 Online:2022-12-30 Published:2022-12-01
  • Supported by:
    The National Natural Science Foundation of China(61921003);The China National Postdoctoral Program for Innovative Talents(BX20220046)

Abstract:

Service platforms centered on the Internet of things and mobile Internet are in accelerating process.Hundreds of millions of end-users communicate through network real-time video services, which have become an irreplaceable core tool in human’s digital life.However, the Internet is becoming dynamic, and heterogeneous, which imposes stringent requirements on real-time video streaming control technology.Moreover, the QoE of real-time video is not satisfactory.An adaptive reinforcement learning-based video intelligent transmission algorithm was designed, which can deal with heterogeneous network environment.And then, an effective end-to-end on-device training framework was designed to decrease server overhead, and a detailed evaluation and analysis on the neural network design and structure was provided.Experimental results show that the proposed algorithm can effectively predict heterogeneous network bandwidth, and reduces the bandwidth prediction error by 48.48%, comparing with the representative streaming control algorithm.The effective bandwidth prediction can further improve the user QoE, such as improving the video fluency by 60.65%, and improving the video quality by 16.52%.Besides, the analysis can provide empirical insights for further study, and holds potential to push the development of intelligent video applications.

Key words: real-time video, adaptive streaming control, quality-of-experience, reinforcement learning, on-device training

CLC Number: 

No Suggested Reading articles found!