基于策略约束强化学习的算网多目标优化研究
沈林江, 曹畅, 崔超, 张岩
Research on constrained policy reinforcement learning based multi-objective optimization of computing power network
Linjiang SHEN, Chang CAO, Chao CUI, Yan ZHANG
电信科学 . 2023, (8): 136 -148 .  DOI: 10.11959/j.issn.1000-0801.2023165