网络与信息安全学报 ›› 2021, Vol. 7 ›› Issue (5): 57-76.doi: 10.11959/j.issn.2096-109x.2021087

• 专栏Ⅱ:机器学习及安全应用 • 上一篇    

计算机扑克智能博弈研究综述

袁唯淋, 廖志勇, 高巍, 魏婷婷, 罗俊仁, 张万鹏, 陈璟   

  1. 国防科技大学智能科学学院,湖南 长沙 410073
  • 修回日期:2021-03-06 出版日期:2021-10-01 发布日期:2021-10-01
  • 作者简介:袁唯淋(1994− ),男,云南曲靖人,国防科技大学博士生,主要研究方向为认知决策与智能博弈、对手建模、强化学习、多智能体系统
    廖志勇(1995− ),男,湖南永州人,国防科技大学硕士生,主要研究方向为知识图谱、智能决策、网络通信
    高巍(1996− ),女,辽宁开原人,国防科技大学硕士生,主要研究方向为对手建模、意图识别、弹道规划
    魏婷婷(1997− ),女,内蒙古鄂尔多斯人,国防科技大学硕士生,主要研究方向为智能体建模、智能博弈、多智能体强化学习
    罗俊仁(1989− ),男,湖北大冶人,国防科技大学博士生,主要研究方向为智能体建模、对抗团队博弈、多智能体强化学习
    张万鹏(1981− ),男,四川邛崃人,国防科技大学副研究员,主要研究方向为智能决策、任务规划、自动化和控制、人机协同
    陈璟(1972− ),男,江西南昌人,国防科技大学教授、博士生导师,主要研究方向为人工智能、智能决策、任务规划
  • 基金资助:
    国家自然科学基金(61702528);国家自然科学基金(61806212)

Survey on intelligent game of computer poker

Weilin YUAN, Zhiyong LIAO, Wei GAO, Tingting WEI, Junren LUO, Wanpeng ZAHNG, Jing CHEN   

  1. College of Intelligence Science and Technology, National University of Defense and Technology, Changsha 410073, China
  • Revised:2021-03-06 Online:2021-10-01 Published:2021-10-01
  • Supported by:
    The National Natural Science Foundation of China(61702528);The National Natural Science Foundation of China(61806212)

摘要:

计算机博弈是人工智能领域的“果蝇”,备受人工智能领域研究者的关注,已然成为研究认知智能的有利平台。扑克类博弈对抗问题可建模成边界确定、规则固定的不完美信息动态博弈,计算机扑克 AI 需要具备不完全信息动态决策、对手误导欺诈行为识别以及多回合筹码和风险管理等能力。首先梳理了以德州扑克为代表的计算机扑克智能博弈的发展历程,其次针对计算机扑克智能博弈典型模型算法、关键技术以及存在的主要问题进行了综述分析,最后探讨了计算机扑克智能博弈的未来发展趋势和应用前景。

关键词: 计算机扑克, 认知智能, 不完美信息博弈, 德州扑克, 虚拟遗憾最小化

Abstract:

Computer game is the drosophila in the field of artificial intelligence, which has attracted the attention of researchers in artificial intelligence, and has become an advantageous testbed for the research of cognitive intelligence.Poker game can be modeled as dynamic games with imperfect information, definite boundaries and fixed rules.Computer poker AI needs such abilities as dynamic decision-making with incomplete information, identification of misleading and fraudulent behaviors by opponents, and multi-round chips and risk management.Firstly , the development of computer poker game was introduced, which represented by Texas Hold’em poker.Then, typical intelligence game model algorithm, key techniques and existing main problems of computer poker were reviewed analysis.Finally, the future development trends and application prospect of computer intelligent poker game were discussed for cognitive intelligence.

Key words: computer poker, cognitive intelligence, imperfect information game, Texas Hold'em poker, Counterfactual regret minimization

中图分类号: 

No Suggested Reading articles found!