Chinese Journal of Network and Information Security ›› 2021, Vol. 7 ›› Issue (4): 86-100.doi: 10.11959/j.issn.2096-109x.2021052
• TopicⅠ: Network Security: Attack and Defense • Previous Articles Next Articles
Wei GAO1,2, Junren LUO1, Weilin YUAN1, Wanpeng ZHANG1
Revised:
2020-09-25
Online:
2021-08-15
Published:
2021-08-01
Supported by:
CLC Number:
Wei GAO, Junren LUO, Weilin YUAN, Wanpeng ZHANG. Survey of intention recognition for opponent modeling[J]. Chinese Journal of Network and Information Security, 2021, 7(4): 86-100.
[1] | SUKTHANKAR G . Plan,activity,and intent recognition:theory and practice[R]. 2014. |
[2] | CHAKRABORTI T , KAMBHAMPATI S , SCHEUTZ M ,et al. AI challenges in human-robot cognitive teaming[J]. arXiv preprint arXiv:1707.04775, 2017. |
[3] | ALBRECHT S V , STONE P . Autonomous agents modelling other agents:a comprehensive survey and open problems[J]. Artificial Intelligence, 2018,258: 66-95. |
[4] | HEINZE C . Modelling intention recognition for intelligent agent systems[R]. 2004. |
[5] | LE GUILLARME N . A game-theoretic planning framework for intentional threat assessment[D]. Thèse de doctorat:Université de Caen, 2016. |
[6] | BIGELOW D . Intent recognition in multi-agent domains[M]. University of Nevada,Reno, 2013. |
[7] | STROUSE D J , KLEIMAN-WEINER M , TENENBAUM J ,et al. Learning to share and hide intentions using information regularization[C]// Advances in Neural Information Processing Systems. 2018: 10249-10259. |
[8] | CHAKRABORTI T , KULKARNI A , SREEDHARAN S ,et al. Explicability legibility predictability transparency privacy security the emerging landscape of interpretable agent behavior[C]// Proceedings of the International Conference on Automated Planning and Scheduling. 2019: 86-96. |
[9] | KEREN S , GAL A , KARPAS E . Privacy preserving plans in partially observable environments[C]// IJCAI. 2016: 3170-3176. |
[10] | WRIGHT J R . Modeling human behavior in strategic settings[D]. Columbia:University of British Columbia, 2016. |
[11] | PLONSKY O , APEL R , ERT E ,et al. Predicting human decisions with behavioral theories and machine learning[J]. arXiv preprint arXiv:1904.06866, 2019. |
[12] | BORGHETTI B J . Opponent modeling in interesting adversarial environments[M]. Minnesota: University of Minnesota, 2008. |
[13] | BROWNE C B , POWLEY E , WHITEHOUSE D ,et al. A survey of monte carlo tree search methods[J]. IEEE Transactions on Computational Intelligence and AI in Games, 2012,4(1): 1-43. |
[14] | CHAKRABORTY D , STONE P . Multiagent learning in the presence of memory-bounded agents[J]. Autonomous Agents and Multi-Agent Systems, 2014,28(2): 182-213. |
[15] | KOLODNER J . Case-based reasoning[M]. Morgan Kaufmann, 2014. |
[16] | CARMEL D , MARKOVITCH S . Learning models of intelligent agents[C]// AAAI/IAAI. 1996: 62-67. |
[17] | BAARSLAG T , HENDRIKX M J C , HINDRIKS K V ,et al. Learning about the opponent in automated bilateral negotiation:a comprehensive survey of opponent modeling techniques[J]. Autonomous Agents and Multi-Agent Systems, 2016,30(5): 849-898. |
[18] | BARRETT S , STONE P , KRAUS S ,et al. Teamwork with limited knowledge of teammates[C]// Twenty-Seventh AAAI Conference on Artificial Intelligence. 2013. |
[19] | ALBRECHT S V , CRANDALL J W , RAMAMOORTHY S . An empirical study on the practical impact of prior beliefs over policy types[C]// Twenty-Ninth AAAI Conference on Artificial Intelligence. 2015. |
[20] | ALBRECHT S V , RAMAMOORTHY S . On convergence and optimality of best-response learning with policy types in multiagent systems[J]. arXiv preprint arXiv:1907.06995, 2019. |
[21] | SCHADD F , BAKKES S , SPRONCK P . Opponent modeling in real-time strategy games[C]// GAMEON. 2007: 61-70. |
[22] | WEN Y , YANG Y , LU R ,et al. Multi-agent generalized recursive reasoning[J]. arXiv preprint arXiv:1901.09216, 2019. |
[23] | WEN Y , YANG Y , LUO R ,et al. Probabilistic recursive reasoning for multi-agent reinforcement learning[J]. arXiv preprint arXiv:1901.09207, 2019. |
[24] | DOSHI P , ZENG Y , CHEN Q . Graphical models for interactive POMDPs:representations and solutions[J]. Autonomous Agents and Multi-Agent Systems, 2009,18(3): 376. |
[25] | TORKAMAN A , SAFABAKHSH R . Robust opponent modeling in real-time strategy games using bayesian networks[J]. Journal of AI and Data Mining, 2019,7(1): 149-159. |
[26] | MAO W , GRATCH J , LI X . Probabilistic plan inference for group behavior prediction[J]. IEEE Intelligent Systems, 2012,27(4): 27-36. |
[27] | HAUSKNECHT M , MUPPARAJU P , SUBRAMANIAN S ,et al. Half field offense:an environment for multiagent learning and ad hoc teamwork[C]// AAMAS Adaptive Learning Agents (ALA) Workshop. 2016. |
[28] | ?O?I? A . Learning models of behavior from demonstration and through interaction[D]. Technische Universit?t, 2018. |
[29] | HERNANDEZ-LEAL P , ZHAN Y , TAYLOR M E ,et al. Efficiently detecting switches against non-stationary opponents[J]. Autonomous Agents and Multi-Agent Systems, 2017,31(4): 767-789. |
[30] | ALBRECHT S V , RAMAMOORTHY S . Are you doing what i think you are doing? criticising uncertain agent models[J]. arXiv preprint arXiv:1907.01912, 2019. |
[31] | WANG Z , BOULARIAS A , MüLLING K ,, et al . Balancing safety and exploitability in opponent modeling[C]// Twenty-Fifth AAAI Conference on Artificial Intelligence. 2011. |
[32] | STANESCU A M . Outcome prediction and hierarchical models in real-time strategy games[R]. 2019. |
[33] | MOURAD M , AREF M , ABD-ELAZIZ M , . Opponent models pre-processing in real-time strategy games[J]. International Journal of Intelligent Computing and Information Sciences, 2016,16(3): 37-45. |
[34] | SUKTHANKAR G R . Activity recognition for agent teams[R]. 2007. |
[35] | FREEDMAN R G , ZILBERSTEIN S . A unifying perspective of plan,activity,and intent recognition[C]// Proceedings of the AAAI Workshops:Plan,Activity,Internet Recognition. 2019: 1-8. |
[36] | RAMíREZ M , GEFFNER H . Plan recognition as planning[C]// Twenty-First International Joint Conference on Artificial Intelligence. 2009. |
[37] | SOHRABI S , RIABOV A V , UDREA O . Plan recognition as planning revisited[C]// IJCAI. 2016: 3258-3264. |
[38] | PEREIRA R F , OREN N , MENEGUZZI F . Landmark-based approaches for goal recognition as planning[J]. arXiv preprint arXiv:1904.11739, 2019. |
[39] | AINETO D , JIMéNEZ S ,, ONAINDIA E , et al . Model recognition as planning[C]// Proceedings of the International Conference on Automated Planning and Scheduling. 2019: 13-21. |
[40] | ANG S , CHAN H , JIANG A X ,et al. Game-theoretic goal recognition models with applications to security domains[C]// International Conference on Decision and Game Theory for Security. 2017: 256-272. |
[41] | LE GUILLARME N , MOUADDIB A I , LEROUVREUR X ,et al. A generative game-theoretic framework for adversarial plan recognition[C]// JFPDA 2015. 2015. |
[42] | LI J , REN T , SU H ,et al. Learn a robust policy in adversarial games via playing with an expert opponent[C]// Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems. 2019: 2096-2098. |
[43] | ?O?I? A . Learning models of behavior from demonstration and through interaction[D]. Technische Universit?t, 2018. |
[44] | ZIEBART B D . Modeling purposeful adaptive behavior with the principle of maximum causal entropy[D]. Figshare, 2010. |
[45] | TASTAN B , . Learning human motion models[C]// Eighth Artificial Intelligence and Interactive Digital Entertainment Conference. 2012. |
[46] | GAURAV S , ZIEBART B . Discriminatively learning inverse optimal control models for predicting human intentions[C]// Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems. 2019: 1368-1376. |
[47] | LI X , YANG W , ZHANG Z . A unified framework for regularized reinforcement learning[J]. arXiv preprint arXiv:1903.00725, 2019. |
[48] | TIAN Z , WEN Y , GONG Z ,et al. A regularized opponent model with maximum entropy objective[J]. arXiv preprint arXiv:1905.08087, 2019. |
[49] | MAYNARD M , DUHAMEL T , KABANZA F . Cost-based goal recognition meets deep learning[J]. arXiv preprint arXiv:1911.10074, 2019. |
[50] | WOOKHEE M , YOUNG E H , ROWE J . Deep learning-based goal recognition in open-ended digital games[C]// Tenth AAAI Conference on Artificial Intelligence & Interactive Digital Entertainment. 2014. |
[51] | DUHAMEL T , MAYNARD M , KABANZA F . A transfer learning method for goal recognition exploiting cross-domain spatial fea-tures[J]. arXiv preprint arXiv:1911.10134, 2019. |
[52] | THIBAULT D , MAYNARD M , KABANZA F . Imagination-augmented deep learning for goal recognition[J]. arXiv preprint arXiv:2003.09529v1, 2020. |
[53] | BLAYLOCK N , ALLEN J . Fast hierarchical goal schema recognition[C]// Proceedings of the National Conference on Artificial Intelligence. 2006:796. |
[54] | VERED M , KAMINKA G A . Heuristic online goal recognition in continuous domains[C]// International Joint Conference on Artificial Intelligence. 2017: 4447-4454. |
[55] | VERED M , KAMINKA G A , BIHAM S . Online goal recognition through mirroring:Humans and agents[C]// The Fourth Annual Conference on Advances in Cognitive Systems. 2016. |
[56] | VERED M , KAMINKA G A . Online recognition of navigation goals through goal mirroring[C]// Proceedings of the 16th Conference on Autonomous Agents and Multiagent Systems.International Foundation for Autonomous Agents and Multiagent Systems. 2017: 1748-1750. |
[57] | VERED M , PEREIRA R F , MAGNAGUAGNO M C ,et al. Towards online goal recognition combining goal mirroring and landmarks[C]// AAMAS. 2018: 2112-2114. |
[58] | MASTERS P , SARDINA S . Cost-based goal recognition for the path-planning domain[C]// IJCAI. 2018: 5329-5333. |
[59] | MASTERS P , SARDINA S . Goal recognition for rational and irrational agents[C]// Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems.International Foundation for Autonomous Agents and Multiagent Systems. 2019: 440-448. |
[60] | HOFFMANN J , PORTEOUS J , SEBASTIA L . Ordered landmarks in planning[J]. Journal of Artificial Intelligence Research, 2004,22: 215-278. |
[61] | SCHMIDT C , SRIDHARAN N , GOODSON J . The plan recognition problem:an intersection of psychology and artificial intelligence[J]. Artificial Intelligence, 1978,11: 45-83. |
[62] | COHEN P R , PERRAULT C R , ALLEN J F . Beyond question answering[M]// Strategies for Natural Language Processing. Lawrence Erlbaum Associates, 1981. |
[63] | PENTNEY W , POPESCU A ,, WANG S , KAUTZ H ,et al. Sensor-based understanding of daily life via large-scale use of common sense[C]// Proceedings of AAAI. 2006. |
[64] | KEREN S , GAL A , KARPAS E . Goal recognition design[C]// Twenty-Fourth International Conference on Automated Planning and Scheduling. 2014. |
[65] | SON T C , SABUNCU O , Schulz-Hanke C ,, et al . Solving goal recognition design using ASP[C]// Thirtieth AAAI Conference on Artificial Intelligence. 2016. |
[66] | KEREN S , GAL A , KARPAS E ,et al. Goal recognition design for non-optimal agents[C]// National Conference on Artificial Intelligence. 2015: 3298-3304. |
[67] | KEREN S , GAL A , KARPAS E . Goal recognition design with non-observable actions[C]// Thirtieth AAAI Conference on Artificial Intelligence. 2016. |
[68] | KEREN S , GAL A , KARPAS E . Strong stubborn sets for efficient goal recognition design[C]// Twenty-Eighth International Conference on Automated Planning and Scheduling. 2018. |
[69] | SARAH K , AVIGDOR G , EREZ K . Goal recognition design in deterministic environments[J]. Journal of Artificial Intelligence Research, 2019,65: 209-269. |
[70] | KEREN S , KEREN S , GAL A ,et al. Equi-reward utility maximizing design in stochastic environments[C]// Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence. 2017: 4353-4360. |
[71] | WAYLLACE C , HOU P , YEOH W ,et al. Goal recognition design with stochastic agent action outcomes[C]// IJCAI. 2016. |
[72] | WAYLLACE C , HOU P , YEOH W . New Metrics and Algorithms for Stochastic Goal Recognition Design Problems[C]// IJCAI. 2017: 4455-4462. |
[73] | WAYLLACE C , KEREN S , YEOH W ,et al. Accounting for partial observability in stochastic goal recognition design:messing with the marauder’s map[C]// Proceedings of the 10th Workshop on Heuristics and Search for Domain-Independent Planning (HSDIP),Delft,The Netherlands. 2018: 33-41. |
[74] | RICHARD B . Dynamic Programming[M]. Princeton University Press, 1957. |
[75] | ROBERT T . Depth-first search and linear graph algorithms[J]. SIAM Journal on Computing, 1972,1(2): 146-160. |
[76] | MIRSKY R , STERN R , GAL Y ,et al. Plan recognition design[C]// Workshops at the Thirty First AAAI Conference on Artificial Intelligence. 2017. |
[77] | RAMIREZ M , GEFFNER H . Heuristics for planning,plan recognition and parsing[J]. arXiv preprint arXiv:1605.05807, 2016. |
[78] | MIRSKY R . Goal and plan recognition design for plan libraries[J]. ACM Transactions on Intelligent Systems and Technology (TIST), 2019,10(2): 14. |
[79] | PEREIRA R F , PEREIRA A G , MENEGUZZI F . Landmark-enhanced heuristics for goal recognition in incomplete domain models[C]// Proceedings of the International Conference on Automated Planning and Scheduling. 2019: 329-337. |
[80] | PEREIRA R F . Goal recognition over imperfect domain models[J]. arXiv preprint arXiv:2005.05712, 2020. |
[81] | AMATO C , BAISERO A . Active goal recognition[J]. arXiv preprint arXiv:1909.11173, 2019. |
[82] | ZHANG T , . Solving large scale linear prediction problems using stochastic gradient descent algorithms[C]// Proceedings of the International Conference on Machine Learning (ICML). 2004. 919-926. |
[83] | ZHUO H H . Recognizing multi-agent plans when action models and team plans are both incomplete[J]// ACM Transactions on Intelligent Systems and Technology, 2019,10(3): 1-24. |
[84] | POZANCO A , MARTIN Y E , FERNANDEZ S ,et al. Counterplanning using Goal Recognition and Landmarks[C]// International Joint Conference on Artificial Intelligence. 2018: 4808-4814. |
[85] | GADEPALLY V , GOODWIN J , KEPNER J ,et al. AI enabling technologies:a survey[J]. arXiv preprint arXiv:1905.03592, 2019. |
[86] | CHAKRABORTI T , KULKARNI A , SREEDHARAN S ,et al. Explicability legibility predictability transparency privacy security the emerging landscape of interpretable agent behavior[C]// Proceedings of the International Conference on Automated Planning and Scheduling. 2019: 86-96. |
[87] | SREEDHARAN S , KAMBHAMPATI S . Balancing explicability and explanation in human-aware planning[C]// 2017 AAAI Fall Symposium Series. 2017. |
[88] | BANERJEE B , KRAEMER L , LYLE J ,et al. Multi-agent plan recognition:formalization and algorithms[C]// National Conference on Artificial Intelligence, 2010: 1059-1064. |
[89] | ZHUO H H . Multiagent plan recognition from partially observed team traces[J]. Plan,Activity,and Intent Recognition, 2014: 227-249. |
[90] | ARGENTA C , DOYLE J . Multi-agent plan recognition as planning (MAPRAP)[C]// International Conference on Agents and Artificial Intelligence. 2016: 141-14 |
[91] | ALEXANDER K , WILLIAM M.McEneaney . Adversarial reasoning:computational approaches to reading the opponent's mind[R]. 2006. |
[92] | KEREN S , Gal A , KARPAS E . Goal recognition design-survey[C]// Twenty-Ninth International Joint Conference on Artificial Intelligence and Seventeenth Pacific Rim International Conference on Artificial Intelligence (IJCAI-PRICAI-20). 2020. |
[1] | Jinyin CHEN, Rongchang LI, Guohan HUANG, Tao LIU, Haibin ZHENG, Yao CHENG. Survey on vertical federated learning: algorithm, privacy and security [J]. Chinese Journal of Network and Information Security, 2023, 9(2): 1-20. |
[2] | Dian LIN, Li PAN, Ping YI. Research on the robustness of convolutional neural networks in image recognition [J]. Chinese Journal of Network and Information Security, 2022, 8(3): 111-122. |
[3] | Jinyin CHEN, Changan WU, Haibin ZHENG. Novel defense based on softmax activation transformation [J]. Chinese Journal of Network and Information Security, 2022, 8(2): 48-63. |
[4] | Yu ZHANG, Hailiang LI. RSA-based image recognizable adversarial attack method [J]. Chinese Journal of Network and Information Security, 2021, 7(5): 40-48. |
[5] | Weilin YUAN, Zhiyong LIAO, Wei GAO, Tingting WEI, Junren LUO, Wanpeng ZAHNG, Jing CHEN. Survey on intelligent game of computer poker [J]. Chinese Journal of Network and Information Security, 2021, 7(5): 57-76. |
[6] | Zhenglong WANG, Baowen ZHANG. Survey of generative adversarial network [J]. Chinese Journal of Network and Information Security, 2021, 7(4): 68-85. |
[7] | Wei GU, Jian SHEN, Yongjun REN. Secure key-sharing algorithm based on smart grid [J]. Chinese Journal of Network and Information Security, 2021, 7(4): 141-146. |
[8] | Ximeng LIU,Lehui XIE,Yaopeng WANG,Xuru LI. Adversarial attacks and defenses in deep learning [J]. Chinese Journal of Network and Information Security, 2020, 6(5): 36-53. |
[9] | Pei WANG,Yan JIA,Aiping LI,Qianyue JIANG. De-anonymiation method for networks based on DeepLink [J]. Chinese Journal of Network and Information Security, 2020, 6(4): 104-108. |
[10] | Qinrang LIU,Chongyang LIU,Jun ZHOU,Xiaolong WANG. Based on linear systolic array for convolutional neural network’s calculation optimization and performance analysis [J]. Chinese Journal of Network and Information Security, 2018, 4(12): 16-24. |
[11] | Jialin WANG, Jiqiang LIU, Di ZHAO, Yingdi WANG, Yingxiao XIANG, Tong CHEN, Endong TONG, Wenjia NIU. Intrusion detection model based on non-symmetric convolution auto-encode and support vector machine [J]. Chinese Journal of Network and Information Security, 2018, 4(11): 57-68. |
[12] | Yi-zhang JIANG,Shi-tong WANG. Research on privacy-preserving learning machines [J]. Chinese Journal of Network and Information Security, 2016, 2(6): 22-31. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||
|