Big Data Research ›› 2021, Vol. 7 ›› Issue (5): 111-130.doi: 10.11959/j.issn.2096-0271.2021052

• COLUMN: DATA-DRIVEN OPTIMIZATION • Previous Articles     Next Articles

Combinatorial online learning based on optimizing feedbacks

Fang KONG1, Yueran YANG1, Wei CHEN2, Shuai LI1   

  1. 1 John Hopcroft Center for Computer Science, Shanghai Jiao Tong University, Shanghai 200240, China
    2 Microsoft Research Asia, Beijing 100080, China
  • Online:2021-09-15 Published:2021-09-01
  • Supported by:
    The National Natural Science Foundation of China(62006151);The National Natural Science Foundation of China(62076161);Shanghai Sailing Program

Abstract:

Combinatorial online learning studies how to learn the unknown parameters and gradually find the optimal combination of targets during the interactions with the environment.This problem has a wide range of applications including advertisement placement, searching and recommendation.Firstly, the definition of combinatorial online learning and its general framework – the problem of combinatorial multi-armed bandits were introduced, and its traditional algorithms and research progress were summarized.Then, the related works of two specific applications, online influence maximization and online learning to rank, were introduced.Finally, the prospective directions of further researches on combinatorial online learning were discussed.

Key words: combinatorial multi-armed bandits, online learning, online influence maximization, online learning to rank

CLC Number: 

No Suggested Reading articles found!