Tug-of-war Model for Competitive Multi-armed Bandit Problem: Amoeba-inspired Algorithm for Cognitive Medium Access
-
- Kim Song-Ju
- Flucto-Order Functions Research Team, RIKEN-HYU Collaboration Research Center, RIKEN Advanced Science Institute
-
- Aono Masashi
- Flucto-Order Functions Research Team, RIKEN-HYU Collaboration Research Center, RIKEN Advanced Science Institute
-
- Nameda Etsushi
- Flucto-Order Functions Research Team, RIKEN-HYU Collaboration Research Center, RIKEN Advanced Science Institute
-
- Hara Masahiko
- Flucto-Order Functions Research Team, RIKEN-HYU Collaboration Research Center, RIKEN Advanced Science Institute
説明
The “tug-of-war (TOW) model” is a unique parallel search algorithm for solving the multi-armed bandit problem (BP), which was inspired by the photoavoidance behavior of a single-celled amoeboid organism, the true slime mold Physarum polycephalum [1, 2, 3, 4, 5, 6]. “The cognitive medium access”, which refers to multiuser channel allocations in cognitive radio, can be interpreted as “competitive multi-armed bandit problem (CBP) [14].” Unlike the normal BP, the reward (free channel) probability of a channel selected by more than one user is evenly split between selecting users. In this study, we propose the “solid TOW (STOW) model” for the CBP toward developing cognitive medium access protocols in uncertain environments. The aim of this study is to explore how can the users achieve the “social maximum”, which is the most desirable state to obtain the maximum total score, in a decentralized manner. We show that the performance of the STOW model is higher than that of the well-known UCB1-tuned algorithm in many cases.
収録刊行物
-
- IEICE Proceeding Series
-
IEICE Proceeding Series 1 590-593, 2014-03-17
The Institute of Electronics, Information and Communication Engineers
- Tweet
キーワード
詳細情報 詳細情報について
-
- CRID
- 1390283687150829952
-
- ISSN
- 21885079
-
- 本文言語コード
- en
-
- データソース種別
-
- JaLC
- Crossref
- OpenAIRE
-
- 抄録ライセンスフラグ
- 使用不可