Journal article

Sequential Learning for Multi-Channel Wireless Network Monitoring with Channel Switching Costs

Abstract

We consider the problem of optimally assigning $p$ sniffers to $K$ channels to monitor transmission activity in a multichannel wireless network with switching costs. User activity is initially unknown to the sniffers and must be learned while channel assignment decisions are being made, so maximizing the benefit of the assignment involves the fundamental tradeoff between exploration and exploitation. Switching costs are incurred whenever sniffers change their channel assignments, which makes frequent changes undesirable. We formulate sniffer-channel assignment with switching costs as a linear partial monitoring problem, a superclass of multiarmed bandits. Because the number of arms (sniffer-channel assignments) is exponential, novel techniques are needed for efficient learning. We use the linear bandit model to capture the dependency among the arms and develop a policy that takes advantage of this dependency. We prove that the proposed Upper Confidence Bound (UCB)-based policy enjoys a logarithmic regret bound in time $t$ that depends sublinearly on the number of arms, while its total switching cost grows as $O(\log\log(t))$.
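As a rough illustration of the two ideas in the abstract (exploiting structure shared across assignments, and switching rarely), the following Python sketch may help. It is not the policy analyzed in the article. It assumes a simplified setting in which each sniffer directly observes whether its channel is active in a time slot (semi-bandit feedback), so the value of an assignment is the sum of per-channel activity rates, and it re-plans only at the start of epochs whose lengths double. The function name `run_sketch`, the activity probabilities, and the confidence-width constant are all hypothetical choices made for this example. Doubling epochs give a number of switches that grows logarithmically in the horizon, which is weaker than the $O(\log\log(t))$ switching cost stated in the abstract.

```python
import math
import random


def run_sketch(activity, p, horizon, seed=0):
    """Epoch-based UCB over channels (illustrative only).

    activity: assumed per-channel probabilities of being active in a slot.
    p: number of sniffers, each monitoring a distinct channel.
    Returns (total_reward, switches, final_assignment).
    """
    rng = random.Random(seed)
    K = len(activity)
    counts = [0] * K       # number of slots each channel was monitored
    means = [0.0] * K      # empirical activity estimate per channel
    assignment = list(range(p))  # arbitrary starting channels
    total_reward = 0.0
    switches = 0
    t = 0
    epoch_len = 1

    while t < horizon:
        # Per-channel UCB index. The index of an assignment is the sum of
        # its channels' indices, so the best assignment is simply the top-p
        # channels; no enumeration of all assignments is needed.
        def index(k):
            if counts[k] == 0:
                return float("inf")  # look at every channel at least once
            return means[k] + math.sqrt(2.0 * math.log(t + 1) / counts[k])

        ranked = sorted(range(K), key=index, reverse=True)
        new_assignment = sorted(ranked[:p])

        # Count how many sniffers must move to a channel not already covered.
        if t > 0:
            switches += len(set(new_assignment) - set(assignment))
        assignment = new_assignment

        # Hold the assignment fixed for the whole epoch.
        for _ in range(min(epoch_len, horizon - t)):
            for k in assignment:
                obs = 1.0 if rng.random() < activity[k] else 0.0
                counts[k] += 1
                means[k] += (obs - means[k]) / counts[k]
                total_reward += obs
            t += 1
        epoch_len *= 2  # doubling epochs keep re-planning rare

    return total_reward, switches, assignment


if __name__ == "__main__":
    reward, switches, final = run_sketch(
        [0.9, 0.1, 0.8, 0.2, 0.3, 0.05], p=2, horizon=2000, seed=1
    )
    print("reward:", reward, "switches:", switches, "final:", final)
```

With a horizon of 2000 slots the loop runs 11 epochs, so two sniffers can change channels at most 20 times in total, regardless of the random observations.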

Authors

Le T; Szepesvári C; Zheng R

Journal

IEEE Transactions on Signal Processing, Vol. 62, No. 22, pp. 5919–5929

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Publication Date

November 15, 2014

DOI

10.1109/tsp.2014.2357779

ISSN

1053-587X
