TY - JOUR
T1 - Online Learning-Based Reconfigurable Antenna Mode Selection Exploiting Channel Correlation
AU - Zhao, Tianchi
AU - Li, Ming
AU - Pan, Yanjun
N1 - Funding Information:
This work was supported in part by Office of Naval Research Young Investigator Program (ONR YIP) under Award N00014-16-1-2650 and in part by the NSF under Grant CNS-1619728.
Publisher Copyright:
© 2002-2012 IEEE.
PY - 2021/10/1
Y1 - 2021/10/1
N2 - Reconfigurable antennas (RAs) emerged as a promising technology that can deal with channel variations and enhance the capacity and reliability of the wireless channel. To fully exploit the advantage of RAs, optimal antenna modes need to be selected in an online manner. However, the channel statistics are unknown a priori. Multi-armed bandit-based online learning algorithms were proposed to address this challenge, but the main drawback of existing approaches are that their regret scales linearly with the number of antenna modes, which converges slowly when the latter is large. To improve the scalability, we first apply an existing algorithm: Thompson sampling via Gaussian process (TS-GP), and propose two new algorithms for antenna mode selection: upper confidence bound with channel prediction (UCB-CP) and Thompson Sampling with channel prediction (TS-CP). TS-GP uses Gaussian prior to model the reward distribution of each antenna mode, as well as the correlation among them. UCB-CP and TS-CP exploit channel modeling to predict the channel conditions of unexplored antenna modes at each time step, by relating the correlation between different channel states to the underlying antenna modes. We prove the finite-time regret bound of UCB-CP and show that it is independent from the number of arms, when the expected channel estimation errors are small enough. We also extend the algorithms to the mobile setting. Both simulation results and real-world experiments show that all of our proposed learning algorithms can significantly improve the convergence rate and yield much lower regret (thus higher throughput) than existing schemes.
AB - Reconfigurable antennas (RAs) emerged as a promising technology that can deal with channel variations and enhance the capacity and reliability of the wireless channel. To fully exploit the advantage of RAs, optimal antenna modes need to be selected in an online manner. However, the channel statistics are unknown a priori. Multi-armed bandit-based online learning algorithms were proposed to address this challenge, but the main drawback of existing approaches are that their regret scales linearly with the number of antenna modes, which converges slowly when the latter is large. To improve the scalability, we first apply an existing algorithm: Thompson sampling via Gaussian process (TS-GP), and propose two new algorithms for antenna mode selection: upper confidence bound with channel prediction (UCB-CP) and Thompson Sampling with channel prediction (TS-CP). TS-GP uses Gaussian prior to model the reward distribution of each antenna mode, as well as the correlation among them. UCB-CP and TS-CP exploit channel modeling to predict the channel conditions of unexplored antenna modes at each time step, by relating the correlation between different channel states to the underlying antenna modes. We prove the finite-time regret bound of UCB-CP and show that it is independent from the number of arms, when the expected channel estimation errors are small enough. We also extend the algorithms to the mobile setting. Both simulation results and real-world experiments show that all of our proposed learning algorithms can significantly improve the convergence rate and yield much lower regret (thus higher throughput) than existing schemes.
KW - antenna mode selection
KW - channel estimation
KW - multi-armed bandit (MAB) problem
KW - Multifunctional and reconfigurable antennas (RAs)
KW - multipath channels
UR - http://www.scopus.com/inward/record.url?scp=85105867952&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85105867952&partnerID=8YFLogxK
U2 - 10.1109/TWC.2021.3076760
DO - 10.1109/TWC.2021.3076760
M3 - Article
AN - SCOPUS:85105867952
SN - 1536-1276
VL - 20
SP - 6820
EP - 6834
JO - IEEE Transactions on Wireless Communications
JF - IEEE Transactions on Wireless Communications
IS - 10
ER -