Paper Title
Towards Efficient Dynamic Uplink Scheduling over Multiple Unknown Channels
Authors
Abstract
Age-of-Information (AoI) is a critical metric for network applications. Existing works mostly address optimization with homogeneous AoI requirements, which rarely hold in practice. In this work, we optimize uplink scheduling for an access point (AP) over multiple unknown channels with heterogeneous AoI requirements defined by AoI-dependent costs. The AP serves $N$ users using $M$ channels without channel state information. Each channel serves only one user in each decision epoch. The objective is to minimize the time-averaged AoI-dependent cost plus the additional transmission cost over an infinite horizon. This decision-making problem can be formulated as a Markov decision process, but it is computationally intractable because the state space grows exponentially with the number of users. To address this challenge, we reformulate the problem as a variant of the restless multi-armed bandit (RMAB) problem and leverage Whittle's index theory to design an index-based scheduling policy. We derive an analytic formula for the indices, which reduces the computational overhead and facilitates online adaptation. Our numerical examples show that the index-based scheduling policy achieves performance comparable to the optimal policy and outperforms several other heuristics.
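The general shape of an index-based scheduler of the kind the abstract describes can be sketched as follows. This is only an illustration under stated assumptions, not the paper's algorithm: the abstract does not give the index formula, so `placeholder_index` is a hypothetical stand-in. The linear AoI cost `weight * age`, the per-user success probabilities, and the smoothed empirical estimate used in place of channel state information are likewise assumptions made for the example.

```python
import random


def placeholder_index(age, weight, p_success):
    # Hypothetical stand-in for a Whittle-style index. This is NOT the
    # analytic formula derived in the paper, which the abstract does not state.
    return weight * p_success * age * (age + 1) / 2.0


def schedule(ages, weights, p_hat, num_channels, tx_cost):
    # Rank users by index and keep at most M (num_channels) of them,
    # skipping any user whose index does not exceed the transmission cost.
    indices = [placeholder_index(a, w, p) for a, w, p in zip(ages, weights, p_hat)]
    ranked = sorted(range(len(ages)), key=lambda i: indices[i], reverse=True)
    return [i for i in ranked[:num_channels] if indices[i] > tx_cost]


def simulate(weights, p_true, num_channels, tx_cost, horizon, seed=0):
    # Assumed model: each scheduled transmission succeeds independently with
    # an unknown per-user probability p_true[i], learned online.
    rng = random.Random(seed)
    n = len(weights)
    ages = [1] * n
    attempts = [0] * n
    successes = [0] * n
    total_cost = 0.0
    for _ in range(horizon):
        # Laplace-smoothed success estimate, so no user's index is stuck at zero.
        p_hat = [(successes[i] + 1) / (attempts[i] + 2) for i in range(n)]
        chosen = schedule(ages, weights, p_hat, num_channels, tx_cost)
        # Per-epoch cost: assumed linear AoI cost plus a cost per transmission.
        total_cost += sum(w * a for w, a in zip(weights, ages))
        total_cost += tx_cost * len(chosen)
        for i in range(n):
            delivered = i in chosen and rng.random() < p_true[i]
            if i in chosen:
                attempts[i] += 1
                successes[i] += int(delivered)
            # AoI resets to 1 on a successful delivery, otherwise grows by 1.
            ages[i] = 1 if delivered else ages[i] + 1
    return total_cost / horizon
```

For instance, `simulate([1.0, 2.0, 1.0], [0.9, 0.5, 0.7], num_channels=2, tx_cost=0.5, horizon=1000)` returns the time-averaged cost for three users sharing two channels. The point of the sketch is that each decision needs only one index per user and a sort, rather than a search over the joint state space.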