Sequential Decision Making with Limited Observation Capability: Application to Wireless Networks

IEEE Transactions on Cognitive Communications and Networking (2019)

Abstract
This paper studies a generalized class of restless multi-armed bandits that have hidden states and allow cumulative feedback, as opposed to the conventional instantaneous feedback. We call them lazy restless bandits (LRBs) because decision-making events are sparser than state-transition events. Hence, the feedback after each decision event is the cumulative effect of the following state transition...
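The cumulative-feedback idea can be illustrated with a minimal sketch. It is not the paper's model or algorithm: the two-state (good/bad) arm, the reward of 1 per step spent in the good state, and the names `simulate_lazy_arm`, `p_stay_good`, `p_bad_to_good`, and `transitions_per_decision` are all assumptions made here for illustration. The sketch only shows that one decision is followed by several hidden state transitions and that the decision maker sees a single aggregated number.

```python
import random


def simulate_lazy_arm(start_state, p_stay_good, p_bad_to_good,
                      transitions_per_decision, rng):
    """Play one arm once and return (cumulative_feedback, final_state).

    Hypothetical model: state 1 is "good", state 0 is "bad". The state is
    hidden and changes several times between two consecutive decisions.
    """
    state = start_state
    cumulative_feedback = 0
    for _ in range(transitions_per_decision):
        # Probability of being in the good state after this transition.
        p_good = p_stay_good if state == 1 else p_bad_to_good
        state = 1 if rng.random() < p_good else 0
        # Assumed reward model: one unit of reward per step in the good state.
        cumulative_feedback += state
    return cumulative_feedback, state


if __name__ == "__main__":
    rng = random.Random(0)
    feedback, _ = simulate_lazy_arm(1, 0.9, 0.2, 5, rng)
    # Only the aggregate is observed, not the individual hidden states.
    print("cumulative feedback over 5 transitions:", feedback)
```

With `transitions_per_decision` set to 1, the sketch reduces to the conventional instantaneous-feedback setting, where each decision is followed by exactly one transition and one observation.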
Key words
Indexes, Relays, Decision making, Markov processes, Fading channels, Optimization, Productivity