Chrome Extension
WeChat Mini Program
Use on ChatGLM

Adaptive multi-layer reward based Q-learning for source tracing in diffusive molecular communications environment with obstacles

Zhibo Lou,Wence Zhang,Xu Bao

NANO COMMUNICATION NETWORKS(2023)

Cited 0|Views14
No score
Abstract
Benefiting by the fast development of nanotechnology, molecular communication (MC) has received great attention in recent years. In many potential applications of MC, such as drug delivery and pollution prevention, it is essential to locate or trace the target. In this paper, we consider a 3D diffusive MC environment consisting of several obstacles, a molecule-releasing source (RS) and a mobile molecule sensor (MS) which aims to find the RS within a time constraint. The problem is reformulated using Markov Decision Process (MDP) and an adaptive multi-layer reward based Q-Learning (AMR-Q Learning) approach is proposed. By exploiting information from the number of received molecules and adaptively setting multi-layer rewards, MS with AMR-Q Learning can find the RS efficiently, unlike the gradient based method which is usually trapped in locally optimal points. Numerical results demonstrate that the proposed AMR-Q Learning approach outperforms existing path-planning schemes with significantly reduced training overhead.
More
Translated text
Key words
Molecular communication via diffusion,Source tracing,Q-learning,Multi-layer reward
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined