An MDP Model for Censoring in Harvesting Sensors: Optimal and Approximated Solutions

📅 2015-01-14
🏛️ IEEE Journal on Selected Areas in Communications
📈 Citations: 19
Influential: 2
📄 PDF
🤖 AI Summary
This paper addresses energy-efficient information transmission for energy-harvesting sensors, aiming to maximize cumulative message utility (i.e., importance) under finite energy constraints. Method: We formulate the problem as an infinite-horizon Markov decision process (MDP) and—under a realistic battery dynamics model—rigorously prove that the optimal policy is a state-dependent importance-threshold truncation policy, where transmission decisions depend dynamically on the current battery level. Building on this structural insight, we propose a low-complexity, fast-converging model-driven stochastic approximation algorithm and benchmark it against Q-learning. Results: Experiments in both single-hop and multi-hop networks demonstrate that our algorithm significantly reduces computational overhead and accelerates convergence while achieving utility performance close to the theoretical optimum.

Technology Category

Application Category

📝 Abstract
In this paper, we propose a novel censoring policy for energy-efficient transmissions in energy-harvesting sensors. The problem is formulated as an infinite-horizon Markov Decision Process (MDP). The objective to be optimized is the expected sum of the importance (utility) of all transmitted messages. Assuming that such importance can be evaluated at the transmitting node, we show that, under certain conditions on the battery model, the optimal censoring policy is a threshold function on the importance value. Specifically, messages are transmitted only if their importance is above a threshold whose value depends on the battery level. Exploiting this property, we propose a model-based stochastic scheme that approximates the optimal solution, with less computational complexity and faster convergence speed than a conventional Q-learning algorithm. Numerical experiments in single-hop and multi-hop networks confirm the analytical advantages of the proposed scheme.
Problem

Research questions and friction points this paper is trying to address.

Energy Harvesting
Sensor Networks
Optimal Energy Allocation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Energy Harvesting Sensors
Reinforcement Learning
Dynamic Information Importance Threshold
🔎 Similar Papers
No similar papers found.
J
Jesus Fernandez-Bes
Dept. of Teoría de la Señal y Comunicaciones, Univ. Carlos III de Madrid, Avda. de la Universidad 30, Leganés, 28911, Madrid, Spain
Jesús Cid-Sueiro
Jesús Cid-Sueiro
Universidad Carlos III de Madrid, Spain
Machine learningbig data signal processingsensor networks
A
A. Marques
Dept. of Teoría de la Señal y Comunicaciones, Universidad Rey Juan Carlos de Madrid, Camino del Molino s/n, Fuenlabrada 28943, Madrid, Spain