Safe RAN control: A Symbolic Reinforcement Learning Approach

@article{Nikou2022SafeRC,
  title={Safe RAN control: A Symbolic Reinforcement Learning Approach},
  author={Alexandros Nikou and Anusha Mujumdar and Marin Orlic and Aneta Vulgarakis Feljan},
  journal={ArXiv},
  year={2022},
  volume={abs/2106.01977}
}
—In this paper, we present a Symbolic Reinforce- ment Learning (SRL) based architecture for safety control of Radio Access Network (RAN) applications. In particular, we provide a purely automated procedure in which a user can specify high-level logical safety specifications for a given cellular network topology in order for the latter to execute optimal safe performance which is measured through certain Key Performance Indicators (KPIs). The network consists of a set of fixed Base Stations (BS… 

Figures and Tables from this paper

References

SHOWING 1-10 OF 27 REFERENCES
Symbolic Reinforcement Learning for Safe RAN Control
TLDR
In the proposed architecture, network safety shielding is ensured through model-checking techniques over combined discrete system models (automata) that are abstracted through reinforcement learning.
Remote Electrical Tilt Optimization via Safe Reinforcement Learning
TLDR
This work model the RET optimization problem in the Safe Reinforcement Learning (SRL) framework with the goal of learning a tilt control strategy providing performance improvement guarantees with respect to a safe baseline, and leverage a recent SRL method, namely Safe Policy Improvement through Baseline Bootstrapping (SPIBB), to learn an improved policy from an offline dataset of interactions collected by the safe baseline.
Applications of Deep Reinforcement Learning in Communications and Networking: A Survey
TLDR
This paper presents a comprehensive literature review on applications of deep reinforcement learning (DRL) in communications and networking, and presents applications of DRL for traffic routing, resource sharing, and data collection.
Safe Reinforcement Learning for Antenna Tilt Optimisation using Shielding and Multiple Baselines
TLDR
A modular Safe Reinforcement Learning (SRL) architecture is proposed which is then used to address the RET optimisation in cellular networks and demonstrates improved performance of the SRL agent over the baseline while ensuring the safety of the performed actions.
Off-policy Learning for Remote Electrical Tilt Optimization
TLDR
This paper proposes CMAB learning algorithms to extract optimal tilt update policies from the data and trains and evaluates these policies on real-world 4G Long Term Evolution (LTE) cellular network data, showing consistent improvements over the rule-based logging policy used to collect the data.
Control Synthesis from Linear Temporal Logic Specifications using Model-Free Reinforcement Learning
We present a reinforcement learning (RL) frame-work to synthesize a control policy from a given linear temporal logic (LTL) specification in an unknown stochastic environment that can be modeled as a
Safe Reinforcement Learning via Formal Methods: Toward Safe Control Through Proof and Learning
TLDR
It is proved that the approach toward incorporating knowledge about safe control into learning systems preserves safety guarantees, and it is demonstrated that the empirical performance benefits provided by reinforcement learning are retained.
Survey of Model-Based Reinforcement Learning: Applications on Robotics
TLDR
It is argued that, by employing model-based reinforcement learning, the—now limited—adaptability characteristics of robotic systems can be expanded, and model- based reinforcement learning exhibits advantages that makes it more applicable to real life use-cases compared to model-free methods.
Self-optimization of coverage and capacity based on a fuzzy neural network with cooperative reinforcement learning
TLDR
This paper proposes self-optimization of antenna tilt and power using a fuzzy neural network optimization based on reinforcement learning (RL-FNN), a central control mechanism enables cooperation-based learning by allowing distributed SON entities to share their optimization experience, represented as the parameters of learning method.
A comprehensive survey on safe reinforcement learning
TLDR
This work categorize and analyze two approaches of Safe Reinforcement Learning, based on the modification of the optimality criterion, the classic discounted finite/infinite horizon, with a safety factor and the incorporation of external knowledge or the guidance of a risk metric.
...
...