Constrained Reinforcement Learning Has Zero Duality Gap
- Santiago Paternain, Luiz F. O. Chamon, Miguel Calvo-Fullana, Alejandro Ribeiro
- Computer ScienceNeural Information Processing Systems
- 29 October 2019
This work provides theoretical support to primal-dual approaches to reinforcement learning by establishing that despite its non-convexity, this problem has zero duality gap and can be solved exactly in the dual domain, where it becomes convex.
Stability of Graph Scattering Transforms
- Fernando Gama, Joan Bruna, Alejandro Ribeiro
- Computer Science, MathematicsNeural Information Processing Systems
- 11 June 2019
This work extends scattering transforms to network data by using multiresolution graph wavelets, whose computation can be obtained by means of graph convolutions, and proves that the resulting graph scattering transforms are stable to metric perturbations of the underlying network.
Stability Properties of Graph Neural Networks
- Fernando Gama, Joan Bruna, Alejandro Ribeiro
- Computer ScienceIEEE Transactions on Signal Processing
- 11 May 2019
This work proves that graph convolutions with integral Lipschitz filters, in combination with the frequency mixing effect of the corresponding nonlinearities, yields an architecture that is both stable to small changes in the underlying topology, and discriminative of information located at high frequencies.
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation
- Harshat Kumar, Alec Koppel, Alejandro Ribeiro
- Computer ScienceArXiv
- 18 October 2019
This work puts forth a new variant of actor-critic that employs Monte Carlo rollouts during the policy search updates, which results in controllable bias that depends on the number of critic evaluations, providing insight into the interplay between optimization and generalization in reinforcement learning.
Safe Policies for Reinforcement Learning via Primal-Dual Methods
- Santiago Paternain, Miguel Calvo-Fullana, Luiz F. O. Chamon, Alejandro Ribeiro
- Computer ScienceArXiv
- 20 November 2019
It is established that primal-dual algorithms are able to find policies that are safe and optimal, and an ergodic relaxation of the safe-learning problem is proposed.
Network Newton-Part I: Algorithm and Convergence
- Aryan Mokhtari, Qing Ling, Alejandro Ribeiro
- Computer Science
- 23 April 2015
Convergence to a point close to the optimal argument at a rate that is at least linear is proven and the existence of a tradeoff between convergence time and the distance to the optimum argument is shown.
Graph Neural Networks for Decentralized Multi-Robot Path Planning
- Qingbiao Li, Fernando Gama, Alejandro Ribeiro, Amanda Prorok
- Computer ScienceIEEE/RJS International Conference on Intelligent…
- 12 December 2019
A combined model is proposed that automatically synthesizes local communication and decision-making policies for robots navigating in constrained workspaces that shows its capability to generalize to previously unseen cases.
Gated Graph Recurrent Neural Networks
- Luana Ruiz, Fernando Gama, Alejandro Ribeiro
- Computer ScienceIEEE Transactions on Signal Processing
- 1 February 2020
Graph Recurrent Neural Networks are introduced as a general learning framework that achieves this goal by leveraging the notion of a recurrent hidden state together with graph signal processing (GSP) and the number of learnable parameters is independent of the length of the sequence and of the size of the graph, guaranteeing scalability.
EdgeNets: Edge Varying Graph Neural Networks
- E. Isufi, Fernando Gama, Alejandro Ribeiro
- Computer ScienceIEEE Transactions on Pattern Analysis and Machine…
- 21 January 2020
A general framework that unifies state-of-the-art graph neural networks (GNNs) through the concept of EdgeNet is put forth and it is shown that GATs are GCNNs on a graph that is learned from the features, which opens the doors to develop alternative attention mechanisms for improving discriminatory power.
Probably Approximately Correct Constrained Learning
- Luiz F. O. Chamon, Alejandro Ribeiro
- Computer ScienceNeural Information Processing Systems
- 9 June 2020
It is proved that under mild conditions the empirical dual problem of constrained learning is also a PAC constrained learner that now leads to a practical constrained learning algorithm and is used to illustrate how constrained learning can address problems in fair and robust classification.
...
...