MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
MALib is a scalable and efficient computing framework for population-based multi-agent reinforcement learning. It enables efficient code reuse and flexible deployment across different distributed computing paradigms, and achieves throughput above 40K FPS on a single machine with 32 CPU cores.
Towards Efficient Discrete Integration via Adaptive Quantile Queries
- Fan Ding, Hanjing Wang, Ashish Sabharwal, Yexiang Xue
- Computer Science, European Conference on Artificial Intelligence
- 13 October 2019
AdaWISH is proposed, which is able to obtain the same guarantee but accesses only a small subset of queries of WISH, and has a regret of only O(log n) relative to an idealistic oracle that issues queries at data-dependent optimal points.
Uncertainty-Guided Probabilistic Transformer for Complex Action Recognition
- Hongjian Guo, Hanjing Wang, Q. Ji
- Computer Science, Computer Vision and Pattern Recognition
- 1 June 2022
This paper proposes a novel training strategy that introduces a majority model and a minority model based on epistemic uncertainty, and achieves state-of-the-art performance under both sufficient and insufficient data.
AdaWISH: Faster Discrete Integration via Adaptive Quantiles
- Fan Ding, Hanjing Wang, Ashish Sabharwal, Yexiang Xue
- Computer Science, ArXiv
- 13 October 2019
AdaWISH is proposed, which is able to obtain the same guarantee, but accesses only a small subset of queries of WISH, and has a regret of no more than $O(\log n)$ relative to an oracle that issues queries at data-dependent optimal points.
Variational message passing neural network for Maximum-A-Posteriori (MAP) inference
- Zijun Cui, Hanjing Wang, Tian Gao, Kartik Talamadupula, Qiang Ji
- Computer Science, Conference on Uncertainty in Artificial…
- 2022
A variational message passing neural network (V-MPNN) is proposed that leverages both the power of neural networks in modeling complex functions and the well-established algorithmic theory of variational belief propagation.