Neural Bellman-Ford Networks: A General Graph Neural Network Framework for Link Prediction
- Zhaocheng Zhu, Zuobai Zhang, Louis-Pascal Xhonneux, Jian Tang
- Computer ScienceNeural Information Processing Systems
- 13 June 2021
The Neural Bellman-Ford Network (NBFNet) is proposed, a general graph neural network framework that solves the path formulation with learned operators in the generalized BellMan-Ford algorithm.
Protein Representation Learning by Geometric Structure Pretraining
- Zuobai Zhang, Minghao Xu, Jian Tang
- Computer Science, BiologyArXiv
- 11 March 2022
Experimental results show that the proposed pretraining methods outperform or are on par with the state-of-the-art sequence-based methods, while using much less data.
TorchDrug: A Powerful and Flexible Machine Learning Platform for Drug Discovery
- Zhaocheng Zhu, Chence Shi, Jian Tang
- Computer ScienceArXiv
- 16 February 2022
TorchDrug is a powerful and flexible machine learning platform for drug discovery built on top of PyTorch, which benchmarks a variety of important tasks in drug discovery, including molecular property prediction, pretrained molecular representations, de novo molecular design and optimization, retrosynthsis prediction, and biomedical knowledge graph reasoning.
Neural-Symbolic Models for Logical Queries on Knowledge Graphs
- Zhaocheng Zhu, Mikhail Galkin, Zuobai Zhang, Jian Tang
- Computer ScienceInternational Conference on Machine Learning
- 16 May 2022
Experiments show that GNN-QE significantly improves over previous state-of-the-art models in answering FOL queries, and can predict the number of answers without explicit supervision, and provide visualizations for intermediate variables.
Learning interpretable cellular and gene signature embeddings from single-cell transcriptomic data
- Yifan Zhao, Huiyu Cai, Zuobai Zhang, Jian Tang, Yue Li
- Computer Science, BiologyNature Communications
- 15 January 2021
A new model to address challenges in scalability, model interpretability, and confounders of computational single-cell RNA-seq analyses is shown, by learning meaningful embeddings from the data that simultaneously refine gene signatures and cell functions in diverse conditions.
PEER: A Comprehensive and Multi-Task Benchmark for Protein Sequence Understanding
- Minghao Xu, Zuobai Zhang, Jian Tang
- Computer ScienceArXiv
- 5 June 2022
This paper proposes a benchmark called PEER, a comprehensive and multi-task benchmark for protein understanding tasks, and evaluates different types of sequence-based methods for each task including traditional feature engineering approaches, different sequence encoding methods as well as large-scale pre-trained protein language models.
Power-Law Graphs Have Minimal Scaling of Kemeny Constant for Random Walks
- Wanyue Xu, Y. Sheng, Zuobai Zhang, Haibin Kan, Zhongzhi Zhang
- Computer Science, MathematicsThe Web Conference
- 20 April 2020
A theoretically guaranteed estimation algorithm is presented, which approximates the Kemeny constant for a graph in nearly linear time with respect to the number of edges, and it is shown that this approximation algorithm is both efficient and accurate.
A Roadmap for Big Model
- Sha Yuan, Hanyu Zhao, Jie Tang
- Computer ScienceArXiv
- 26 March 2022
This paper discusses not only the BM technologies themselves but also the prerequisites for BM training and applications with BMs, dividing the BM review into four parts: Resource, Models, Key Technologies and Application.
Structured Multi-task Learning for Molecular Property Prediction
- Shengchao Liu, Meng Qu, Zuobai Zhang, Huiyu Cai, Jian Tang
- Computer ScienceInternational Conference on Artificial…
- 22 February 2022
A method called SGNN-EBM is proposed to systematically investigate the structured task modeling from two perspec-tives, which can be e-ciently trained through noise-contrastive estimation (NCE) approach.
Protein Structure Representation Learning by Geometric Pretraining
- Zuobai Zhang, Minghao Xu, Jian Tang
- Computer Science, Biology
- 2022
Experimental results show that the proposed pretraining methods outperform or are on par with the state-of-the-art sequence-based methods using much less data.
...
...