Hierarchical Attention Networks for Document Classification
@inproceedings{Yang2016HierarchicalAN,
  title={Hierarchical Attention Networks for Document Classification},
  author={Zichao Yang and Diyi Yang and Chris Dyer and X. He and Alex Smola and E. Hovy},
  booktitle={HLT-NAACL},
  year={2016}
}
We propose a hierarchical attention network for document classification. Our model has two distinctive characteristics: (i) it has a hierarchical structure that mirrors the hierarchical structure of documents; (ii) it has two levels of attention mechanisms applied at the word- and sentence-level, enabling it to attend differentially to more and less important content when constructing the document representation. Experiments conducted on six large scale text classification tasks demonstrate that…
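The two levels of attention described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it omits the recurrent encoders and learned projections a full model would use, scores each vector with a plain dot product against a context vector, and all names (`attention_pool`, `encode_document`, `word_context`, `sentence_context`) and the toy vectors are assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(vectors, context):
    """Return the attention-weighted average of `vectors` and the weights.

    Weights are the softmax of each vector's dot product with `context`
    (a simplification assumed here; no learned projection is applied).
    """
    scores = [sum(v_i * c_i for v_i, c_i in zip(v, context)) for v in vectors]
    weights = softmax(scores)
    dim = len(vectors[0])
    pooled = [sum(w * v[d] for w, v in zip(weights, vectors)) for d in range(dim)]
    return pooled, weights


def encode_document(document, word_context, sentence_context):
    """Pool words into sentence vectors, then sentences into a document vector."""
    sentence_vectors = []
    for sentence in document:
        sentence_vec, _ = attention_pool(sentence, word_context)  # word level
        sentence_vectors.append(sentence_vec)
    # Sentence level: attend over the sentence vectors built above.
    return attention_pool(sentence_vectors, sentence_context)


# Hypothetical toy document: 2 sentences of 2-dimensional word vectors.
document = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.5, 0.5], [1.0, 1.0], [0.0, 0.0]],
]
doc_vec, sentence_weights = encode_document(
    document, word_context=[1.0, 0.0], sentence_context=[0.0, 1.0]
)
```

The returned `sentence_weights` sum to 1 and indicate how much each sentence contributes to the document vector; the same holds for the word-level weights inside each sentence.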
2,503 Citations
Hierarchical Inter-Attention Network for Document Classification with Multi-Task Learning
- Computer Science
- IJCAI
- 2019
- 8
Classify Sentence from Multiple Perspectives with Category Expert Attention Network
- Computer Science
- 2018 International Joint Conference on Neural Networks (IJCNN)
- 2018
- 1
Hierarchical Attentional Hybrid Neural Networks for Document Classification
- Computer Science
- ICANN
- 2019
- 9
- Highly Influenced
Hierarchical Attention Networks for Different Types of Documents with Smaller Size of Datasets
- Computer Science
- RiTA
- 2018
- 1
A Hierarchical Structured Self-Attentive Model for Extractive Document Summarization (HSSAS)
- Computer Science
- IEEE Access
- 2018
- 44
Multilingual Hierarchical Attention Networks for Document Classification
- Computer Science
- IJCNLP
- 2017
- 78
- Highly Influenced
A Hierarchical Neural-Network-Based Document Representation Approach for Text Classification
- Computer Science
- 2018
- 9
References
Stacked Attention Networks for Image Question Answering
- Computer Science, Mathematics
- 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- 2016
- 1,188
A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval
- Computer Science
- CIKM
- 2014
- 486
Document Modeling with Gated Recurrent Neural Network for Sentiment Classification
- Computer Science
- EMNLP
- 2015
- 985
- Highly Influential
Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks
- Computer Science
- ACL
- 2015
- 2,056