Hierarchical Attention Networks for Document Classification

@inproceedings{Yang2016HierarchicalAN,
  title={Hierarchical Attention Networks for Document Classification},
  author={Zichao Yang and Diyi Yang and Chris Dyer and X. He and Alex Smola and E. Hovy},
  booktitle={HLT-NAACL},
  year={2016}
}
We propose a hierarchical attention network for document classification. Our model has two distinctive characteristics: (i) it has a hierarchical structure that mirrors the hierarchical structure of documents; (ii) it has two levels of attention mechanisms applied at the wordand sentence-level, enabling it to attend differentially to more and less important content when constructing the document representation. Experiments conducted on six large scale text classification tasks demonstrate that… Expand
2,565 Citations

Figures, Tables, and Topics from this paper

Hierarchical Classification with Hierarchical Attention Networks
  • Highly Influenced
  • PDF
Classify Sentence from Multiple Perspectives with Category Expert Attention Network
  • 1
Hierarchical Attentional Hybrid Neural Networks for Document Classification
  • 9
  • Highly Influenced
  • PDF
Label-Attentive Hierarchical Network for Document Classification
  • Highly Influenced
A Hierarchical Structured Self-Attentive Model for Extractive Document Summarization (HSSAS)
  • 44
  • PDF
Multilingual Hierarchical Attention Networks for Document Classification
  • 81
  • Highly Influenced
  • PDF
A Hierarchical Neural-Network-Based Document Representation Approach for Text Classification
  • 10
  • PDF
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 35 REFERENCES
Hierarchical Recurrent Neural Network for Document Modeling
  • 131
  • PDF
Stacked Attention Networks for Image Question Answering
  • 1,216
  • PDF
A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval
  • 497
  • PDF
Modeling Interestingness with Deep Neural Networks
  • 139
  • PDF
Recurrent Convolutional Neural Networks for Text Classification
  • 1,278
  • PDF
A Convolutional Neural Network for Modelling Sentences
  • 2,621
  • PDF
Convolutional Neural Networks for Sentence Classification
  • 8,053
  • PDF
Document Modeling with Gated Recurrent Neural Network for Sentiment Classification
  • 1,006
  • Highly Influential
  • PDF
A Hierarchical Neural Autoencoder for Paragraphs and Documents
  • 495
  • PDF
Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks
  • 2,094
  • PDF
...
1
2
3
4
...