Explaining Predictions of Non-Linear Classifiers in NLP

@article{Arras2016ExplainingPO,
  title={Explaining Predictions of Non-Linear Classifiers in NLP},
  author={Leila Arras and F. Horn and Gr{\'e}goire Montavon and K. M{\"u}ller and W. Samek},
  journal={ArXiv},
  year={2016},
  volume={abs/1606.07298}
}
  • Leila Arras, F. Horn, +2 authors W. Samek
  • Published 2016
  • Computer Science, Mathematics
  • ArXiv
  • Layer-wise relevance propagation (LRP) is a recently proposed technique for explaining predictions of complex non-linear classifiers in terms of input variables. In this paper, we apply LRP for the first time to natural language processing (NLP). More precisely, we use it to explain the predictions of a convolutional neural network (CNN) trained on a topic categorization task. Our analysis highlights which words are relevant for a specific prediction of the CNN. We compare our technique to… CONTINUE READING
    Explaining Recurrent Neural Network Predictions in Sentiment Analysis
    • 138
    • PDF
    Explaining nonlinear classification decisions with deep Taylor decomposition
    • 475
    • PDF
    Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models
    • 371
    • PDF
    "What is relevant in a text document?": An interpretable machine learning approach
    • 120
    • PDF
    Is Attention Interpretable?
    • 67
    • PDF
    Evaluating Recurrent Neural Network Explanations
    • 19
    • PDF
    Evaluating neural network explanation methods using hybrid documents and morphosyntactic agreement
    • 21
    • Highly Influenced
    • PDF

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 27 REFERENCES
    Convolutional Neural Networks for Sentence Classification
    • 6,739
    • PDF
    Visualizing and Understanding Convolutional Networks
    • 8,307
    • PDF
    Natural Language Processing (Almost) from Scratch
    • 5,473
    • PDF
    On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
    • 1,077
    • PDF
    Visualizing and Understanding Neural Models in NLP
    • 373
    • PDF
    Efficient Estimation of Word Representations in Vector Space
    • 15,117
    • PDF
    Explaining nonlinear classification decisions with deep Taylor decomposition
    • 475
    • PDF
    Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps
    • 2,684
    • PDF
    Analyzing Classifiers: Fisher Vectors and Deep Neural Networks
    • 101
    • PDF