Corpus ID: 60441374

PAC-Bayes Analysis of Sentence Representation

  title={PAC-Bayes Analysis of Sentence Representation},
  author={Kento Nozawa and I. Sato},
  • Kento Nozawa, I. Sato
  • Published 2019
  • Computer Science, Mathematics
  • ArXiv
  • Learning sentence vectors from an unlabeled corpus has attracted attention because such vectors can represent sentences in a lower dimensional and continuous space. Simple heuristics using pre-trained word vectors are widely applied to machine learning tasks. However, they are not well understood from a theoretical perspective. We analyze learning sentence vectors from a transfer learning perspective by using a PAC-Bayes bound that enables us to understand existing heuristics. We show that… CONTINUE READING


    Supervised Learning of Universal Sentence Representations from Natural Language Inference Data
    • 1,123
    • PDF
    Distributed Representations of Sentences and Documents
    • 5,622
    • Highly Influential
    • PDF
    Skip-Thought Vectors
    • 1,606
    • PDF
    Efficient Estimation of Word Representations in Vector Space
    • 16,444
    • Highly Influential
    • PDF
    Enriching Word Vectors with Subword Information
    • 4,332
    • PDF
    Advances in Pre-Training Distributed Word Representations
    • 528
    • PDF
    Learning Distributed Representations of Sentences from Unlabelled Data
    • 379
    • PDF
    Learning Word Vectors for Sentiment Analysis
    • 2,119
    • Highly Influential
    • PDF
    Natural Language Processing (Almost) from Scratch
    • 5,790
    • PDF
    No Training Required: Exploring Random Encoders for Sentence Classification
    • 58
    • PDF