Corpus ID: 216867622

TLDR: Extreme Summarization of Scientific Documents

@article{Cachola2020TLDRES,
  title={TLDR: Extreme Summarization of Scientific Documents},
  author={Isabel Cachola and Kyle Lo and Arman Cohan and Daniel S. Weld},
  journal={ArXiv},
  year={2020},
  volume={abs/2004.15011}
}
  • Isabel Cachola, Kyle Lo, +1 author Daniel S. Weld
  • Published 2020
  • Computer Science
  • ArXiv
  • We introduce TLDR generation for scientific papers, a new automatic summarization task with high source compression, requiring expert background knowledge and complex language understanding. To facilitate research on this task, we introduce SciTLDR, a dataset of 3.9K TLDRs. Furthermore, we introduce a novel annotation protocol for scalably curating additional gold summaries by rewriting peer review comments. We use this protocol to augment our test set, yielding multiple gold TLDRs for… CONTINUE READING

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 28 REFERENCES

    Headline Generation: Learning from Decomposable Document Titles

    VIEW 2 EXCERPTS

    Text Summarization with Pretrained Encoders

    VIEW 2 EXCERPTS

    Sentence Centrality Revisited for Unsupervised Summarization

    VIEW 3 EXCERPTS
    HIGHLY INFLUENTIAL