Scientific Statement Classification over arXiv.org

Deyan Ginev and Bruce R. Miller
We introduce a new classification task for scientific statements and release a large-scale dataset for supervised learning. We demonstrate that the task setup aligns with known success rates from the state of the art, peaking at a 0.91 F1-score via a BiLSTM encoder-decoder model. Additionally, we introduce a lexeme serialization for mathematical formulas, and observe that context-aware models could improve when also trained on the symbolic modality. Finally, we discuss the limitations of both…

