Latexify Math: Mathematical Formula Markup Revision to Assist Collaborative Editing in Math Q&A Sites

  title={Latexify Math: Mathematical Formula Markup Revision to Assist Collaborative Editing in Math Q\&A Sites},
  author={Suyu Ma and Chunyang Chen and Hourieh Khalajzadeh and John C. Grundy},
  journal={Proceedings of the ACM on Human-Computer Interaction},
  pages={1--24}
  • Suyu Ma, Chunyang Chen, Hourieh Khalajzadeh, John C. Grundy
  • Published 20 September 2021
  • Computer Science
  • Proceedings of the ACM on Human-Computer Interaction
Collaborative editing of questions and answers plays an important role in quality control on Mathematics Stack Exchange, a math Q&A site. Our study of post edits on Mathematics Stack Exchange shows that a large number of math-related edits involve latexifying formulas, revising LaTeX, and converting blurred screenshots of math formulas into LaTeX sequences. Despite its importance, manually editing a math-related post, especially one with complex mathematical formulas, is time-consuming…


Predicting Collaborative Edits of Questions and Answers in Online Q&A Sites
A framework that predicts whether questions and answers need to be collaboratively edited just after they are posted; it mainly extracts features from questions, answers, and posters, and adopts machine learning techniques for prediction.
Image to Latex
Converting images of mathematical formulas to LaTeX code is a problem that combines challenges from both computer vision and natural language processing, close to the recent breakthroughs in image…
Editing Unfit Questions in Q&A
Examination of participants editing unfit questions on Stack Overflow finds that early edits come from high-reputation users who do not participate as a questioner or answerer, indicating that these users work to retain certain questions.
What You Get Is What You See: A Visual Markup Decompiler
A general-purpose, deep learning-based system to decompile an image into presentational markup that employs a convolutional network for text and layout recognition in tandem with an attention-based neural machine translation system.
Automated Query Reformulation for Efficient Search Based on Query Logs From Stack Overflow
An automated software-specific query reformulation approach based on deep learning is proposed that outperforms five state-of-the-art baselines and achieves a 5.6% to 33.5% boost in terms of ExactMatch and a 4.8% to 14.4% boost in terms of GLEU.
Image To Latex with DenseNet Encoder and Joint Attention
The encoder is improved by employing a densely connected convolutional network (DenseNet), which can strengthen feature extraction and facilitate gradient propagation, improving the performance of formula analysis.
Is It Good to Be Like Wikipedia?: Exploring the Trade-offs of Introducing Collaborative Editing Model to Q&A Sites
By examining five years of archival data from Stack Overflow, the study finds that the benefits of collaborative editing outweigh its risks, with implications for understanding and designing large-scale social computing systems.
Domain-specific machine translation with recurrent neural network for software localization
The results show that the proposed neural-network-based translation model outperforms the general machine translation tool, Google Translate, and generates more acceptable translations for software localization with less need for human revision.
Neural Quality Estimation of Grammatical Error Correction
This work proposes the first neural approach to automatic quality estimation of GEC output sentences that does not employ any hand-crafted features, and shows that a state-of-the-art GEC system can be improved when quality scores are used as features for re-ranking the N-best candidates.
Data-Driven Proactive Policy Assurance of Post Quality in Community Q&A Sites
A Convolutional Neural Network-based approach that learns editing patterns from historical post edits to predict whether a post needs editing is developed and evaluated; it provides a proactive policy assurance mechanism that warns users of potential quality issues in a post before it is posted.