Corpus ID: 10353059

Automatic Argumentative-Zoning Using Word2vec

  title={Automatic Argumentative-Zoning Using Word2vec},
  author={Haixia Liu},
  • Haixia Liu
  • Published 29 March 2017
  • Computer Science
  • ArXiv
In comparison with document summarization on the articles from social media and newswire, argumentative zoning (AZ) is an important task in scientific paper analysis. [...] Key Method The learned word embeddings formed a feature space, to which the examined sentence is mapped to. Those features are input into the classifiers for supervised classification. Using 10-cross-validation scheme, evaluation was conducted on the Argumentative-Zoning (AZ) annotated articles. The results showed that simply averaging the…Expand
Word Embedding for Rhetorical Sentence Categorization on Scientific Articles
This research showed that Word2Vec CBOW performed better than Skip-Gram and GloVe in rhetorical sentence categorization of scientific articles by using word embedding to capture semantically similar words. Expand
ARTU / TU Wien and Artificial Researcher@ LongSumm 20
This approach is inspired by the concept of Argumentative Zoning (AZ) that de- fines the main rhetorical structure in scientific articles and utilizes a sentence selection algorithm described in previous literature to select sentences for the final summary that covers the two aforementioned aspects. Expand
Detecting Social and Behavioral Determinants of Health with Structured and Free-Text Clinical Data
It is suggested that combining clinical free-text notes and structured data provide the best approach in classifying patient SBDH status, which is most challenging among S BDH with low prevalence and high lexical diversity. Expand


Chinese comments sentiment classification based on word2vec and SVMperf
A method for sentiment classification based on word2vec and SVMperf is proposed, which trains faster and predicts more accurate than other SVM packages and can reach more than 90% accuracy. Expand
Support vector machines and Word2vec for text classification with semantic features
This work demonstrates the effectiveness ofword2vec by showing that tf-idf and word2vec combined can outperform tf-IDf because word2 Vec provides complementary features (e.g. semantics that TF-idF can't capture) to tf- idf. Expand
Argumentative zoning information extraction from scientific text
It is demonstrated that this type of text analysis can be used for generating usertailored and task-tailored summaries and for performing more informative citation analyses, and the application of a robust discourse and argumentation analysis can make text understanding techniques for practical document management more robust. Expand
Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification
Three neural networks are developed to effectively incorporate the supervision from sentiment polarity of text (e.g. sentences or tweets) in their loss functions and the performance of SSWE is improved by concatenating SSWE with existing feature set. Expand
A Multiclass-based Classification Strategy for Rethorical Sentence Categorization from Scientific Papers
This paper employsk-fold cross validation over training data to obtain the category-classifier mapping and then re-learn the classification model of the corresponding classifier using full training data on that particular category. Expand
Document-Word Co-regularization for Semi-supervised Sentiment Analysis
This paper proposes a novel semi-supervised sentiment prediction algorithm that utilizes lexical prior knowledge in conjunction with unlabeled examples based on joint sentiment analysis of documents and words based on a bipartite graph representation of the data. Expand
Ranking with Recursive Neural Networks and Its Application to Multi-Document Summarization
We develop a Ranking framework upon Recursive Neural Networks (R2N2) to rank sentences for multi-document summarization. It formulates the sentence ranking task as a hierarchical regression process,Expand
Distributed Representations of Sentences and Documents
Paragraph Vector is an unsupervised algorithm that learns fixed-length feature representations from variable-length pieces of texts, such as sentences, paragraphs, and documents, and its construction gives the algorithm the potential to overcome the weaknesses of bag-of-words models. Expand
Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection
This work introduces a method for paraphrase detection based on recursive autoencoders (RAE) and unsupervised RAEs based on a novel unfolding objective and learns feature vectors for phrases in syntactic trees to measure word- and phrase-wise similarity between two sentences. Expand
Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification
A unified framework to expand short texts based on word embedding clustering and convolutional neural network and semantic cliques via fast clustering is proposed, which validates the effectiveness of the proposed method on two open benchmarks. Expand