Multilingual Clustering of Streaming News

@inproceedings{Miranda2018MultilingualCO,
  title={Multilingual Clustering of Streaming News},
  author={Sebasti{\~a}o Miranda and Arturs Znotins and Shay B. Cohen and Guntis Barzdins},
  booktitle={EMNLP},
  year={2018}
}
Clustering news across languages enables efficient media monitoring by aggregating articles from multilingual sources into coherent stories. Doing so in an online setting allows scalable processing of massive news streams. To this end, we describe a novel method for clustering an incoming stream of multilingual documents into monolingual and crosslingual clusters. Unlike typical clustering approaches that report results on datasets with a small and known number of labels, we tackle the problem…
7 Citations
Batch Clustering for Multilingual News Streaming
Event-Driven News Stream Clustering using Entity-Aware Contextual Embeddings
Tanbih: Get To Know What You Are Reading
Training with Streaming Annotation
Russian News Clustering and Headline Selection Shared Task
