Multilingual Clustering of Streaming News

@inproceedings{Miranda2018MultilingualCO,
  title={Multilingual Clustering of Streaming News},
  author={Sebasti{\~a}o Miranda and Arturs Znotins and Shay B. Cohen and Guntis Barzdins},
  booktitle={EMNLP},
  year={2018}
}
Clustering news across languages enables efficient media monitoring by aggregating articles from multilingual sources into coherent stories. Doing so in an online setting allows scalable processing of massive news streams. To this end, we describe a novel method for clustering an incoming stream of multilingual documents into monolingual and crosslingual clusters. Unlike typical clustering approaches that report results on datasets with a small and known number of labels, we tackle the problem… Expand
Batch Clustering for Multilingual News Streaming
NewsEmbed: Modeling News through Pre-trained Document Representations
Russian News Clustering and Headline Selection Shared Task
Tanbih: Get To Know What You Are Reading
Training with Streaming Annotation

References

SHOWING 1-10 OF 21 REFERENCES
Unified analysis of streaming news
News Across Languages - Cross-Lingual Document Similarity and Event Tracking
Distributed Document and Phrase Co-embeddings for Descriptive Clustering
Massively Multilingual Word Embeddings
Software Framework for Topic Modelling with Large Corpora
A Framework for Clustering Massive Text and Categorical Data Streams
Distributed Representations of Sentences and Documents
Translation Invariant Word Embeddings
...
1
2
3
...