Skip to search formSkip to main contentSkip to account menu

Text segmentation

Known as: Chinese word segmentation, Word segmentation, Word splitting 
Text segmentation is the process of dividing written text into meaningful units, such as words, sentences, or topics. The term applies both to mental… 
Wikipedia (opens in a new tab)

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
2016
2016
This article proposes a novel character-aware neural machine translation (NMT) model that views the input sequences as sequences… 
2012
2012
We address the issue of consuming heterogeneous annotation data for Chinese word segmentation and part-of-speech tagging. We… 
2012
2012
Unknown words and word segmentation granularity are two main problems in Chinese word segmentation for ChineseJapanese Machine… 
2010
2010
Sentence-level aligned parallel texts are important resources for a number of natural language processing (NLP) tasks and… 
2008
2008
This paper presents a novel approach to improve Chinese word seg- mentation (CWS) that attempts to utilize unlabeled data such as… 
2006
2006
This paper describes an indexing system that automatically creates metadata for multimedia broadcast news content by integrating… 
Highly Cited
1999
Highly Cited
1999
Information Extraction (IE) is the process of analyzing natural language text or speech, and collecting information about… 
1997
1997
We investigate the effects of lexicon size and stopwords on Chinese information retrieval using our method of short-word… 
Highly Cited
1994
Highly Cited
1994
Block based algorithms have found widespread use in image and video compression. However, popular algorithms such as JPEG, which…