Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 230,508,475 papers from all fields of science
Search
Sign In
Create Free Account
Text segmentation
Known as:
Chinese word segmentation
, Word segmentation
, Word splitting
Expand
Text segmentation is the process of dividing written text into meaningful units, such as words, sentences, or topics. The term applies both to mental…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
15 relations
Cluster analysis
Delimiter
Document classification
Hidden Markov model
Expand
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2012
2012
Reducing Approximation and Estimation Errors for Chinese Lexical Processing with Heterogeneous Annotations
Weiwei Sun
,
Xiaojun Wan
Annual Meeting of the Association for…
2012
Corpus ID: 470570
We address the issue of consuming heterogeneous annotation data for Chinese word segmentation and part-of-speech tagging. We…
Expand
2010
2010
An N-Gram-and-Wikipedia joint approach to Natural Language Identification
Xi Yang
,
Wenxin Liang
International Universal Communication Symposium
2010
Corpus ID: 20191900
Natural Language Identification is the process of detecting and determining in which language or languages a given piece of text…
Expand
2008
2008
Natural Language and Information Systems, 13th International Conference on Applications of Natural Language to Information Systems, NLDB 2008, London, UK, June 24-27, 2008, Proceedings
E. Kapetanios
,
V. Sugumaran
,
M. Spiliopoulou
International Conference on Applications of…
2008
Corpus ID: 35068266
2006
2006
Chinese Word Segmentation Based on Dictionary and Statistics
Zuo Wan-li
2006
Corpus ID: 63009673
Proposed a method based on dictionary integrated with statistics.The method uses the segmentation method based on dictionary in…
Expand
2005
2005
Chinese Word Segmentation with Multiple Postprocessors in HIT-IRLab
Huipeng Zhang
,
Ting Liu
,
Jinshan Ma
,
Xiantao Liao
International Joint Conference on Natural…
2005
Corpus ID: 29849813
This paper presents the results of the system IRLAS from HIT-IRLab in the Second International Chinese Word Segmentation Bakeoff…
Expand
Highly Cited
2001
Highly Cited
2001
Report on CLEF-2001 Experiments: Effective Combined Query-Translation Approach
J. Savoy
Conference and Labs of the Evaluation Forum
2001
Corpus ID: 9943803
In our first participation in clef retrieval tasks, the primary objective was to define a general stopword list for various…
Expand
Highly Cited
2000
Highly Cited
2000
Compound splitting and lexical unit recombination for improved performance of a speech recognition system for German parliamentary speeches
M. Larson
,
D. Willett
,
J. Köhler
,
G. Rigoll
Interspeech
2000
Corpus ID: 1788721
This paper proposes a novel combined compound splitting and phrase recombination method that optimizes the composition of the…
Expand
Highly Cited
1999
Highly Cited
1999
A Statistical Information Extraction System for Turkish
Gökhan Tür
,
Dilek Z. Hakkani-Tür
,
Kemal Oflazer
1999
Corpus ID: 13429290
Information Extraction (IE) is the process of analyzing natural language text or speech, and collecting information about…
Expand
Highly Cited
1994
Highly Cited
1994
Text segmentation in mixed-mode images
N. Chaddha
,
Rosen Sharma
,
Avneesh Agrawal
,
Anoop Gupta
Proceedings of 28th Asilomar Conference on…
1994
Corpus ID: 58358347
Block based algorithms have found widespread use in image and video compression. However, popular algorithms such as JPEG, which…
Expand
Highly Cited
1989
Highly Cited
1989
Phonemic Analysis: effects of word properties
R. Schreuder
,
Wim H. J. Bon
1989
Corpus ID: 54201004
The relation between performance in phonemic segmentation and reading and writing ability is discussed. Not much is known about…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE