Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 226,115,603 papers from all fields of science
Search
Sign In
Create Free Account
Text segmentation
Known as:
Chinese word segmentation
, Word segmentation
, Word splitting
Expand
Text segmentation is the process of dividing written text into meaningful units, such as words, sentences, or topics. The term applies both to mental…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
15 relations
Cluster analysis
Delimiter
Document classification
Hidden Markov model
Expand
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2007
Highly Cited
2007
Topic segmentation with shared topic detection and alignment of multiple documents
Bingjun Sun
,
P. Mitra
,
C. Lee Giles
,
J. Yen
,
H. Zha
Annual International ACM SIGIR Conference on…
2007
Corpus ID: 17839569
Topic detection and tracking and topic segmentation play an important role in capturing the local and sequential information of…
Expand
Highly Cited
2001
Highly Cited
2001
Report on CLEF-2001 Experiments: Effective Combined Query-Translation Approach
J. Savoy
Conference and Labs of the Evaluation Forum
2001
Corpus ID: 9943803
In our first participation in clef retrieval tasks, the primary objective was to define a general stopword list for various…
Expand
Highly Cited
2001
Highly Cited
2001
Text analysis using local energy
Woei Chan
,
G. Coghill
Pattern Recognition
2001
Corpus ID: 14593040
Highly Cited
2000
Highly Cited
2000
Compound splitting and lexical unit recombination for improved performance of a speech recognition system for German parliamentary speeches
M. Larson
,
D. Willett
,
J. Köhler
,
G. Rigoll
Interspeech
2000
Corpus ID: 1788721
This paper proposes a novel combined compound splitting and phrase recombination method that optimizes the composition of the…
Expand
Highly Cited
1998
Highly Cited
1998
Text identification for document image analysis using a neural network
C. Strouthopoulos
,
N. Papamarkos
Image and Vision Computing
1998
Corpus ID: 1755656
Highly Cited
1997
Highly Cited
1997
Automatic separation of words in multi-lingual multi-script Indian documents
U. Pal
,
B. B. Chaudhuri
Proceedings of the Fourth International…
1997
Corpus ID: 7753713
In a multi-lingual country like India, a document may contain more than one script forms. For such a document it is necessary to…
Expand
Highly Cited
1994
Highly Cited
1994
Text segmentation in mixed-mode images
N. Chaddha
,
Rosen Sharma
,
Avneesh Agrawal
,
Anoop Gupta
Proceedings of 28th Asilomar Conference on…
1994
Corpus ID: 58358347
Block based algorithms have found widespread use in image and video compression. However, popular algorithms such as JPEG, which…
Expand
Highly Cited
1993
Highly Cited
1993
Segmentation of Fluent Speech into Words: Learning Models and the Role of Maternal Input
R. Aslin
1993
Corpus ID: 62503223
Two research strategies aimed at understanding how maternal speech input enables pre-productive infants to segment words from…
Expand
Highly Cited
1990
Highly Cited
1990
Reluctance motor with strong rotor anisotropy
D. Platt
Conference Record of the IEEE Industry…
1990
Corpus ID: 940585
A new rotor design is presented for the synchronous reluctance motor. It has something in common with the segmented and axially…
Expand
Highly Cited
1989
Highly Cited
1989
Phonemic Analysis: effects of word properties
R. Schreuder
,
Wim H. J. Bon
1989
Corpus ID: 54201004
The relation between performance in phonemic segmentation and reading and writing ability is discussed. Not much is known about…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE