Semantic Scholar
Charset detection
Known as: Codepage sniffing
Character encoding detection, charset detection, or code page detection is the process of heuristically guessing the character encoding of a series…
Related topics
13 relations
Browser sniffing
Bush hid the facts
Character encoding
Content sniffing
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2012
POS Annotated 50M Corpus of Tajik Language
Gulshan Dovudov, Vít Suchomel, Pavel Smerk
2012
Corpus ID: 57397626
The paper presents by far the largest available computer corpus of the Tajik language, comprising more than 50 million words. To…
2007
Automatic Detection of Character Encoding and Language
Seung-Ho Kim, Jongsoo Park
2007
Corpus ID: 14539503
The Internet is full of textual content in various languages and character encodings, and their communication across the…