Lexical analysis
Known as: Semicolon insertion, Token splitting, Lexical analyzer
In computer science, lexical analysis is the process of converting a sequence of characters (such as in a computer program or web page) into a…
Wikipedia
Related topics
49 relations
.NET Compiler Platform
ALGOL
ANTLR
Atari BASIC
Broader (1)
Programming language implementation
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2012
PoliMorf: a (not so) new open morphological dictionary for Polish
Marcin Woliński, M. Miłkowski, Maciej Ogrodniczuk, A. Przepiórkowski
International Conference on Language Resources…
2012
Corpus ID: 9683080
This paper presents preliminary results of an effort aiming at the creation of a morphological dictionary of Polish, PoliMorf…
Highly Cited
2007
The influence of semantic transparency on eye movements during English compound word recognition
B. Juhasz
2007
Corpus ID: 59646607
2007
Parsing Expression Grammar as a Primitive Recursive-Descent Parser with Backtracking
Roman R. Redziejowski
Fundamenta Informaticae
2007
Corpus ID: 16157510
Two recent developments in the field of formal languages are Parsing Expression Grammar (PEG) and packrat parsing. The PEG…
2006
Open Source Corpus Analysis Tools for Malay
Timothy Baldwin, Su'ad Awab
International Conference on Language Resources…
2006
Corpus ID: 15034074
Tokenisers, lemmatisers and POS taggers are vital to the linguistic and digital furtherment of any language. In this paper, we…
2006
A Historical Dictionary of Yukaghir
I. Nikolaeva
2006
Corpus ID: 60485380
Tundra and Kolyma Yukaghir are highly endangered languages spoken in the extreme North-East of Siberia and usually considered…
Highly Cited
2004
SpamBayes: Effective open-source, Bayesian based, email classification system
T. Meyer, Brendon Whateley
International Conference on Email and Anti-Spam
2004
Corpus ID: 2368172
This paper introduces the SpamBayes classification engine and outlines the most important features and techniques which…
2003
Language identification using parallel sub-word recognition
A. Jayram, V. Ramasubramanian, T. Sreenivas
IEEE International Conference on Acoustics…
2003
Corpus ID: 5269830
Parallel sub-word recognition (PSWR) is a new model that has been proposed for language identification (LID) which does not need…
Review
2001
Visual Programming in the Wild: A Survey of LabVIEW Programmers
Kirsten N. Whitley, A. Blackwell
Journal of Visual Languages and Computing
2001
Corpus ID: 40587121
As part of research into the cognitive effects of visual programming representations, a worldwide survey of LabVIEW programmers…
2001
Towards a Lexicographic Approach to Lexical Transfer in Machine Translation (Illustrated by the German–Russian Language Pair)
I. Mel'cuk, L. Wanner
Machine Translation
2001
Corpus ID: 34104511
The translation of lexical items is still a formidable obstacle in the field of Machine Translation. The present article addresses…
Highly Cited
1998
Phonemic transcription by analogy in text-to-speech synthesis: Novel word pronunciation and lexicon compression
P. Bagshaw
Computer Speech and Language
1998
Corpus ID: 34625536
Abstract The synthesis of speech from unrestricted text needs a phonemic transcription including syllabification and lexical…