Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 224,708,364 papers from all fields of science
Search
Sign In
Create Free Account
Bigram
Known as:
Skipping bigrams
, Skipping bigram
, Bigram frequency attack
Expand
A bigram or digram is a sequence of two adjacent elements from a string of tokens, which are typically letters, syllables, or words. A bigram is an n…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
22 relations
Automatic summarization
Banburismus
Cipher Department of the High Command of the Wehrmacht
Collocation
Expand
Broader (1)
Natural language processing
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2011
2011
Gender Classification for Web Forums
Yulei Zhang
,
Yan Dang
,
Hsinchun Chen
IEEE Transactions on Systems, Man, and…
2011
Corpus ID: 41453648
More and more women are participating in and exchanging opinions through community-based online social media. Questions…
Expand
Highly Cited
2011
Highly Cited
2011
Automated Whole Sentence Grammar Correction Using a Noisy Channel Model
Y. A. Park
,
R. Levy
Annual Meeting of the Association for…
2011
Corpus ID: 7784892
Automated grammar correction techniques have seen improvement over the years, but there is still much room for increased…
Expand
2010
2010
Parsing Word Clusters
Marie Candito
,
Djamé Seddah
SPMRL@NAACL-HLT
2010
Corpus ID: 1283622
We present and discuss experiments in statistical parsing of French, where terminal forms used during training and parsing are…
Expand
Highly Cited
2005
Highly Cited
2005
Using Syntactic and Semantic Relation Analysis in Question Answering
Renxu Sun
,
Jing Jiang
,
Yee Fan Tan
,
H. Cui
,
Tat-Seng Chua
,
Min-Yen Kan
Text Retrieval Conference
2005
Corpus ID: 33023199
Our participation at TREC thi integrating dependency and analysis of external resources int system. In TREC-13, we have p…
Expand
Highly Cited
2005
Highly Cited
2005
Universal text preprocessing for data compression
J. Abel
,
W. Teahan
IEEE transactions on computers
2005
Corpus ID: 32842375
Several preprocessing algorithms for text files are presented which complement each other and which are performed prior to the…
Expand
2004
2004
Auto-induced semantic classes
A. Pargellis
,
E. Fosler-Lussier
,
Chin-Hui Lee
,
A. Potamianos
,
Augustine Tsai
Speech Communication
2004
Corpus ID: 205221891
2004
2004
Using Language Models for Text Classification
Jing Bai
,
Jian-Yun Nie
2004
Corpus ID: 10792263
paper describes an approach to text classification using language models. This approach is a natural extension of the traditional…
Expand
Highly Cited
2002
Highly Cited
2002
Stochastic natural language generation for spoken dialog systems
Alice H. Oh
,
Alexander I. Rudnicky
Computer Speech and Language
2002
Corpus ID: 29211719
Highly Cited
1999
Highly Cited
1999
Automatic generation of multiple pronunciations based on neural networks
Toshiaki Fukada
,
Takayoshi Yoshimura
,
Y. Sagisaka
Speech Communication
1999
Corpus ID: 18863705
1997
1997
Phrase Discovery for English and Cross-language Retrieval at TREC 6
F. Gey
,
Aitao Chen
Text Retrieval Conference
1997
Corpus ID: 16639755
In our TREC 6 experiments for the main tasks and tracks, Berkeley worked primarily on extending our probabilistic document…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE