Bigram

Known as: Skipping bigrams, Skipping bigram, Bigram frequency attack

A bigram or digram is a sequence of two adjacent elements from a string of tokens, which are typically letters, syllables, or words. A bigram is an n…

Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.

2011

Gender Classification for Web Forums

Yulei ZhangYan DangHsinchun Chen
IEEE Transactions on Systems, Man, and…
2011
Corpus ID: 41453648

More and more women are participating in and exchanging opinions through community-based online social media. Questions…

Highly Cited

2011

Highly Cited

2011

Automated Whole Sentence Grammar Correction Using a Noisy Channel Model

Automated grammar correction techniques have seen improvement over the years, but there is still much room for increased…

2010

Parsing Word Clusters

Marie CanditoDjamé Seddah
SPMRL@NAACL-HLT
2010
Corpus ID: 1283622

We present and discuss experiments in statistical parsing of French, where terminal forms used during training and parsing are…

Highly Cited

2005

Highly Cited

2005

Using Syntactic and Semantic Relation Analysis in Question Answering

Renxu SunJing JiangYee Fan TanH. CuiTat-Seng ChuaMin-Yen Kan
Text Retrieval Conference
2005
Corpus ID: 33023199

Our participation at TREC thi integrating dependency and analysis of external resources int system. In TREC-13, we have p…

Highly Cited

2005

Highly Cited

2005

Universal text preprocessing for data compression

Several preprocessing algorithms for text files are presented which complement each other and which are performed prior to the…

2004

Auto-induced semantic classes

A. PargellisE. Fosler-LussierChin-Hui LeeA. PotamianosAugustine Tsai
Speech Communication
2004
Corpus ID: 205221891

2004

Using Language Models for Text Classification

Jing BaiJian-Yun Nie
2004
Corpus ID: 10792263

paper describes an approach to text classification using language models. This approach is a natural extension of the traditional…

Highly Cited

2002

Highly Cited

2002

Stochastic natural language generation for spoken dialog systems

Highly Cited

1999

Highly Cited

1999

Automatic generation of multiple pronunciations based on neural networks

Toshiaki FukadaTakayoshi YoshimuraY. Sagisaka
Speech Communication
1999
Corpus ID: 18863705

1997

Phrase Discovery for English and Cross-language Retrieval at TREC 6

In our TREC 6 experiments for the main tasks and tracks, Berkeley worked primarily on extending our probabilistic document…

Bigram

Related topics

Broader (1)

Papers overview