
Byte pair encoding

Known as: Byte pair compression, Digram coding, Dual tile encoding 
Byte pair encoding or digram coding is a simple form of data compression in which the most common pair of consecutive bytes of data is replaced with… 
Wikipedia
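
The replacement step in the definition above can be illustrated with a minimal sketch. It assumes one simplification: each merged pair gets a fresh integer symbol starting at 256, instead of reusing a byte value that does not occur in the data. The function names (`bpe_compress`, `bpe_decompress`) and the `max_merges` parameter are hypothetical and chosen only for illustration.

```python
from collections import Counter


def most_common_pair(symbols):
    """Return (pair, count) for the most frequent adjacent pair, or (None, 0)."""
    # Note: overlapping occurrences (e.g. "aaa") are counted, a simplification.
    counts = Counter(zip(symbols, symbols[1:]))
    if not counts:
        return None, 0
    return counts.most_common(1)[0]


def replace_pair(symbols, pair, new_symbol):
    """Replace non-overlapping occurrences of `pair`, scanning left to right."""
    out = []
    i = 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(new_symbol)
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return out


def bpe_compress(data: bytes, max_merges: int = 10):
    """Repeatedly replace the most common adjacent pair with a new symbol."""
    symbols = list(data)
    table = {}          # new symbol -> the pair it stands for
    next_symbol = 256   # assumption: new symbols live outside the byte range
    for _ in range(max_merges):
        pair, count = most_common_pair(symbols)
        if pair is None or count < 2:
            break       # no pair repeats, so a further merge would not help
        table[next_symbol] = pair
        symbols = replace_pair(symbols, pair, next_symbol)
        next_symbol += 1
    return symbols, table


def bpe_decompress(symbols, table):
    """Undo the merges, expanding the most recently created symbols first."""
    for sym in sorted(table, reverse=True):
        a, b = table[sym]
        expanded = []
        for s in symbols:
            if s == sym:
                expanded.extend((a, b))
            else:
                expanded.append(s)
        symbols = expanded
    return bytes(symbols)


data = b"aaabdaaabac"
compressed, table = bpe_compress(data)
assert bpe_decompress(compressed, table) == data  # the round trip is lossless
```

The replacement table has to be stored alongside the compressed symbols, since decompression depends on it.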

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2020
While many BERT-based cross-modal pre-trained models produce excellent results on downstream understanding tasks like image-text… 
2020
In this paper, we aim to do code completion based on implementing a Neural Network from Li et al. Our contribution is that we… 
2019
The amount of user-generated content in cyberspace keeps increasing in the 21st century. However, it has also meant an… 
2017
We classify the ergodic invariant random subgroups of block-diagonal limits of symmetric groups in the cases when the groups are… 
2017
Byte pair encoding (BPE) is an approach that segments the corpus in such a way that frequent sequences of characters are combined… 
2017
Many recent password guessing algorithms based on the Probabilistic Context-Free Grammars (PCFGs) model brought significant… 
2017
Factored neural machine translation (FNMT) is founded on the idea of using the morphological and grammatical decomposition of the… 
2016
Most existing Neural Machine Translation models use groups of characters or whole words as their unit of input and output. We… 
2016
We explore the use of segments learnt using Byte Pair Encoding (referred to as BPE units) as basic units for statistical machine… 
2010
In this paper, a new lossless data compression method that is based on digram coding is introduced. This data compression method…