Fully Character-Level Neural Machine Translation without Explicit Segmentation

Challenge: computation. Computation is quadratic in the length of the source sentence. This is because the attention mechanism is used t times, where t is the length of the target sentence (which is usually proportional to the length of the source sentence), and each time the attention mechanism looks at the entire representation of the source sentence. Main…
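To make the quadratic growth concrete, here is a minimal sketch that counts how many source positions the attention mechanism visits in total. The function names and the `ratio` parameter (target length divided by source length) are hypothetical, chosen for illustration only. It assumes attention visits every source position once per target step, as described above, and it is not taken from the paper's implementation.

```python
def attention_score_count(source_len: int, target_len: int) -> int:
    """Count source positions visited when attention runs once per target step."""
    count = 0
    for _ in range(target_len):   # attention is used once per target step
        count += source_len       # each use looks at the entire source representation
    return count


def cost_for_source_length(source_len: int, ratio: float = 1.0) -> int:
    """Total count when target length is proportional to source length.

    `ratio` is an assumed target/source length ratio, not a measured value.
    """
    target_len = round(source_len * ratio)
    return attention_score_count(source_len, target_len)


if __name__ == "__main__":
    for n in (10, 20, 40):
        print(n, cost_for_source_length(n))
```

Under these assumptions, doubling the source length quadruples the count, which is why longer character-level sequences make attention more expensive than word-level ones.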



Semantic Scholar estimates that this publication has 123 citations based on the available data.
