• Publications
  • Influence
Rapformer: Conditional Rap Lyrics Generation with Denoising Autoencoders
TLDR
Experimental results show that Rapformer is capable of generating technically fluent verses that offer a good trade-off between content preservation and style transfer, and a Turing-test-like experiment reveals that the method fools human lyrics experts 25% of the time.
Character-level Chinese-English Translation through ASCII Encoding
TLDR
This paper enables character-level NMT for Chinese, by breaking down Chinese characters into linguistic units similar to that of Indo-European languages by using the Wubi encoding scheme, which preserves the original shape and semantic information of the characters, while also being reversible.
Character-Level Translation with Self-attention
We explore the suitability of self-attention models for character-level neural machine translation. We test the standard transformer model, as well as a novel variant in which the encoder block
Large-scale Hierarchical Alignment for Author Style Transfer
TLDR
It is shown that pseudo-parallel sentences extracted from comparable corpora representative of two different author styles not only improve existing parallel data, but can even lead to competitive performance on their own.
Data-driven Summarization of Scientific Articles
TLDR
This work generates two novel multi-sentence summarization datasets from scientific articles and test the suitability of a wide range of existing extractive and abstractive neural network-based summarization approaches, demonstrating that scientific papers are suitable for data-driven text summarization.
Large-Scale Hierarchical Alignment for Data-driven Text Rewriting
TLDR
It is shown that pseudo-parallel sentences extracted with the proposed unsupervised method not only supplement existing parallel data, but can even lead to competitive performance on their own.
Abstractive Document Summarization without Parallel Data
TLDR
This work develops an abstractive summarization system that relies only on large collections of example summaries and non-matching articles, consisting of an unsupervised sentence extractor that selects salient sentences to include in the final summary, as well as a sentence abstractor that is trained on pseudo-parallel and synthetic data.
Conditional Rap Lyrics Generation with Denoising Autoencoders
TLDR
A method for automatically synthesizing a rap verse given an input text written in another form, such as a summary of a news article, to reconstruct rap lyrics from content words is developed.
Summary Refinement through Denoising
We propose a simple method for post-processing the outputs of a text summarization system in order to refine its overall quality. Our approach is to train text-to-text rewriting models to correct
...
1
2
...