PAWS: Paraphrase Adversaries from Word Scrambling
- Yuan Zhang, Jason Baldridge, Luheng He
- Computer ScienceNorth American Chapter of the Association for…
- 1 April 2019
PAWS (Paraphrase Adversaries from Word Scrambling), a new dataset with 108,463 well-formed paraphrase and non-paraphrase pairs with high lexical overlap, is introduced, providing an effective instrument for driving further progress on models that better exploit structure, context, and pairwise comparisons.
PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification
- Yinfei Yang, Y. Zhang, C. Tar, Jason Baldridge
- Computer ScienceConference on Empirical Methods in Natural…
- 1 August 2019
PAWS-X, a new dataset of 23,659 human translated PAWS evaluation pairs in six typologically distinct languages, shows the effectiveness of deep, multilingual pre-training while also leaving considerable headroom as a new challenge to drive multilingual research that better captures structure and contextual information.
Mind the GAP: A Balanced Corpus of Gendered Ambiguous Pronouns
- Kellie Webster, Marta Recasens, Vera Axelrod, Jason Baldridge
- Computer ScienceInternational Conference on Topology, Algebra and…
- 15 July 2018
GAP, a gender-balanced labeled corpus of 8,908 ambiguous pronoun–name pairs sampled, is presented and released to provide diverse coverage of challenges posed by real-world text and shows that syntactic structure and continuous neural models provide promising, complementary cues for approaching the challenge.
Supervised Text-based Geolocation Using Language Models on an Adaptive Grid
- Stephen Roller, Michael Speriosu, Sarat Rallapalli, Benjamin Wing, Jason Baldridge
- Computer ScienceConference on Empirical Methods in Natural…
- 12 July 2012
The adaptive grid achieves competitive results with a uniform grid on small training sets and outperforms it on the large Twitter corpus and the two grid constructions can also be combined to produce consistently strong results across all training sets.
Twitter Polarity Classification with Label Propagation over Lexical Links and the Follower Graph
- Michael Speriosu, Nikita Sudan, Sid Upadhyay, Jason Baldridge
- Computer ScienceULNLP@EMNLP
- 30 July 2011
Results on polarity classification for several datasets show that the label propagation approach rivals a model supervised with in-domain annotated tweets, and it outperforms the noisily supervised classifier it exploits as well as a lexicon-based polarity ratio classifier.
Simple supervised document geolocation with geodesic grids
- Benjamin Wing, Jason Baldridge
- GeologyAnnual Meeting of the Association for…
- 19 June 2011
This work investigates automatic geolocation (i.e. identification of the location, expressed as latitude/longitude coordinates) of documents and describes several simple supervised methods for document geolocated using only the document's raw text as evidence.
Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation
- Vihan Jain, Gabriel Ilharco, Alexander Ku, Ashish Vaswani, Eugene Ie, Jason Baldridge
- Computer ScienceAnnual Meeting of the Association for…
- 29 May 2019
This work highlights shortcomings of current metrics for the Room-to-Room dataset and proposes a new metric, Coverage weighted by Length Score (CLS), and shows that agents that receive rewards for instruction fidelity outperform agents that focus on goal completion.
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
- Jiahui Yu, Yuanzhong Xu, Yonghui Wu
- Computer ScienceArXiv
- 22 June 2022
The Pathways Autoregressive Text-to-Image (Parti) model is presented, which generates high-fidelity photorealistic images and supports content-rich synthesis involving complex compositions and world knowledge and explores and highlights limitations of the models.
Lexically specified derivational control in combinatory categorial grammar
- Jason Baldridge
- Linguistics
- 1 July 2002
This dissertation elaborates several refinements to the Combinatory Categorial Grammar (ccg) framework, and shows how the multi-modal perspective on grammatical composition provided by the logical tradition of categorial grammar can be incorporated into ccg’s rulebased approach.
Combinatory Categorial Grammar
- Mark Steedman, Jason Baldridge
- Linguistics
- 6 June 2011
Combinatory Categorial Grammar is a generalization of classical Categorial Grammar that is at the least greater level of expressive power than context-free grammar that has yet been identified. The…
...
...