A Transformer-based Approach for Source Code Summarization
- Wasi Uddin Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang
- Computer ScienceAnnual Meeting of the Association for…
- 1 May 2020
This work explores the Transformer model that uses a self-attention mechanism and has shown to be effective in capturing long-range dependencies in source code summarization, and shows that despite the approach is simple, it outperforms the state-of-the-art techniques by a significant margin.
Unified Pre-training for Program Understanding and Generation
- Wasi Uddin Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang
- Computer ScienceNorth American Chapter of the Association for…
- 10 March 2021
Analysis reveals that PLBART learns program syntax, style, logical flow, and style that are crucial to program semantics and thus excels even with limited annotations, and outperforms or rivals state-of-the-art models.
Context Attentive Document Ranking and Query Suggestion
- Wasi Uddin Ahmad, Kai-Wei Chang, Hongning Wang
- Computer ScienceAnnual International ACM SIGIR Conference on…
- 5 June 2019
A two-level hierarchical recurrent neural network is introduced to learn search context representation of individual queries, search tasks, and corresponding dependency structure by jointly optimizing two companion retrieval tasks: document ranking and query suggestion.
On Difficulties of Cross-Lingual Transfer with Order Differences: A Case Study on Dependency Parsing
- Wasi Uddin Ahmad, Zhisong Zhang, Xuezhe Ma, E. Hovy, Kai-Wei Chang, Nanyun Peng
- Computer Science, LinguisticsNorth American Chapter of the Association for…
- 1 November 2018
Investigating crosslingual transfer and posit that an orderagnostic model will perform better when transferring to distant foreign languages shows that RNN-based architectures transfer well to languages that are close to English, while self-attentive models have better overall cross-lingualtransferability and perform especially well on distant languages.
Multi-Task Learning for Document Ranking and Query Suggestion
- Wasi Uddin Ahmad, Kai-Wei Chang, Hongning Wang
- Computer ScienceInternational Conference on Learning…
- 15 February 2018
GATE: Graph Attention Transformer Encoder for Cross-lingual Relation and Event Extraction
- Wasi Uddin Ahmad, Nanyun Peng, Kai-Wei Chang
- Computer ScienceAAAI Conference on Artificial Intelligence
- 6 October 2020
This work introduces GATE, a Graph Attention Transformer Encoder, and test its cross-lingual transferability on relation and event extraction tasks, and shows that GATE outperforms three recently proposed methods by a large margin.
PolicyQA: A Reading Comprehension Dataset for Privacy Policies
- Wasi Uddin Ahmad, Jianfeng Chi, Yuan Tian, Kai-Wei Chang
- Computer ScienceFindings
- 6 October 2020
This paper argues that providing users with a short text span from policy documents reduces the burden of searching the target information from a lengthy text segment, and evaluates two existing neural QA models and performs rigorous analysis to reveal the advantages and challenges offered by PolicyQA.
Cross-Lingual Dependency Parsing with Unlabeled Auxiliary Languages
- Wasi Uddin Ahmad, Zhisong Zhang, Xuezhe Ma, Kai-Wei Chang, Nanyun Peng
- Computer Science, LinguisticsConference on Computational Natural Language…
- 20 September 2019
This work explores adversarial training for learning contextual encoders that produce invariant representations across languages to facilitate cross-lingual transfer and proposes to leverage unannotated sentences from auxiliary languages to help learning language-agnostic representations.
Intent-aware Query Obfuscation for Privacy Protection in Personalized Web Search
- Wasi Uddin Ahmad, Kai-Wei Chang, Hongning Wang
- Computer ScienceAnnual International ACM SIGIR Conference on…
- 27 June 2018
This work proposes a client-centered intent-aware query obfuscation solution for protecting user privacy in a personalized web search scenario, and develops two new metrics from an information theoretic perspective to evaluate the effectiveness of provided privacy protection.
Multi-lingual Evaluation of Code Generation Models
- Ben Athiwaratkun, Sanjay Krishna Gouda, Bing Xiang
- Computer ScienceArXiv
- 26 October 2022
This work presents MBXP, an execution-based code completion benchmark in 10+ programming languages that is able to evaluate code generation models in a multi-lingual fashion, and discovers generalization ability of language models on out-of-domain languages, advantages of large multi-lingsual models over mono-lingUAL, benefits of few-shot prompting, and zero-shot translation abilities.
...
...