Share This Author
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
- Leo Gao, Stella Rose Biderman, Connor Leahy
- Computer ScienceArXiv
- 31 December 2020
TLDR
GPT-Neo: Large Scale Autoregressive Language Modeling with Mesh-Tensorflow
- Sid Black, Leo Gao, Phil Wang, Connor Leahy, Stella Rose Biderman
- Computer Science
- 21 March 2021
Multitask Prompted Training Enables Zero-Shot Task Generalization
- Victor Sanh, Albert Webson, Alexander M. Rush
- Computer ScienceICLR
- 15 October 2021
TLDR
VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance
- Katherine Crowson, Stella Rose Biderman, Edward Raff
- Computer ScienceArXiv
- 18 April 2022
TLDR
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
- Isaac Caswell, Julia Kreutzer, Mofetoluwa Adeyemi
- Computer ScienceTACL
- 22 March 2021
TLDR
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
- Sid Black, Stella Rose Biderman, Samuel Weinbach
- Computer ScienceBIGSCIENCE
- 14 April 2022
We introduce GPT-NeoX-20B, a 20 billion parameter autoregressive language model trained on the Pile, whose weights will be made freely and openly available to the public through a permissive license.…
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
- Aarohi Srivastava, Abhinav Rastogi, Uri Shaham
- Computer ScienceArXiv
- 9 June 2022
TLDR
Pitfalls in Machine Learning Research: Reexamining the Development Cycle
- Stella Rose Biderman, W. Scheirer
- Computer ScienceICBINB@NeurIPS
- 4 November 2020
TLDR
Neural Language Models are Effective Plagiarists
- Stella Rose Biderman, Edward Raff
- Computer ScienceArXiv
- 19 January 2022
TLDR
Cut the CARP: Fishing for zero-shot story evaluation
- Shahbuland Matiana, J. Smith, Spencer Frazier
- Computer ScienceArXiv
- 6 October 2021
TLDR
...
...