
About CORD-19

The Semantic Scholar team at the Allen Institute for AI built and released CORD-19 in collaboration with leading research institutes. Our goal is to provide researchers with free and open tools and datasets to find new insights about the novel coronavirus.

We are a team of researchers and engineers who have been studying the problem of information overload in science since the launch of Semantic Scholar in 2015. The research and technology that power Semantic Scholar gave us the ability to move quickly in response to the White House’s request to aggregate the largest structured dataset of coronavirus research for the global research community. You can learn more in our joint statement: Call to Action to the Tech Community on New Machine Readable COVID-19 Dataset.

This pandemic highlights the ongoing need for publishers to provide timely, open access to scientific literature to benefit the common good. If you are a publisher who would like to contribute to this open corpus, please let us know using the form below.

Resources for Data Scientists from AI2 and Semantic Scholar

Our team built the CORD-19 dataset using the open resources provided by AI2 and Semantic Scholar. Learn more about the research and technology that’s powering this project.

Contact Us

Interested in contributing to CORD-19? Let us know and we’ll be in touch.

Contribute to CORD-19

For research inquiries, please contact Kyle Lo and Lucy Lu Wang. For inquiries regarding SciSight and knowledge discovery, please contact Tom Hope.

To maximize impact and increase the full text available to the global research community, we are actively encouraging publishers to make their research content openly available for AI projects like this that benefit the common good.

If you’re a publisher interested in contributing to the CORD-19 corpus, please contact

Subscribe to CORD-19 News

Join our mailing list to receive the latest CORD-19 news.