About Semantic Scholar

Helping Scholars Discover New Insights

Semantic Scholar provides free, AI-driven search and discovery tools, and open resources for the global research community.

We index over 200 million academic papers sourced from publisher partnerships, data providers, and web crawls.

Accelerating Scientific Breakthroughs Using AI

We are a Research and Product Development team within the Allen Institute for AI building a better way to search and discover scientific knowledge.

Our state-of-the-art models, developed in-house, process and classify the papers in our pipeline. Our areas of AI research include Natural Language Processing, Machine Learning, Human-Computer Interaction, and Information Retrieval.

AI-Driven Tools for Scholars

Staying up to date with the scientific literature is an increasingly pressing challenge for scholars.

With Semantic Scholar, researchers can understand a paper at a glance. Our system extracts meaning and identifies connections from within papers, then surfaces these insights to help scholars discover and understand research.

Our AI-driven features include:

Author Homepages

We use AI to showcase authors’ impact on science and highlight their most influential papers.

Recommendations and Alerts

Stay up to date easily with customized recommendations based on your saved papers.


TLDRs

Automatically generated single-sentence paper summaries that help you prioritize which papers to read in depth.

Semantic Reader

An augmented reading application that makes the reading experience more accessible and richly contextual by providing citation information directly within the context of a paper.

Resources for the Global Research Community

We provide a free, reliable source of scholarly data for developers to build projects that accelerate scientific progress.

The Semantic Scholar Academic Graph (S2AG) Dataset and APIs provide records for research papers published in all fields, delivered as an easy-to-use JSON archive.

The Semantic Scholar Open Research Corpus (S2ORC) is a general-purpose corpus for NLP and text mining research over scientific papers, built and maintained by Semantic Scholar’s research team. Papers are aggregated into a unified source to create the largest publicly available collection of machine-readable academic text, provided as a JSON archive.
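As a rough illustration of working with paper records delivered as JSON, the sketch below reads a few records and counts papers per field of study. It is a minimal sketch under stated assumptions: the one-object-per-line layout, the field names (`paperId`, `title`, `year`, `fieldsOfStudy`), and the sample records are all hypothetical and are not taken from the actual S2AG or S2ORC schema, so check the dataset documentation before relying on them.

```python
import json
from collections import Counter

# Hypothetical sample records, one JSON object per line.
# The layout and field names are assumptions for illustration only.
SAMPLE_LINES = """\
{"paperId": "p1", "title": "A study of parsing", "year": 2020, "fieldsOfStudy": ["Computer Science"]}
{"paperId": "p2", "title": "Crop yields under drought", "year": 2021, "fieldsOfStudy": ["Agriculture", "Biology"]}
{"paperId": "p3", "title": "Untitled note", "year": null, "fieldsOfStudy": null}
"""


def iter_records(lines):
    """Yield one parsed record per non-empty line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def count_fields(records):
    """Count how many records mention each field of study."""
    counts = Counter()
    for record in records:
        # Treat a missing or null field list as empty.
        for field in record.get("fieldsOfStudy") or []:
            counts[field] += 1
    return counts


records = list(iter_records(SAMPLE_LINES.splitlines()))
field_counts = count_fields(records)
print(field_counts.most_common())
```

Reading line by line keeps memory use flat, which matters if an archive holds millions of records. For a real file, the same `iter_records` function could be passed an open file handle instead of the in-memory sample.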

AI For The Common Good

At AI2, our mission is to contribute to humanity through high-impact AI research and engineering.

Semantic Scholar was launched in 2015 as a groundbreaking project at the Allen Institute for AI, a non-profit research institute founded in 2014 with the mission of conducting high-impact AI research and engineering in service of the common good. AI2 is the creation of Paul Allen, Microsoft co-founder, and is led by Dr. Oren Etzioni, a leading AI researcher.

Are you interested in our mission? Join the team!

Experience a smarter way to search and discover scholarly research.

Latest News & Updates

Announcing S2FOS, an open source academic field of study classifier

New model makes academic field of study classification widely available and adds Linguistics, Law, Education, and Agriculture and Food Sciences to Semantic Scholar

Featured AI2er: Rodney Kinney

Rodney Kinney is a Principal Machine Learning Engineer on the Semantic Scholar team at AI2.

Semantic Scholar Academic Graph for Developers

Access more than 200 million papers through the Semantic Scholar Academic Graph Dataset and APIs