PandaSearch: A fine-grained academic search engine for research documents

  title={PandaSearch: A fine-grained academic search engine for research documents},
  author={Feiran Huang and Jia Li and Jiaheng Lu and T. Ling and Zhaoan Dong},
  journal={2015 IEEE 31st International Conference on Data Engineering},
In the world of academia, research documents enable the sharing and dissemination of scientific discoveries. During these “big data” times, academic search engines are widely used to find the relevant research documents. Considering the domain of computer science, a researcher often inputs a query with a specific goal to find an algorithm or a theorem. However, to this date, the return result of most search engines is just as a list of related papers. Users have to browse the results, download… Expand
PANDA: A platform for academic knowledge discovery and acquisition
This work state the problem of academic knowledge discovery and acquisition using an hybrid framework, integrating the accuracy of human workers and the speed of automatic algorithms, as crowdsourcing has become a powerful paradigm for problem-solving. Expand
PivotE: Revealing and Visualizing the Underlying Entity Structures for Exploration
This demonstration presents an entity-oriented exploratory search prototype system that is able to support search and explore KGs in a exploratorysearch manner, where local structures of KGs can be dynamically discovered and utilized for guiding users to further explore the information space. Expand
Using hybrid algorithmic-crowdsourcing methods for academic knowledge acquisition
The problem of academic knowledge discovery and acquisition is considered as a crowd-sourced database problem and a hybrid framework to integrate the accuracy of crowdsourcing workers and the speed of automatic algorithms is shown. Expand
Big Scholarly Data: A Survey
The background and state of the art of scholarly data management and relevant technologies are examined, and data analysis methods, such as statistical analysis, social network analysis, and content analysis for dealing with big scholarly data are reviewed. Expand
Using Crowdsourcing for Fine-Grained Entity Type Completion in Knowledge Bases
A hybrid machine-crowdsourcing approach for fine-grained entity type completion that determines the types of some “representative” entities via crowdsourcing and then infers the types for remaining entities based on the crowdsourcing results. Expand


A Dataset Search Engine for the Research Document Corpus
This work proposes a framework to effectively identify datasets within the scientific corpus, and builds a user friendly web-based search interface for users to conveniently explore the dataset-paper relationships, and find relevant datasets and their properties. Expand
Extracting Keyphrases from Research Papers Using Citation Networks
This work proposes CiteTextRank for keyphrase extraction from research articles, a graph-based algorithm that incorporates evidence from both a document's content as well as the contexts in which the document is referenced within a citation network. Expand
Single Document Keyphrase Extraction Using Neighborhood Knowledge
This paper proposes to use a small number of nearest neighbor documents to provide more knowledge to improve single document keyphrase extraction. Expand
Fast and unified local search for random walk based k-nearest-neighbor query in large graphs
FLoS (Fast Local Search) is presented, a unified local search method for efficient and exact top-k proximity query in large graphs based on the no local optimum property of proximity measures. Expand
Single-pass active learning with conflict and ignorance
The results based on real-world binary and multi-class classification streaming data show that the single-pass active learning approach yields evolving classifiers whose performance is similar to that of classifiers using all samples for adaptation; however, the annotation effort in terms of the number of class label requests is reduced by up to 90 %. Expand
LIBSVM: A library for support vector machines
Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail. Expand