Expedition: A Time-Aware Exploratory Search System Designed for Scholars

  • Jaspreet Singh, Wolfgang Nejdl, Avishek Anand
  • Published 7 July 2016
  • Computer Science
  • Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval
Archives are an important source of study for various scholars. Digitization and the web have made archives more accessible and led to the development of several time-aware exploratory search systems. However, these systems have been designed for general users rather than scholars. Scholars have more complex information needs than general users. They also require support for corpus creation during their exploration process. In this paper, we present Expedition - a time-aware…

Designing Search Tasks for Archive Search

This paper proposes a set of search tasks, with varying complexity, that IIR researchers can utilize to study user search behavior in archives, and discusses how these tasks were created and refined as the result of a pilot study using a temporal search engine.

Personal Research Assistant for Online Exploration of Historical News

This paper focuses on the intelligent Personal Research Assistant component in the environment and the web interface, an interactive exploratory engine that combines results of various text analysis tools in an unsupervised fashion to conduct autonomous investigations on the data according to users’ needs.

Full-Text and URL Search Over Web Archives

While web search engines enable searching over the most recent web snapshot, web archives allow searching over multiple snapshots from the past, which means that web archives have to deal with a temporal dimension that is the cause of new challenges and opportunities.

A Brief Survey on Exploratory Search Systems

An overview of exploratory search systems is provided and the state of the art is discussed based on features such as algorithms, result ranking, explanations, etc.

ClioQuery: Interactive Query-oriented Text Analytics for Comprehensive Investigation of Historical News Archives

ClioQuery applies text simplification techniques from natural language processing to help historians quickly and comprehensively gather and analyze all occurrences of a query word across an archive, and pairs these new NLP methods with more traditional features like linked views and in-text highlighting to help engender trust in summarization techniques.

Toward comprehensive event collections

This paper presents an approach for building event-centric sub-collections from such large archives, which includes not only the core documents related to the event itself but, even more importantly, documents describing related aspects (e.g., premises and consequences).

Towards a coherence-oriented complex search experience management method

Ranking Archived Documents for Structured Queries on Semantic Layers

Two ranking models are proposed that jointly consider the relativeness of documents to entities, the timeliness of documents, and the temporal relations among the entities. The experimental results show the effectiveness of the proposed models and help explain their limitations.

Building Entity-Centric Event Collections

This paper presents an approach for building event-centric sub-collections from such large archives, which includes not only the core documents related to the event itself but, even more importantly, documents describing related aspects (e.g., premises and consequences).

Building and querying semantic layers for web archives (extended version)

This paper proposes an RDF/S model and a distributed framework for building semantic profiles (“layers”) that describe semantic information about the contents of web archives that can answer information needs that existing keyword-based systems are not able to sufficiently satisfy.



Clustering and exploring search results using timeline constructions

This paper presents an add-on to traditional information retrieval applications in which various kinds of temporal information associated with documents are exploited to present and cluster documents along timelines, and shows how temporal expressions are made explicit and used in the construction of multiple-granularity timelines.

EXPOSÉ: EXploring Past news fOr Seminal Events

Recent increases in digitization and archiving efforts on news data have led to overwhelming amounts of online information for general users, thus making it difficult for them to retrospect on past

Searching through time in the New York Times HCIR Challenge 2010

The Time Explorer is designed to help users discover how entities such as people and locations associated with a query change over time. By searching on time expressions extracted automatically from text, it allows the user to explore not only how topics evolved in the past, but also how they will continue to evolve in the future.

InZeit: Efficiently Identifying Insightful Time Points

The demonstrated system named InZeit (pronounced "insight") assists users by determining insightful time points for a given query, which are the time points at which the top-k time-travel query result changes substantially and for which the user should therefore inspect query results.

NEAT: News Exploration Along Time

The demonstration of the NEAT (News Exploration Along Time) prototype system, an attempt towards building an intuitive and exploratory interface for search results over large news archives using timelines, consists of an exploratory search interface where it is shown how queries can produce different timelines and how one can use temporal information to discover interesting facts.

Survey of Temporal Information Retrieval and Related Applications

This paper presents a survey of the existing literature on temporal information retrieval, categorizes the relevant research, describes the main contributions, and compares different approaches to provide a coherent view of the field.

History by Diversity: Helping Historians search News Archives

A novel algorithm, HistDiv, is presented that explicitly models the aspects and important time windows based on a historian's information-seeking behavior. HistDiv is found to outperform all competitors in subtopic recall with a slight loss in precision.

The newspaper as an information exploration metaphor

AIDA: An Online Tool for Accurate Disambiguation of Named Entities in Text and Tables

A Web-based online interface for AIDA is developed where different formats of inputs can be processed on the fly, returning proper entities and showing intermediate steps of the disambiguation process.

Diversifying search results

This work proposes an algorithm that well approximates this objective in general, and is provably optimal for a natural special case, and generalizes several classical IR metrics, including NDCG, MRR, and MAP, to explicitly account for the value of diversification.