• Publications
Joining Extractions of Regular Expressions
TLDR
We investigate the complexity of querying text by Conjunctive Queries (CQs) and Unions of CQs (UCQs), on top of regex formulas.
  • 38
  • 6
Detecting Ambiguity in Prioritized Database Repairing
In its traditional definition, a repair of an inconsistent database is a consistent database that differs from the inconsistent one in a "minimal way." Often, repairs are not equally legitimate, as ...
  • 14
Recursive Programs for Document Spanners
TLDR
A document spanner models a program for Information Extraction (IE) as a function that takes as input a document (string over a finite alphabet) and produces a relation of spans (intervals in the document) over a predefined schema.
  • 14
Complexity Bounds for Relational Algebra over Document Spanners
TLDR
We investigate the complexity of evaluating queries in Relational Algebra (RA) over the relations extracted by regex formulas (i.e., regular expressions with capture variables) over text documents.
  • 12
Weight Annotation in Information Extraction
TLDR
The framework of document spanners abstracts the task of information extraction from text as a function that maps every document (a string) into an annotated relation over the document's spans (intervals identified by their start and end indices).
  • 11
Incorporating information extraction in the relational database model
TLDR
We extend the relational model by incorporating into it the theory of document spanners, which provides the means and methods for the model to engage in Information Extraction (IE) tasks.
  • 6
A note on the emptiness problem for alternating finite-memory automata
TLDR
We present alternative, relatively simple, and self-contained proofs of decidability of the emptiness problems for one-register alternating finite-memory automata.
  • 2
Unambiguous Prioritized Repairing of Databases
TLDR
We investigate the complexity of deciding whether the priority relation suffices to clean the database unambiguously, or in other words, whether there is exactly one optimal repair.
  • 2
Grammars for Document Spanners
TLDR
We propose a new grammar-based language for defining information extractors from documents (text), built upon the well-studied framework of Document Spanners for extracting structured data from text.
  • 2
Visual analytics for movement behavior in traffic and transportation
TLDR
This paper illustrates the value of applying visual analytics techniques to movement data, collected by vehicle-mounted devices on buses and trams, to create insight for public transportation planning.
  • 3