Distilling Task Knowledge from How-To Communities

@article{Chu2017DistillingTK,
  title={Distilling Task Knowledge from How-To Communities},
  author={Cuong Xuan Chu and Niket Tandon and Gerhard Weikum},
  journal={Proceedings of the 26th International Conference on World Wide Web},
  year={2017}
}
Knowledge graphs have become a fundamental asset for search engines. A fair amount of user queries seek information on problem-solving tasks such as building a fence or repairing a bicycle. However, knowledge graphs completely lack this kind of how-to knowledge. This paper presents a method for automatically constructing a formal knowledge base on tasks and task-solving steps, by tapping the contents of online communities such as WikiHow. We employ Open-IE techniques to extract noisy candidates… Expand
Know-How in Programming Tasks: From Textual Tutorials to Task-Oriented Knowledge Graph
TLDR
The resulting knowledge graph, TaskKG, includes a hierarchical taxonomy of activities, three types of activities relationships and five types of activity attributes, and enables activity-centric knowledge search and is promising in helping developers finding correct answers to programming how-to questions. Expand
What Computers Should Know, Shouldn't Know, and Shouldn't Believe
TLDR
Automatically constructed knowledge bases are a powerful asset for search, analytics, recommendations and data integration, with intensive use at big industrial stake-holders, forming the Web of Linked Open Data. Expand
HealthAid: Extracting domain targeted high precision procedural knowledge from on-line communities
TLDR
A pipeline of methods leveraging open information extraction tool to extract procedural knowledge by tapping into on-line communities is developed and a mechanism to canonicalize the task frames into clusters based on the similarity of the problems they intend to solve is devised. Expand
MyFixit: An Annotated Dataset, Annotation Tool, and Baseline Methods for Information Extraction from Repair Manuals
TLDR
This paper introduces a semi-structured dataset of repair manuals and proposes methods that can serve as baselines for information extraction (IE) from the instructional text in repair manuals, including an unsupervised method based on a bags-of-n-grams similarity for extracting the needed tools in each repair step, and a deep-learning-based sequence labeling model for extracts the identity of disassembled parts. Expand
Learning Household Task Knowledge from WikiHow Descriptions
TLDR
A model to learn embeddings for tasks, as well as the individual steps that need to be taken to solve them, based on WikiHow articles are proposed, such that they are predictive of both step relevance and step ordering. Expand
Recent Trends in Natural Language Understanding for Procedural Knowledge
TLDR
This paper seeks to provide an overview of the work in procedural knowledge understanding, and information extraction, acquisition, and representation with procedures, to promote discussion and provide a better understanding of procedural knowledge applications and future challenges. Expand
Learning Procedures from Text: Codifying How-to Procedures in Deep Neural Networks
TLDR
To identify the relationships, this paper proposes an end-to-end neural network architecture, which can selectively learn important procedure-specific relationships and outperforms the existing entity relationship extraction algorithms. Expand
Mining Procedures from Technical Support Documents
TLDR
This paper outlines a system for mining procedures from technical support documents and creates models for solving important subproblems like extraction of procedures, identifying decision points within procedures, identifies blocks of instructions corresponding to these decision points and mapping instructions within a decision block. Expand
Hierarchy-aware Learning of Sequential Tool Usage via Semi-automatically Constructed Taxonomies
TLDR
This paper builds a taxonomy of tools with hyponymy and hypernymy relations from the data by decomposing all multi-word expressions of tool names and develops a sequential model that performs a binary prediction for each node in the taxonomy. Expand
Towards Modeling Revision Requirements in wikiHow Instructions
TLDR
This work extends an existing resource of textual edits with a complementary set of approx. Expand
...
1
2
3
...

References

SHOWING 1-10 OF 60 REFERENCES
Leveraging Procedural Knowledge for Task-oriented Search
TLDR
A set of textual features and structural features are proposed to identify key search phrases from task descriptions, and then adapt similar features to extract wikiHow-style procedural knowledge descriptions from search queries and relevant text snippets. Expand
Acquiring Comparative Commonsense Knowledge from the Web
TLDR
This paper relies on open information extraction methods to obtain large amounts of comparisons from the Web and develops a joint optimization model for cleaning and disambiguating this knowledge with respect to WordNet, which relies on integer linear programming and semantic coherence scores. Expand
Open Information Extraction: The Second Generation
TLDR
The second generation of Open IE systems are described, which rely on a novel model of how relations and their arguments are expressed in English sentences to double precision/recall compared with previous systems such as TEXTRUNNER and WOE. Expand
Open Language Learning for Information Extraction
Open Information Extraction (IE) systems extract relational tuples from text, without requiring a pre-specified vocabulary, by identifying relation phrases and associated arguments in arbitraryExpand
Knowlywood: Mining Activity Knowledge From Hollywood Narratives
TLDR
A pipeline for semantic parsing and knowledge distillation is developed, to systematically compile semantically refined activity frames, mined from about two million scenes of movies, TV series, and novels. Expand
Creating Causal Embeddings for Question Answering with Minimal Supervision
TLDR
This work argues that a better approach is to look for answers that are related to the question in a relevant way, according to the information need of the question, which may be determined through task-specific embeddings, and implements causality as a use case. Expand
Cross Sentence Inference for Process Knowledge
TLDR
This work extends standard within sentence joint inference to inference across multiple sentences, which promotes role assignments that are compatible across different descriptions of the same process. Expand
POLY: Mining Relational Paraphrases from Multilingual Sentences
TLDR
A new method for building language resources that systematically organize paraphrases for binary relations and the resource itself, called POLY is presented, which shows significant improvements in precision and recall over the prior works on PATTY and DEFIE. Expand
PATTY: A Taxonomy of Relational Patterns with Semantic Types
TLDR
PATTY is a large resource for textual patterns that denote binary relations between entities that are semantically typed and organized into a subsumption taxonomy that harnesses the rich type system and entity population of large knowledge bases. Expand
Question-Answer Driven Semantic Role Labeling: Using Natural Language to Annotate Natural Language
TLDR
The results show that non-expert annotators can produce high quality QA-SRL data, and also establish baseline performance levels for future work on this task, and introduce simple classifierbased models for predicting which questions to ask and what their answers should be. Expand
...
1
2
3
4
5
...