Using Automatic Metadata Extraction to Build a Structured Syllabus Repository

  title={Using Automatic Metadata Extraction to Build a Structured Syllabus Repository},
  author={Xiaoyan Yu and Manas Tungare and Weiguo Fan and Manuel A. P{\'e}rez-Qui{\~n}ones and Edward A. Fox and William Cameron and Lillian N. Cassel},
Syllabi are important documents created by instructors for students. [] Key Method We discuss our detailed process for converting unstructured syllabi to structured representations through entity recognition, segmentation, and association. Our evaluation results demonstrate the effiectiveness of our extractor and also suggest improvements. We hope our work will benefit not only users of our services but also people who are interested in building other genre-specific repositories.
Representation of Latin American University Syllabuses in a Semantic Network
This study carried out a content analysis to extract the common terms in the syllabuses from the best universities in Latin America, and combined automatic and manual techniques to infer the structure of a syllabus into a semantic network.
Semantic Model of Syllabus and Learning Ontology for Intelligent Learning System
An effective method for enhancing the learning effect of students through the construction of subject ontology, which is used in discussion, visual presentation, and knowledge sharing between instructor and students is proposed.
The Syllabus Based Web Content Extractor (SBWCE)
As multiple format educational information is needed for Syllabus based content; the technique used makes the finding of such content easier and creates an instant online book view based on the links relevant to the givenSyllabus.
Learning Concept Sequencing through Semantic-based Syllabus Design and Integration
A layered structure of learning ontologies is proposed, which are composed of curriculum ontology, syllabus ontologies, learning subject ontology and learning concept ontology to support adaptive learning sequencing based on the syllabus.
Development of a National Syllabus Repository for Higher Education in Ireland
A prototype syllabus repository system for higher education in Ireland is described that has been developed by utilising a number of information extraction and document classification techniques, including a new fully unsupervised document classification method that uses a web search engine for automatic collection of training set for the classification algorithm.
Stanza Type Identification using Systematization of Versification System of Hindi Poetry
The paper covers various challenges and the best possible solutions for those challenges, describing the methodology to generate automatic metadata for “Chhand” based on the poems’ stanzas, and provides some advanced information and techniques for metadata generation for ”Muktak Chhands”.
Computational linguistic prosody rule-based unified technique for automatic metadata generation for Hindi poetry
This research paper majorly focuses on the unified-rule based technique for the generation of metadata based on the different set of rules of prosody, which was able to achieve 98.09% accuracy with the implementation of this unified- rule based technique.
Anotação semântica automática de Objetos de Aprendizagem Digitais: Um mapeamento sistemático de literatura
The Semantic Web technologies enable the semantic annotation of Learning Objects (LO), which allows more accurate and possibly quicker methods for the search and retrieval of LO. Since the LO manual
The use of Semantic Technologies in Computer Science Curriculum: A Systematic Review
A systematic review is carried out to provide an overview of the application of Semantic technologies in the context of the Computer Science curriculum and discuss the limitations in this area, whilst offering insights for future research.


Towards a Standardized Representation of Syllabi to Facilitate Sharing and Personalization of Digital Library Content
The current practices in creating and publishing syllabi are reported on and the motivation for a standardized syllabus schema is presented, which describes the tools needed for working with syllabi schema, and applications made possible with the availability of syllabi in standardized formats are described.
Automatic syllabus classification
A syllabus classifier to filter noise out from search results and discusses various steps in the classification process, including class definition, training data preparation, feature selection, and classifier building using SVM and Naïve Bayes.
Finding Educational Resources on the Web: Exploiting Automatic Extraction of Metadata
This work explores using text classification and information extraction techniques to automatically gather metadata for classificatory metadata for web educational resources put up by faculty members.
Automatic document metadata extraction using support vector machines
It is found that discovery and use of the structural patterns of the data and domain based word clustering can improve the metadata extraction performance and an appropriate feature normalization also greatly improves the classification performance.
Reasoning and Ontologies for Personalized E-Learning in the Semantic Web
It is shown how the semantic web resource description formats can be utilized for automatic generation of hypertext structures from distributed metadata and a logic-based approach to educational hypermedia using TRIPLE, a rule and query language for the semantic net.
Stepping Stones and Pathways:Improving Retrieval by Chains of Relationships between Documents
A retrieval scheme is defined that enhances retrieval through a framework that combines multiple sources of evidence and has the potential to improve retrieval results whenever there is a mismatch between the user's understanding of the collection and the actual collection content.
Bibliographic attribute extraction from erroneous references based on a statistical model
  • A. Takasu
  • Computer Science
    2003 Joint Conference on Digital Libraries, 2003. Proceedings.
  • 2003
A statistical model for attribute extraction that represents both the syntactical structure of references and OCR error patterns is proposed and it is shown that the proposed model has advantages in reducing the cost of preparing training data.
Web-assisted annotation, semantic indexing and search of television and radio news
The Rich News system, that can automatically annotate radio and television news with the aid of resources retrieved from the World Wide Web, is described, and an evaluation shows that the system operates with high precision, and with a moderate level of recall.
Advances in domain independent linear text segmentation
This paper describes a method for linear text segmentation which is twice as accurate and over seven times as fast as the state-of-the-art (Reynar, 1998). Inter-sentence similarity is replaced by
The Core: Digital Library Education in Library and Information Science Programs
This paper identifies the "state of the art" in digital library education in Library and Information Science programs, by identifying the readings that are assigned in digital library courses and the