Improved annotation of the blogosphere via autotagging and hierarchical clustering

@inproceedings{Brooks2006ImprovedAO,
  title={Improved annotation of the blogosphere via autotagging and hierarchical clustering},
  author={Christopher H. Brooks and Nancy Montanez},
  booktitle={WWW '06},
  year={2006}
}
Tags have recently become popular as a means of annotating and organizing Web pages and blog entries. Advocates of tagging argue that the use of tags produces a 'folksonomy', a system in which the meaning of a tag is determined by its use among the community as a whole. We analyze the effectiveness of tags for classifying blog entries by gathering the top 350 tags from Technorati and measuring the similarity of all articles that share a tag. We find that tags are useful for grouping articles… 
Socialtagger - collaborative tagging for blogs in the long tail
TLDR
This work proposes and implements a system to automatically recommend collaborative tags for a blog, which will help to surface the blogs by making them available on social book marking sites and allow them to be easily discovered and potentially further tagged by a wider population.
Blogosonomy: Autotagging Any Text Using Bloggers' Knowledge
TLDR
A method of multi-autotagging, based on k-NN, which is a case-based classification method that has the functions of merging tags with the same meaning and identifying informative tags and the term weighting method named residual document frequency(RDF); it can score the similarity between tags.
Automatic tag recommendation for the web 2.0 blogosphere using collaborative tagging and hybrid ANN semantic structures
TLDR
This research makes use of collective intelligence extracted from Web 2.0 collaborative tagging as well as word semantics to learn how to predict the best set of tags to use, using a hybrid artificial neural network (ANN).
Blog Classification Using Tags: An Empirical Study
TLDR
The experiments showed that tag expansion helped to improve the recall of blog classification with the price of precision degradation, and proposed a tag expansion algorithm that assigns a blog more tags that are often co-occur with those already associated with the blog.
CDIP: Collection-Driven, yet Individuality-Preserving Automated Blog Tagging
TLDR
This paper proposes CDIP (a collection-driven, yet individuality- preserving tagging system) which relies on relationships provided by quotation/reuse detection and semantic-focus analysis to automatically tag the blogs in such a way that, not-only the related blogs share tags, but also individuality of the entries is preserved for discriminating tag-based accesses.
CDIP: Collection-Driven, yet Individuality-Preserving Automated Blog Tagging
TLDR
This paper proposes CDIP (a collection-driven, yet individuality- preserving tagging system) which relies on relationships provided by quotation/reuse detection and semantic-focus analysis to automatically tag the blogs in such a way that, not-only the related blogs share tags, but also individuality of the entries is preserved for discriminating tag-based accesses.
Tag Similarity in Folksonomies
TLDR
This paper proposes an extension for Jensen-Shannon Divergence which is sensitive for the size of the sample from which the co-occurrence probability distributions are calculated and evaluated the approach by applying it on folksonomies obtained from Flickr.
Contextualising tags in collaborative tagging systems
TLDR
This work investigates how the different contexts in which individual tags are used can be revealed automatically without consulting any external resources, and shows that networks which explicitly take the social context into account are more likely to give a better picture of the semantics of a tag.
Using association rules for discovering tag bundles in social tagging data
TLDR
It is shown how common structures in form of tag bundles (a set of related tags) can automatically be derived from a bag of tags by applying a well-established data mining algorithm.
An Overview of Social Tagging and Applications
TLDR
This survey paper would like to summarize different techniques employed to study various aspects of tagging, including properties of tag streams, tagging models, tag semantics, generating recommendations using tags, visualizations of tags, applications oftags, integration of different tagging systems and problems associated with tagging usage.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 17 REFERENCES
Folksonomies-Cooperative Classification and Communication Through Shared Metadata
This paper examines user-generated metadata as implemented and applied in two web services designed to share and organize digital media to better understand grassroots classification. Metadata data
N-gram-based text categorization
TLDR
An N-gram-based approach to text categorization that is tolerant of textual errors is described, which worked very well for language classification and worked reasonably well for classifying articles from a number of different computer-oriented newsgroups according to subject.
Scatter/Gather: a cluster-based approach to browsing large document collections
TLDR
This work presents a document browsing technique that employs document clustering as its primary operation, and presents fast (linear time) clustering algorithms which support this interactive browsing paradigm.
WordNet: A Lexical Database for English
TLDR
WordNet1 provides a more effective combination of traditional lexicographic information and modern computing, and is an online lexical database designed for use under program control.
The Description Logic Handbook
This introduction presents the main motivations for the development of Description Logics (DLs) as a formalism for representing knowledge, as well as some important basic notions underlying all
Introduction to Modern Information Retrieval
TLDR
Reading is a need and a hobby at once and this condition is the on that will make you feel that you must read.
Editors
  • Brain Research Bulletin
  • 1986
This specification defines the Document Object Model Level 2 HTML, a platformand language-neutral interface that allows programs and scripts to dynamically access and update the content and structure
Folksonomies: power to the people. Paper presented at the ISKO Italy-UniMIB meeting
  • Available at http://www.iskoi.org/doc/folksonomies.htm,
  • 2005
Folksonomies: power to the people. Pape r presented at the ISKO Italy-UniMIB meeting
  • Available at http://www.iskoi.org/doc/folksonomies.htm,
  • 2005
Blog entry at http://www.corante.com/many/archives
  • Blog entry at http://www.corante.com/many/archives
  • 2004
...
1
2
...