Harnessing Wikipedia for Smart Tags Clustering

The quality of the current tagging services can be greatly improved if the service is able to cluster tags by their meaning. Tag clouds clustered by higher level topics enable the users to explore their tag space, which is especially needed when tag clouds become large. We demonstrate TagCluster – a tool for automated tag clustering that harnesses knowledge from Wikipedia about semantic relatedness between tags and names of categories to achieve smart clustering. Our approach shows much better quality of clusters compared to the existing techniques that rely on tag co-occurrence analysis in the tagging service.

Dynamic Network Analysis of Wikis

Wikis have their seeds in the easy collaborative editing and maintenance of web pages. This was picked up by tremendously successful public projects such as the online encyclopedia Wikipedia. Creating, modifying and maintaining of wiki articles implies social structures and dependencies between wiki authors and wiki articles themselves. The general challenge of this work is to consider these structures as dynamic evolving networks and to point out prominent behaviors in large wiki-based networks. We present an environment capable of handling data management, measurement and visualization issues for the dynamic network analysis of publicly available wiki data.