
hah. so true, but tagging popular stuff is almost useless. since it is popular i don’t need help to find it again.
Tag: tagging
LOC on Flickr
That’s why it is so exciting to let people know about the launch of a brand-new pilot project the Library of Congress is undertaking with Flickr, the enormously popular photo-sharing site that has been a Web 2.0 innovator. If all goes according to plan, the project will help address at least 2 major challenges: how to ensure better and better access to our collections, and how to ensure that we have the best possible information about those collections for the benefit of researchers and posterity. In many senses, we are looking to enhance our metadata (one of those Web 2.0 buzzwords that 90% of our readers could probably explain better than me).
LOC has a blog? when can we tag the whole LOC catalog?
cgoldfed library
CiteULike tag your research papers, etc. if this stuff were not behind paywalls, you’d think academia would do this themselves. a pretty crucial functionality
Tag Cloud
Pulse an inspired way to put a book online: not just a html dump (or worse, pdf), but finely granular posts with proper metadata, making deep linking simple. plus, tag clouds. now to begin assimilating the juicy morsels into my del.icio.us account..
Language evolution
I tend to rely on a more sensitive organ of hearing: a bookmarklet that I call dc, for del.icio.us conversation. I use it all the time. Suppose, for example, I’d found that University of Maryland page through some other means of referral than del.icio.us. I’d have reflexively clicked the dc bookmarklet to produce this report which shows who else has bookmarked that page, and how it has been described. In this case there’s not much to see. The URL was bookmarked once in Feb 07, by elzzup, to the tags data and class, and again in Jul 07, by manyeyes, to the tag publicdata. This view is interesting for a couple of reasons that I don’t think are widely appreciated. First, it shows a progression from general ways of describing the resource to a more particular way. Note, by the way, that the proposed refinement of data to publicdata is not visible when you launch the bookmarking form, which recommends only class and publicdata. Note also that the introduction of publicdata is really a hack. It would arguably be better to rely on the individual tags public and data. But that would make it necessary to query for the conjunction, and that connection is too fragile. So publicdata also suggests something about how to form tags — that is, by making these conjunctions explicit.
Google Image Labeler
Until now, for each match you got 100 points. But many people realized that they could easily win points if they typed generic tags like “man”, “people”, “photo”. To make the game more exciting and to improve the quality of tags, Google decided to change the way you get points. Now you can get anywhere from 50 to 150 depending on how specific your tag is.
refinements for harnessing slacker energy. maybe this is the long-sought zero-point energy?
Supervised labeling

A probabilistic formulation for semantic image annotation and retrieval is proposed. Annotation and retrieval are posed as classification problems where each class is defined as the group of database images labeled with a common semantic label. It is shown that, by establishing this one-to-one correspondence between semantic labels and semantic classes, a minimum probability of error annotation and retrieval are feasible with algorithms that are 1) conceptually simple, 2) computationally efficient, and 3) do not require prior semantic segmentation of training images. In particular, images are represented as bags of localized feature vectors, a mixture density estimated for each image, and the mixtures associated with all images annotated with a common semantic label pooled into a density estimate for the corresponding semantic class. This pooling is justified by a multiple instance learning argument and performed efficiently with a hierarchical extension of expectation-maximization. The benefits of the supervised formulation over the more complex, and currently popular, joint modeling of semantic label and visual feature distributions are illustrated through theoretical arguments and extensive experiments. The supervised formulation is shown to achieve higher accuracy than various previously published methods at a fraction of their computational cost. Finally, the proposed method is shown to be fairly robust to parameter tuning.
this system can produce tags on par with humans for many types of images.
Structured News
makes the point that news organizations could add a lot of value by marking news up properly with location, main actors, etc. they would do so out of SEO interest, but also do a huge favor for historians and knowledge representation.
Advanced Tagging and TripleTags
geobloggers hmm, that is starting to look very rdf-like. via bergie
Fragment Search
Greasemonkey script which allows people to create URLs which link to content within a page without having control over that page. this is great for more accurate tagging, and adding structure where the original authors failed