Category Archives: Folksonomy

Folksonomy

A Folksonomy is an emergent system of describing online data with tags added by a large number of users. This is contrasted with a deliberately designed taxonomy of predefined hierarchical categories.

Contents
Categories and Tags on edocr
Wiki Wednesday and Forest Friday
Emergent coding analysis facilitated by mediawiki

A Folksonomy is an emergent system of describing online data with tags added by a large number of users. This is contrasted with a deliberately designed taxonomy of predefined hierarchical categories.

Categories and Tags on edocr

Categories and Tags on Document Sharing Websites

This post emerges from a conversation on twitter arising from a comment I made about categories after uploading a document to edocr.com

To get straight to the point, I usually hate being forced to choose a category from a drop down menu. It smacks of technocracy where the system designer is unnecessarily imposing, subtly or not, a narrow view of how data should be described. This clumsily reasserts the unequal relationship between the developer and user. Instead of being empowered to input your own stuff and personalise your profile, a poor uploading experience can leave you feeling like a patronised data entry clerk. I’m also convinced that it’s not the cleverest way to organise content

I happened to mention that I was writing about railways and ManojRanaweera tweeted me “we now have a whole category for railways”

I took this as encouragement to submit an upload in document format ( .pdf) to the excellent edocr.com service, of which I am an occasional but enthusiastic user. But why would the existence of a new category code make any difference? As if there were a whole load of content waiting around which cannot be submitted for want of more categories? Categories are only descriptions, they are not empty containers compelling somebody to fill them. If there isn’t a perfect category available, which is nearly all of the time, you just have to choose the nearest. It’s annoying, but common experience. Yes, but since we now live in the new twitter enabled webosphere, I don’t have to keep these thoughts to myself. The CEO, owner, promoter and whatnot of edocr.com is an active participant in the conversation and a long term twitter contact of mine so I can ask directly to Manoj:

@ManojRanaweera I’ll upload my post with far too much detail about Yorkshire railways, but do categories mean more than tags?

M replies:

@aroberts #edocr tags totally rely on publisher – that means no discipline. Categories force you 2 choose 1 category that is most relevant. http://lin.cr/fac helps structure the library of docs and make it easy to find within edocr.com

me:

@ManojRanaweera successful social object sites have used used folksonomy tagging only, eg flickr whereas fixed categories always problematic. On the other hand youTube insists on a broad category, but at least it’s not multi-level #edocr

Actually, that’s not even strictly true about flickr. They do have a top level categorisation but it is subtle and defaults sensibly so you might not even notice that all flickr pictures have to be designated either as

Categories and Tags on edocr flickrcategoriesSafe, Moderate or Restricted AND as either a Photo, Screenshot or Art/Illus

so immediately I can think of an exceptional case. How should scanned documents fit within these three? I’d probably choose photo but only after hesitating.

YouTube categories are silly, mixing subject matter and genre indiscriminately:

Categories and Tags on edocr youtubecategories

But back to document sharing:

@aroberts – depends on who your audience is. I do not think we are too far off other players providing similar product

The only similar product I know of is scribd so I went through the uploading process there as well and soon discovered:

“Scribd forces you to choose from some odd categorisations too”

Twitter must be really catching on now because Jason at Scribd clearly had an alert set and replied

jasonatscribd @aroberts Categories are a work in progress, and welcome suggestions, but they’re largely to help people browsing Scribd. The real power is in tags. The more rich and descriptive your tags, the better your chances of being found on Google, etc.

which is probably closer to my own view than Manoj on this topic, but there’s more to come on this from him:

@aroberts #edocr – The importance of this level of categorisation will become clearer as we continue to build edocr functionality. Will blog my thinking.

Manoj correctly suggests taking the discussion further on the blogs so this is my contribution.

Web2.0 may be old hat by now, but that doesn’t mean we have to bring back the old printed matter, library and directory based metaphors of web 1.

However brilliantly you have concieved your system of categories, parts of it will simply look stupid to somebody else because taxonomy is subjective and different people’s world views are inevitably at odds with each other

Search and tags are the contemporary means by which content is discovered and browsed. Explore has some value, but is secondary and library classification, indexing and directories are hangovers from last century because they always come up against the top down design problem, ignoring the more powerful emergent patterns that come from bottom up self organisation and collaborative meta data.

Posted in Folksonomy, social objects | Tagged , , , , , , , |

Wiki Wednesday and Forest Friday

I’ve got two evening appointments to write about this week. London Wiki Wednesday is the first since last March, when I spoke about my online exhibition and barn raising.

Wiki Wednesday and Forest Friday 108032318 10e0b9a566 m

It’s at 29/30 Fitzroy Square, in the shadow of the BT Tower – nearest tube Warren Street. The format for the evening is this: lots of speakers get 5 minutes each, a short time for networking and some food and wine. For my turn, I shall be talking about Wiki Facilitation, Taxonomy development from Folksonomy tagging and then introduce the concepts behind PajamaNation.

Then on Friday I’m down on the list of performers for the monthly Forest Roots club in Forest Gate, East London.

Wiki Wednesday and Forest Friday thumb forest

I’ll probably sing my latest song, “The Wreckers Prayer” and one other. Not Gernika though, that would take up the entire slot by itself and is better saved for April.

Posted in Folksonomy, London, Music | Tagged , , |

Emergent coding analysis facilitated by mediawiki

Emerging structure from tags on the DAR wiki.

I decided to spend a little time systematically tagging some more of the pages in the DAR wiki.

The mediawiki software has a facility to extract a list of all uncategorised pages automatically, and this is accessed from Special:Specialpages and then Special:Uncategorizedpages.
So I went through the list and added one or more tags (called categories in mediawiki) to each. For the sake of consistency, I kept another browser tab open with the list of all categories Special:Categories to refer to.

At the end of this process I had about 40 categories for the 70+ pages of content, and that’s far too many categories to list on the front page in the manner which has evolved up until now. I wondered if I could group clusters of them together into much broader categories, perhaps ending up with just a handful of top level tags. At this point I remembered noticing another special page endearingly named – Uncategorized categories. This suggested that category pages themselves can have category tags added to them and that this is also cross referenced somewhere in the database so worth a try. I added the category “Projects” to the category pages for both “drupal” and “Barn Raising” lo and behold a protopage for category Projects can be accessed which states

Category:Projects

Subcategories
There are 2 subcategories to this category.
B
* Barn raising
D
* Drupal

So I had instantly created a hierarchical structure with three levels. Main category, sub-category, and individual pages. And all without having to declare any categories or structure in advance. The meaning can emerge out of the development of the resource itself rather than having to be pre-defined with all teh constraints that imposes. This is a very powerful tool indeed, highly flexible and emergent, and particularly rewarding of the emergent design methodology which I ventured to adopt in the first place, although it did take nearly 12 months to reach this stage, nearing completion of one of the long cycles begun in my final year at Ultraversity.

To explain in more detail what can happen here:

New pages of content can be created in isolation, without any reference to how they might fit in with the rest of the dynamic resource hosted on a mediawiki.

At the point of creation or any time later, pages can be tagged with category descriptions just by inserting the code similar to the following at the bottom of the page:

[[Category: COPs]] [[Category: Links]]

The software then automatically picks up the creation of a new category code or the addition to an existing one. Category pages are special pages which list all of the pages to which the code has currently been applied. Category pages can themselves be tagged by the same method and this is also recognised by the software which can then list all of the sub-categories hanging off that tag.

Thus one of the recommended methods for emergent coding analysis of data is perfectly facilitated by this type of system, with the proviso that the data being analysed is loosely structured around a concept of pages. It probably wouldn’t work with lists or conversations. The amazing thing to me is that there doesn’t seem to be any conflict or tension between the loosely organised, unstructured, emergent method of collecting and developing the data and ending up with a highly organised strictly hierachical taxonomical index.

I then added the tag “Main category” to the page for “Projects” just to round things off, and all I have left is to work my way through the uncategorised categories from time to time and perhaps merge any duplicate concepts with similar names.

Posted in Action Research, Folksonomy, Tools, Wiki |

Thanks for reading Andy Roberts articles about Folksonomy on the DARnet Blog