43 Folders

43 Folders feed subscription icon - Shiny! Drowning in email? Try Inbox Zero to learn sane tips for dealing with high-volume email. And don’t miss the free Inbox Zero video. »

Login or register

Register for free on 43 Folders to comment on articles, post to our forum, customize your visits, and much more. Current users can login now.

Becoming a tagging kung-fu master

You’ve heard the hype about tagging. You’ve seen people flocking to sites like Flickr and del.icio.us, where they jump head-first into a pulsing mass of disjointed tags, possibly never to be heard from again. And you’ve wondered: how exactly is tagging worthwhile again?

Any idiot can tag, but you want tags that are useful rather than a disorganized mess. This is not an unreasonable desire, and by completing three simple steps before you start tagging, you too can become a tagging kung-fu master. (Or, if you want more intellectual cred, explicate your personal taxonomy.)

Whether you are tagging in a private, public, or collaborative system, consistency is the byword when tagging. Without a consistent pattern you won’t know what tags to assign items, what tags to search for to find items, or what items you’ll likely get while browsing your tags. The following three steps will help you create a consistent pattern to follow. Even if you’ve been tagging for a while, you may find these steps helpful to refine your knowledge of your own tagging habits and practices. (Please note, however, that these steps are focused on developing a personal tagging system; to optimize your tagging for collaborative use you would need to develop your system somewhat differently.)

Step 1: Know what

Are you tagging PDFs in Yep, notes in Notae, characters in Avenir, or photos in iPhoto ‘08? Whether you’re tagging in one program or several, you need to make a list of the general types of different items that you want to tag.

Tagging many different kinds of items does not make planning a tagging system much more complicated, but because you’ll tag different kinds of items differently you definitely need to think about what you’re going to tag.

Step 2: Know when

Part of knowing your target is knowing what kind of metadata is already available to you (through, say, Spotlight or the Finder) and not duplicating that metadata in your tags. For instance, every file in Mac OS X has a date created and date modified attached to it. As a result, tagging your files with a date is typically a silly idea. Tagging Word documents “word” is also redundant; the system knows which documents are Word documents and finding all of them is only a saved search away. Before you proceed to the third step, you need to make sure you know what information about your target you already have available. You don’t have to write it down if you don’t want to; just be aware.

Although there may be some situations in which you want to tag an item with every possible tag you can think of, most of the time you will want to keep your tags succinct and well-targeted, which means avoiding redundancy. Tags may be extremely flexible but they are the least efficient kind of metadata in some ways because they have no indication what they are marking. When you search the “date modified” field, you know exactly what you’re finding. An “05-31-2007” tag, on the other hand, could be any number of things.

Step 3: Pick your attributes

This is the heart of a consistent tagging system, and can be summed up in a single question: how do you think about the item you are tagging? For instance, when you are filing or searching for a photo, what do you think of? The location of the photo? The subject or people in the photo? The event taking place when you took the photo? Something else entirely?

Write out a list of the attributes that you think of when thinking of your target items. Ideally, you should make this a brainstormed list that includes every attribute you can possibly think of that you might want to tag. As you make the list for your different target items, star the attributes that spring immediately to mind.

Once you have a list, go through it to weed out the attributes that are covered by the item’s non-tag metadata. Then go through it again and pick out what attributes you want to use for tagging. Try to keep it a short, specific list focused on the attributes that sprang immediately to mind. You should also add attributes that didn’t spring immediately to mind, but that you want to make a habit of tagging anyway because they will be useful.

When you have this list of attributes, you are ready to tag. You should probably put your list of attributes somewhere visible, for example a Post-It by your computer or a virtual sticky note on-screen, at least until you’ve either memorized them or developed good tagging habits.

When you’re tagging, try to consistently attach a tag for every one of the attributes that you’ve selected. The more often you can hit all of them, the easier it will be for you to find files later. Additionally, knowing what attributes you are tagging makes coming up with specific tags much easier. Rather than sitting worrying over every photograph you can quickly attach a location, person, and event (or whatever attributes you decide on). Ideally, your attributes and tags should fit into the following sentence: “This [item]’s [attribute] is [tag].” For example, “this photo’s location is New York.”

The specific tags that you use will doubtless shift over time and circumstance, but the attributes that you are tagging should remain much more stable. By defining a standardized set of attributes for each kind of item that you are tagging and only deviating when necessary (or when the way you think about a given type of item begins to change), you will be able to create a consistent tagging system that helps you find items quickly because it matches the way you think.

And more importantly, you will have taken your first steps on the road to becoming a full tagging kung-fu master. Or developing a stream-lined personal taxonomy. Whichever works for you.


11 Comments

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.
Merlin Mann's picture

Merging Drupal terms, controlling vocabulary

This is orthogonal to Ian’s (very swell) post, but I wanted to insert a plug for Term Merge for anybody who’s struggling with taxonomy cleanup in Drupal. It was really useful to me in tidying up the 43f topics, and I still use it almost every day to try and keep the most popular categories consistent.

Also Drupal-related (but useful in many other contexts), I really like tagging interfaces that autocomplete. This makes it convenient for users to capitalize on existing tags, and really helps ensure that the vocabulary stays useful for everyone.

Great post, Ian. Thanks for this.

Ian Beck's picture

Re: Merging Drupal terms, controlling vocabulary

Glad you liked it! Autocomplete is definitely a must-have for any good tagging software. Another thing that Drupal has that I wish desktop software (and other web tagging software) would integrate is the ability to define synonyms (for instance with Term Merge). Particularly in public and collaborative tagging environments, being able to specify which tags are synonymous can be invaluable when it comes to people actually finding what they’re searching for and makes it far easier to manage the otherwise unwieldy mess that tags can so easily degenerate into.

Merlin Mann's picture

Drupal synonyms?

ability to define synonyms

Whoa. So, if I go to “Mac OS X” and define “OS X” and “OSX” and “Leopard” as synonyms, will hits to those category URLs redirect to “Mac OS X”?

Ultimate Tag Warrior for Wordpress (highly recommended) does that, and I’d love to have it on the new site, too.

Ian Beck's picture

Re: Drupal synonyms?

I believe so. You might need an additional module like the Synonyms module, though. I'm not terribly familiar with Drupal, so I've never set it up myself.

Unfortunately Ultimate Tag Warrior doesn't work with the newest version of WordPress (2.3) because WordPress now includes its own tagging. UTW's developer has released a number of "WordPress Things" that provide some of UTW's functionality on top of 2.3's tagging, but I don't think they offer synonyms (although they do provide tag merging).

Lyle Johnson's picture

How does this apply to del.icio.us bookmarks?

First of all, great idea for an article. The only application in which I’ve successfully used tagging is iPhoto, where I’ve tagged each photo with the names of the people that appear in it. When I need to, I can construct a smart folder of all the pictures that include Oscar, for example.

I would really love to hear what kinds of “attributes” people have settled on for tagging URLs in their del.icio.us bookmarks. I have a lot of bookmarks, but the tags that I assign to them are always very ad hoc and decided on the spur of the moment. As a result, I can’t reliably use them to “find” anything later. Does anyone have a more coherent system for doing this? What kinds of tags do you use?

shimon_s's picture

Tagging Philosophy

Thanks Ian for bringing up this topic, because many people think that tagging is just a matter of throwing whatever comes to their mind. Which of course brings their tagging system to a big mess.

The biggest frustration for the tagger would be when he 'knows' something is in his database and he can't find it (e.g. he used synonyms or didn't tag the right attributes).

I would like to refer to this issue more deeply and give a real world example - 'arranging your scanned documents using a tagging system'.

First let me introduce myself - my name is Shimon Sofer and I am a co founder of '42Tags' - a software for scanning and tagging documents (Windows only for now).

I would like to show how your ideas are implemented in the field of 'scanned document tagging' using '42Tags', and would like to introduce another idea - 'Major/Minor tags'.

In our application we distinguish between two types of tags. The first type - the 'Major tag' and the second type - the 'Minor tag'.

'Major tags' - should be relatively general and you should have the ability to easily control them,
because there shouldn't be much of them (if you are a home/small office user, 42 will be enough...:) ).

'Minor tags' - are those that are not part of the first group.

I would like to demonstrate the idea:

Let's say that you have a few medical documents which deal with 'a blood test for your daughter Jenny, which
was done to verify if she's got pneumonia. The doctor was John Smith".

In this case I suggest that the 'Major tags' will be "Medical" and "Jenny" and the minor tags will be "blood test", "pneumonia", "John Smith".

The division to attributes you talked about is more relevant to 'Major tags' than 'Minor tags'.

How does a 'Major tag' and a 'Minor tag' practically differ?

In 42Tags we added the ability to add a pushable button for every major tag.

Actually there is only one difference between a 'Major tag' and a 'Minor tag' - 'Major tags' have a shortcut button in a specified tag panel.

Regarding the separation to attributes we offer the 'Tag sections' in which the user should add 'Major tags' of the same attributes.

For example - 'Person section' - Each tag is a name of persons
'Places section' - Each tag is a name of place
'Paper type section' - e.g. Contract, Fax, Invoice, etc...

Anyone can define his own attributes for every section.

If the user would like to reach a 'Minor Tag' - he can just start typing, and auto-completion will help him through it.

Another issue not throughly discussed is the dynamic nature of tags.

After tagging 10-100 items you notice that you need to change your tagging scheme.

For example - you live in Paris and all the papers relevant for your apartment are tagged with
'Paris', and 'Apartment'. After a year you leave to another apartment - in Paris!

You may want to distinguish between the collections.

What we offer is making a search on 'Paris' and 'Apartment', and then using a multiple "Tag/Untag" feature in order for example to add "Rue Barute" to the old material.

Maybe I wrote too much, but I really enjoyed reading your post ;-)

Thanks,
Shimon Sofer
http://www.42tags.com/paperless

phenom.bade@gmail.com's picture

photosynth

microsoft uses the same tagging functionality to convert pictures into 3D models...check out microsofts photosynth

http://vidsonly.blogspot.com

infogrind's picture

To categorize or to retrieve, that is the question

I’ve been thinking my fair bit about tagging, and have come to the conclusion that there are two fundamentally different tagging strategies, depending on whether your goal is to categorize your information, or to make it easy to retrieve a particular item. I’ll use the example of bookmarks here, but the same essentially applies to any collection of information.

The “categorize” strategy is essentially the one proposed by Ian. The goal of this strategy is to be able to quickly retrieve all bookmarks belonging to a particular category. It consists of two steps. In the first step, you make a list of tags that you will use, corresponding to your categories. In the second step, you use tags from this list to tag your bookmarks as you add them to your collection. From time to time you might decide to add another tag to your list, if you’ve found that tag to be useful.

For example, your tags might be microsoft, windows, linux, gentoo, debian, kernel, cplusplus. If you’d want to see all bookmarks about kernels, whether they be about the Linux kernel or the Windows XP kernel, you’d use the kernel tag to find these. Or you might use the combination of kernel and linux to find only bookmarks about the Linux kernel.

The second, alternative strategy I call the “retrieval” strategy. Its goal is to quickly be able to retrieve one particular bookmark. The idea is that if you remember approximately the content of one particular website, you quickly find the right bookmark. For example, you would use the tags gentoo, kernel, compilation, and udev to tag a page about compiling the kernel with Udev support in Gentoo Linux.

The fundamental difference between the two strategies now emerges. If you were using the “categorize” strategy, you would be likely to also tag the Gentoo kernel page with linux, since it clearly falls into that category, but you would not use udev, since you will be very likely to never have another bookmark about Udev, and you want to avoid categories containing single items. On the other hand, if you are using the “retrieval” strategy and you want to find that particular page you remember about compiling Udev into the gentoo kernel, then the tag gentoo makes the tag linux obsolete. Indeed, since Gentoo implies Linux (unless your collection contains bookmarks about a that particular kind of penguin), the bookmarks tagged gentoo are a subset of those tagged linux, and adding the tag linux to your search does not alter your results. The udev tag, though, will be very helpful since it will narrow down your results to the wanted bookmark.

To summarize, the “categorize” strategy asks for relatively broad tag categories (such as linux or windows, but not compiling or udev), otherwise your tag list would quickly be a cluttered mess. On the other hand, for the “retrieval” strategy the tags should be as specific as possible. This will clutter up my tag list, you say? Never mind, if you don’t use the tags to browse through your bookmarks.

I am curious if there is a way to reconcile the two strategies, to get the “best of both worlds”.

(As an aside, note that the “categorize” strategy is not much different to conventional bookmark handling in a web browser: First you create your bookmark folder hierarchy, then you add bookmarks to these folders as you surf the web. The difference is that with folders you can only make sets and subsets, but you cannot have intersections of sets, since if a bookmark belongs to a particular folder, it cannot also belong to a folder which is not a subset or a superset of the first folder. Another difference between between the two approaches is that there is no tagging system I know of that allows you to define subset-relationships between tasks. For example, you would define gentoo as a sub-tag of linux, so that anything you tag with gentoo would also be tagged with linux. Similarly, you would define linux as a sub-tag of the operating systems tag, and so on.)

Ian Beck's picture

Re: to categorize or retrieve

I think your take on tagging for categorization vs. retrieval is a very interesting way to think about the process. Over on Tagamac I actually refer to three different actions with tags: tagging (categorizing), searching (retrieval), and browsing (concept introduced here). In the "Kung-fu master" article I focus exclusively on tagging, the idea being that if you have put thought into how you tag items then finding them will be easier because you know what sort of things to look for.

Searching is akin to your retrieval; it's using tags to find a specific item. Browsing is slightly different from your retrieval because you have no specific item as your target. Instead you're using tags to find groups of items or to see the relationships between items.

The reason I bring up tagging/searching/browsing is because what your tagging software supports makes a huge difference in how you should tag. I tend to think that tagging items with general tags vs. specific tags has less to do with categorizing vs. retrieving and far more to do with how your tagging software searches tags.

For instance, if I'm using software that has a tag cloud I will want to use as few tags as possible, balanced somewhere in the middle of the specific <-> general continuum. Too many tags, and the cloud becomes extremely unwieldy. Too few and it's useless for finding anything. A tag recipe, on the other hand, may work better if you tag items with as many specific tags as possible.

I think the way to find your "best of both worlds" lies less in how people tag and far more in what software is available for tagging. If you can arrange tags into complex hierarchies wherein browsing to a parent shows you all of the items with child tags (without attaching the parent tag to those items) then you can tag items with the most specific tags possible but still find items based on general concepts.

Unfortunately, I don't know of any software that doe this really well. Tagging is one of those features that has erupted everywhere, but has not (to my mind) been developed carefully enough.

Also, as a side note, I tried not to suggest coming up with a list of tags in my article. Rather, I think you should come up with a list of general attributes (categories or types of tags, if you will). This is an important difference; with a list of tags you'd like to use, you are far more limited (and it's a major pain to come up with such a list, because it's hard to predict what specific items you'll be tagging). With a list of attributes, your actual tag cloud has a lot more flexibility, but you still have a governing principle on how to tag items (which leads to consistent tagging, which leads to locating items easier).

Nalusk's picture

Surprised there's no Inbox Zero-like implementation in this one.

Hi, I’ve just joined so maybe I’m commenting in the wrong section or this has been suggested elsewhere before.

I don’t know how it works for every site but for those with tagging systems similar to delicious, I think the idea of Inbox Zero works well for tagging too.

I say this because one of the things I hate about tagging (I’m a folders type of guy) is how I have to really think about every detail of an item prior to tagging and still, I might miss a tag or so or add an extra tag that I’ll find I don’t actually need when doing the actual search.

First would be to have an inbox. For those who imported bookmarks in delicious, the import tags works well in my opinion.

The idea is to go wild on tagging or what aleene from Microcontent Musings calls informal tagging and add an “inbox” tag at the end of those tags. In this example, I would add an import tag for every bookmark I submit to delicious even the ones that I didn’t import. As an added bonus, all the bookmarks that got imported into delicious and had received informal tags due to delicious’ autotagging or just from a previous mistake are also tagged with the import tag assuming the person didn’t remove it.

Then if a user wants the advantage and flexibility of informal tagging, they could just go to the import tag and work on the related tags there. Still the goal would be to not have anything under import as much as possible without having to worry about every little item that’s going to be tagged.

The next step would be to just follow the above guide through and through and then add another consistent tag say satisfactory and replace the inbox tag with it. Then, when the user needs formal tagging, he gets to have that option to. The downside being that the items aren’t probably going to be organized formally as soon as they’re added but I find I’m much more focused in adding tags when I’m in organize item mode rather than add tags to new item mode.

Ian Beck's picture

Re: inbox tagging

It is indeed surprising that more of the productivity ideas in other realms have not been applied to tagging. Tagging is one of those things that is so popular that people tend to include it in their programs without thinking about it much, and as a result it could stand to be improved a lot.

Personally, I often use a "totag" tag on items that are partially or unorganized. Of course, I have to add this manually, so it's not ideal, but there you go. I totally agree that programs should institute a better system in place for collecting untagged or sloppily tagged items.

The only program that I know of and use personally that does something like this is Eagle Filer, which attaches an "unread" tag to absolutely everything that you toss at it. On the upside, this allows easier "inbox" editing. On the downside, "unread" is essentially meaningless for the use that I'm putting it to, and there's no option to change the default.

 
EXPLORE 43Folders THE GOOD STUFF

An Oblique Strategy:
Discard an axiom


STAY IN THE LOOP:

Subscribe with Google Reader

Subscribe on Netvibes

Add to Technorati Favorites

Subscribe on Pageflakes

Add RSS feed

The Podcast Feed

Inbox Zero

The original 43 Folders series looking at the skills, tools, and attitude needed to empty your email inbox — and then keep it that way. Don’t miss the free video of Merlin’s Inbox Zero presentation.

Get Started with ‘GTD’

David Allen’s popular productivity book and the system on which it’s based help turn ‘stuff’ into actions that support valuable outcomes.