Tags and Categories are not the Same! [en]

[fr] Les tags et les catégories, ce n'est pas la même chose. En bref, les catégories forment une structure hiérarchique, prédéfinie, qui régit l'architecture de notre contenu et aide autrui à s'y retrouver. Les tags sont spontanés, ad hoc, de granularité variable, tournés vers le partage et la recherche d'information.

Update, Sept. 2007: when I saw Matt in San Francisco this winter, he told me he had finally “seen the light” (his words!) about tags and categories. Six months later, it’s a reality for WordPress users. Thanks for listening.

I got a bit heated up last night between Matt’s comment that tags and categories function the same and a discussion I was having with Kevin on IM at the same time, about the fact that Technorati parses categories as tags.

I went back to read two of my old posts: Technorati Tagified and Plugin Idea: Weighted Tags by Category which I wrote about a year ago. In both, it’s very clear that as a user, I don’t percieve tags to be the same thing as categories. Tags were something like “public keywords”. Is anybody here going to say that keywords and categories are the same thing? (There is a difference between keywords and tags, but this isn’t the topic here; keywords and tags are IMHO much closer in nature than tags and categories).

Here are, in my opinion, the main differences between tags and categories, from the “tagger” point of view.

  • categories exist before the item I’m categorizing, whereas tags are created in reaction to the item, often in an ad hoc manner: I need to fit the item in a category, but I adapt tags to the item;
  • categories should be few, tags many;
  • categories are expected to have a pretty constant granularity, whereas tags can be very general like “switzerland” or very particular like “bloggyfriday“;
  • categories are planned, tags are spontanous, they have a brainstorm-like nature, as Kevin explains very well: You look at the picture and type in the few words it makes you think of, move on to the next, and you’re done.
  • relations between categories are tree-like, but those between tags are network-like;
  • categories are something you choose, tags are generally something you gush out;
  • categories help me classify what I’m talking about, and tags help me share or spread it;

There’s nothing wrong with Technorati treating categories as tags. I’d say categories are a kind of tag. They are special tags you plan in advance to delimit zones of content, and that you display them on your blog to help your readers find their way through what you say or separate areas of interest (ie, my Grandma will be interested by my Life and Ramblings category and subscribe to that if she has an RSS reader, but she knows she doesn’t care about anything in the Geek category. (By the way, CTTS is not a good example of this, the categories are a real mess.)

So, let’s say categories are tags. I can agree with that. But tags are not categories! Tags help people going through a “search” process. Click on a tag to see related posts/photos. See things outside the world of this particular weblog which have the same label attached. Provide a handy label to collect writings, photos, and stuff from a wide variety of people without requiring them to change the architecture of their blog content (their categories). If you want to, yeah, you can drop categories and use only tags. It works on http://del.icio.us/. But have you noticed how most Flickr users have http://flickr.com/photos/bunny/sets/ in addition to tagging their photos? Sets aren’t categories, but they can be close. They are a way of presenting and organizing things for human beings rather than machines, search engines, database queries.

To get back to my complaint that WordPress.com does not provide real tags, it’s mainly a question of user interface. I don’t care if from a software point of view, tags and categories are the same thing for WordPress. As a user, I need a field in which I can let my fingers gush out keyword-tags once I’ve finished writing my post. I also need someplace to define and structure category-tags. I need to be able to define how to display these two kids of tags (if you want to call them both that) on my blog, because they are ways of classifying or labeling information which I live very differently.

Am I a tag weirdo? Do you also perceive a difference between tags and categories? How would you express or define it? If categories and tags are the same, the new WP2.0 interface for categories should make the Bunny Tags Plugin obsolete — does it?

Requirements for a Multilingual WordPress Plugin [en]

[fr] Quelques réflexions concernant un plugin multilingue pour WordPress.

My blog has been bilingual for a long time now. I’ve hacked bilingualism into it and then plugged it in. Other plugins for multilingual bloggers have been written, and some unfortunately got stuck somewhere in the development limbo.

Defining specs is a hairy problem. They need to work for the person visiting the site (polyglot or monoglot). They need to work for the person (or people! translation often involves more than one person) writing the posts. They need to work for all the robots, search engines, and fancy browsers who deal with the site.

Here is what I would like a multiple language plugin to do (think “feature requirements”, suggestion, draft):

  1. Recognize the browser language preference of the visitor and serve “page furniture” and navigation in the appropriate language. This can be overridden by a cookie-set preference when clicking on a “language link”.
    • “WordPress” furniture can be provided by the normal localization files
    • how do we deal with other furniture content in the theme (navigation, taglines, etc.)? should the plugin provide with guidelines for theme localization? do such guidelines already exist? extra information appreciated on this point
    • “language links” shouldn’t be flags, but language names in their respective languages; can this list be generated automatically based on present localization files? otherwise, can it be set in an admin panel?
    • upon “language change” (clicking on a language link), could the localization (action) be done in an AHAH– or AJAX-like way?
    • inevitable hairy problem: tag and category localization
  2. Manage “lazy multilingualism” in the spirit of the Basic Bilingual plugin and “true multilingualism” elegantly and on a per-post basis.
    • allow for “other language abstracts”
    • allow for actual other language version of the post
    • given the “general user language” defined above, show posts in that language if a version for that language exists, with mention of other language versions or abstracts
    • if that language doesn’t exist, show post in “main blog language” or “main post language” (worst case scenario: the wordpress install default) and show alongside other language abstracts/versions
    • abstract in one language (would be “excerpt” in the “main” language) and existence of the post in that language are not mutually exclusive, both can coexist
    • does it make more sense to have one WordPress post per language version, or a single post with alternate language content in post_meta? For lazy multilingualism, it makes more sense to have a single WP post with meta content, but fore “translation multilingualism”, it would make more sense to have separate posts with language relationships between them clearly defined in post_meta
  3. Use good markup. See what Kevin wrote sometime back. Make it nice for both polyglot and monoglot visitors. Inspiration?
    • use <div lang="xx"> and also rel attributes
  4. Provide a usable admin panel.
    • when I’m writing the other version of a post, I need access to the initial version for translation or abstracting
    • ideally, different language version should be editable on the same admin panel, even if they are (in the WordPress database) different posts
    • languages in use in the blog should be defined in an options screen, and the plugin should use that information to adapt the writing and editing admin panels
    • idea: radio button to choose post language; N other language excerpt/abstract fields with radio buttons next to them too; abstract radio buttons change dynamically when main post language is set; in addition to other language abstract fields, another field which can contain a post id/url (would have to see what the best solution is) to indicate “this is an equivalent post in another language” (equivalent can be anything from strict translation to similar content and ideas); this means that when WP displays the blog, it must make sure it’s not displaying a post in language B which has an equivalent in language A (language A being the visitor’s preferred language as defined above)
  5. Manage URLs logically (whatever that means).
    • if one post in two languages means two posts in WP, they will each have their own slug; it could be nice, though, to be able to switch from one to an other by just adding the two-letter language code on the end of any URL; a bit of mod_rewrite magic should do it
  6. Integrate into the WordPress architecture in a way that will not break with each upgrade (use post-meta table to define language relationships between different posts, instead of modifying the posts table too much, for example.)
    • one post translated into two other languages = 3 posts in the WP posts table
    • excerpts and post relationships stored in post_meta
    • language stored in post_meta

I have an idea for plugin development. Once the specs are drafted out correctly, how about a bunch of us pool a few $ each to make a donation to (or “pay”) the person who would develop it? Who would be willing to contribute to the pool? Who would be willing to develop such a plugin (and not abandon the project half-way) in these conditions?

These specs need to be refined. We should start from the markup/reader end and get that sorted out first. Then, think about the admin panel/writer end. Then worry about code architecture. How does that sound?

We’ve started a discussion over on the microformats.org wiki. Please join us!

Update: this post is going to suffer from ongoing editing as I refine and add ideas.

Plugin Updates for WordPress 2.0 [en]

[fr] Les plugins "Bunny's Technorati Tags" et "Basic Bilingual" fonctionnent à  présent avec WordPress 2.0.

It took me a couple of hours, but both Basic Bilingual and Bunny’s Technorati Tags are now WordPress 2.0-compatible.

A few minor tweaks have been made, most significant of which is that these two plugins can now be used for Pages in addition to normal posts.

They should work, though I haven’t troubleshooted them extensively — please ring the bell if you bump into any problems.

Kit d'installation de WordPress en français [fr]

Quelques liens et instructions très sommaires (un pense-bête, plutôt) pour installer Wordpress en français.

[en] A few links and very brief instructions to install WordPress in French "my way".

Avertissement: WordPress 2.0 sort bientôt, et la suggestion présentée ici sera très certainement caduque à  ce moment-là .

Il vous faut:

Côté réglages:

  • chmod 775 (ou 777, j’ai un doute) sur le répertoire wp-content/plugins ainsi que wp-content/themes
  • uploader un .htaccess vide, chmod 777
  • dans les options, copier-coller l’exemple donné pour les permaliens
  • activer le plugin Markdown (et les autres plugins qu’on a installés)