1 Jul 2002 RossBurton   » (Master)


For the first time in a long time, my girlfriend and I had a weekend where neither us were being savaged by hay fever, flu, back ache or work. This felt remarkably good, so I celebrated by refusing to turn my computer on until the early evening for a quick "when is the new Counting Crows album coming out" check (next week, dammit). Saturday was a good day -- long chats in pubs, a bit of shopping, laughing, cooking, followed by red wine and Best Of Luis Theroux in the evening.


Now that I finally have some web space to go with my domain, I can start putting some content up. This led to my routine adventure into metadata...

I want to keep a collection of screenshots and photos online, as we all do. I'm against 100 files, all called screenshot-43.png, so some form of index is required. I'm also against information being in the wrong place, i.e. in a seperate file. The only file which knows what am image is about, is the image. A quick peek at the PNG specification confirmed that the text chunks are have keywords associated with them, so a PNG image can have a creation date, title, author etc.

So, all I need is:

  1. A CGI script which can examine a directory, scan the metadata, and create a titled, date sorted index
  2. A nice GUI tool to edit said metadata

(1) I could code up in a day or so if Apache Cocoon 2 were installed on the server, and I still had my love for Java. Cocoon2 is a wonderful piece of software, and excellent for this sort of thing. However, its too heavy for a single directory... does anyone know of a tool which does this? Or should I just sit down and write it in Python myself...

(2) I suppose needs writing — I remember a Java tool written by a Dublin Core dude, but that only handled JPEG images. I'm thinking Python + GTK + Bonobo + gnome-vfs should be good, and handle the file opening and viewing nicely. Drop in a few objects and edit the metadata of any file desired. All I need now is for jamesh to add the application stuff from gnome-vfs to the GNOME bindings for Python... :-)

Latest blog entries     Older blog entries

New Advogato Features

New HTML Parser: The long-awaited libxml2 based HTML parser code is live. It needs further work but already handles most markup better than the original parser.

Keep up with the latest Advogato features by reading the Advogato status blog.

If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!