As part of the current upgrade effort, I decided it was time to go through and tag older blog entries, the ones written before the introduction of tags. These entries were categorized, and the filesystem was used to manage the categorization. Each directory represented a category, and each subdirectory a subcategory. The directories were given descriptive names, suitable for use with the pycategories plugin for pyblosxom.
An obvious approach to automating the retagging of these old entries was to use the category hierarchy itself to provide the tags. I wrote a python program to walk the directory tree, and add tags to the old entries.
I chose to ignore the general
category, though, because as a
tag general
doesn't provide much information. I think I might try
a more sophisticated approach to those entries, analyzing the content of
the entry to choose tags. I haven't worked out all the details, yet, but
I'm considering building a map of tagged entries, based on the frequency
of certain words or phrases that appear in them, then applying that map
to a histogram of the entries currently categorized as
general
.
In preparation for adding a tagcloud, I've started revising the layout of my blog. The goals are as stated before: to be standards compliant, readable, minimal, and to provide control to the user.
Progress can be seen here, complete with
ugly background colors for debugging. This iteration will include styles
for embedded code fragments, and markup for photos, along with a
fixed-fluid
two-column layout.
Comment welcome...
Well, comments and authentication are working, however I moved to the off-the-shelf comments module, which has it's own scheme for comment formatting, most notably substituting <br /> for newlines in the comment body.
I should merge my changes to parse HTML and include it in the comment database as CDATA. Maybe I can do that today.
Also, it's time to bring back the sidebar, with the tagcloud, linkroll, feeds, a flickr, etc.
I'm with several feeds to display, I'm going to revisit a generalized approach, with decoupled retrieval/caching and parsing/display.
Finally, I'd like to put a related tags
list on each tag
page, a feature not currently supported by the tag plugin.
At last, after struggling for a while with the session plugin, I gave up and fell back to a javascript-based check for comment spam, so the comment form works again (as long as you have javascript enabled in your browser).
Aside from it works
, the advantage of this method is it's
transparent to most users, with the notable exception of Jeff, a contrarian who uses the
NoScript
extension of Firefox. He refused my captcha session cookies, too, so it
probably doesn't matter in the long run. If you want to refute or rebut,
Jeff, you'll have to enable scripts for this site.
Next on the list should probably be better site navigation. I've heard complaints that there's no link from the a particular entry page back to the main page. Also tagging of older entries and tag clusters.
'Tis the morning of Christmas and all through the house, not a creature is stirring, except me and the cats—actually, that's most of the creatures—all the creatures were stirring except Barbara, who's still having visions of sugarplums.
Instead of sugarplums, I had a dream last night about a method for
tagging blog entries. Every concept
used as a tag should have at
least two aspects, one more general, one more specfic. For example, to
tag an item christmas
, the item would receive the tags
holiday
and christmas
. To add the tag season
, one
might use season
and winter
. Each tag is an orthogonal
axis, and receives several more or less general points along that axis.
I don't know if this is a terrible idea or not, but I suspect I should
probably read more about taxonomies before I present my recently
invented round-thing that I've decided to call the wheel
.
I'm intrigued by the way F-Spot uses a tree structure to
organize tags. If you have
a tag Places
, and under that tag, you add a child,
Nebraska
, and under that, another, Lincoln
, when you add
the Lincoln
tag to a photo, it automatically and transparently
behaves as if it also has the Places
and Nebraska
tags. That seems like a cool approach, though it makes adding a new tag
a little more cumbersome: it requires that the tree structure be edited
whenever a new tag is added, to put the new tag on the correct branch of
the tree. Also, photos exported to flickr only receive
the leaf tags. I'm not sure, yet, if that's a bug or a feature.
All that is just a digression from the actual subject of this post, which was intended to be a set of Christmas wishes, two front teeth style. So without further ado, here are a few things I'd like to find under the tree this morning.