28 Aug 2002 Rhys   » (Journeyer)

I have this account's passphrase back (it was obvious once I saw it, but then these things always are, I guess). Thanks to Telsa and to yosh for their help.

I've been wondering how the current group of spam filters, from Vipul's Razor via SpamAssassin to the probabilistic ones inspired by Paul Graham's work, actually collect their spam/non-spam corpora and, where appropriate, adapt their n-gram and other lexical analyses. I'm noting that here in order to embarrass myself into writing something about it in the very near future.
