6 Sep 2007 Akira   » (Master)

Coding Vacation

A few years ago for my thesis, I was interested in the implementation of engines to filter information - using Lisp language.

One of the sources of inspiration was work of Christian Queinnec and in particular its book “filtering: an application of (and for) lisp” unfortunately available only in French.

I have just ordered again this book (lost in my many relocations) and I should receive it soon.

Indeed, I intend to take one week of vacation to code! and I envisaged to produce a generic engine to filter information - engine written in Python.

Soon or later, this engine will be used in the iTrade project to extract financial information from websites ; but I intend also to comment on and to develop this library to improve the effectiveness of Python when it is a question of extracting and managing information.

I wish to show that the power of expression of a filter makes it possible to appreciably reduce the size of the code necessary to extract information and that it is possible in Python.

Not to waste time for the logistic of this new project, I will park this work under a sub folder of the iTrade project. I would give more information later on when the coding session starts.

NB: for the name of the project, I imagine something around “Paragon”.

Latest blog entries     Older blog entries

New Advogato Features

New HTML Parser: The long-awaited libxml2 based HTML parser code is live. It needs further work but already handles most markup better than the original parser.

Keep up with the latest Advogato features by reading the Advogato status blog.

If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!