5 Sep 2006 gnutizen   » (Journeyer)

Slashdot mentioned that Google had release a free (depending on it's dependence on the included aspirin package, and the status of that) OCR program. I tried it out on an image that clara and gocr had done, it bailed on me. I just ran one of the gocr test images with it and it did OK (although it had real problems with numbers, and seemed to split one letter into multiple letters, e.g. W -> VV). It is written to C++. I'm glad they released it, although I don't think it is the complete free OCR magic bullet yet, more work needs to be done on it, or it needs to borrow (or be borrowed) concepts and code from other OCR programs etc. I want to contribute to free OCR but I don't have the time so much.

Latest blog entries     Older blog entries

New Advogato Features

New HTML Parser: The long-awaited libxml2 based HTML parser code is live. It needs further work but already handles most markup better than the original parser.

Keep up with the latest Advogato features by reading the Advogato status blog.

If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!