7 Oct 2008 tinixtech   » (Observer)

I wanted to convert documents from the Internet in plain text “html”

I thought to use the library open-uri which is the easiest way to obtain the contents of html

require 'open-uri' example = open('htp:/http://www.ruby-lang.org/) => #<File:/tmp/open-uri20081002-7271-1exa3en-0> html = example.read

but the Read method returns an entire chain and not what I want is a plain text….I have to do everything to return ordered

Latest blog entries     Older blog entries

New Advogato Features

New HTML Parser: The long-awaited libxml2 based HTML parser code is live. It needs further work but already handles most markup better than the original parser.

Keep up with the latest Advogato features by reading the Advogato status blog.

If you're a C programmer with some spare time, take a look at the mod_virgule project page and help us with one of the tasks on the ToDo list!