RAA - htmltokenizer/0.93

htmltokenizer / 0.93

Short description: A port of Perl's HTML::TokeParser::Simple to Ruby
Category: Library/HTML
Status: stable
Created: 2003-07-18 03:29:25 GMT
Last update: 2004-03-04 21:35:49 GMT
Owner: Ben Giddings (Projects of this owner)
Homepage: http://rubyforge.org/projects/htmltokenizer/
Download: http://rubyforge.org/download.php/382/htmltokenizer.tgz
License: Ruby's
Dependency:
None
Description:

This is a simple HTML parsing class which takes a string and parses out tokens. A quick example:

require 'htmltokenizer'

page = getSomePageFromTheInternetAsAString()
tokenizer = HTMLTokenizer.new(page)
while token = tokenizer.getTag('a', 'font', '/tr', 'div')
if 'div' == token.tag_name and 'headlinesheader' == token.attr_hash['class']
doSomething()
end
end

Now updated to handle namespaces.

Versions: [1.0.0 (2005-08-07)] [0.93 (2004-03-04)]

Edit this project (for project owner)

back to RAA top