ClickAider
You are currently browsing the Bogle’s Blog weblog archives for the day Saturday, December 17th, 2005.

Rubyful Soup: A scraping tool for Ruby

Beautiful Soup is my favorite python parsing and scraping tool. I was delighted to discover that the author has created a Ruby port called Rubyful Soup

1. Rubyful Soup won’t choke if you give it bad markup. It yields a parse tree that makes approximately as much sense as your original document. This is usually good enough to collect the data you need and then run away.
2. Rubyful Soup provides a few simple methods and Ruby-like idioms for navigating and searching a parse tree: a toolkit for dissecting a document and extracting what you need. You don’t have to create a custom parser for each application. It’s more flexible and easier to learn than XPath.

I enjoy the author’s sense of humor. This comment block is from the original:


# Enterprise class names! It has come to our attention that some people
# think the names of the Beautiful Soup parser classes are too silly
# and “unprofessional” for use in enterprise screen-scraping. We feel
# your pain! For such-minded folk, the Beautiful Soup Consortium And
# All-Night Kosher Bakery recommends renaming this file to
# “RobustParser.py” (or, in cases of extreme enterprisitude,
# “RobustParserBeanInterface.class”) and using the following
# enterprise-friendly class aliases:
class RobustXMLParser(BeautifulStoneSoup):
pass
class RobustHTMLParser(BeautifulSoup):
pass
class RobustWackAssHTMLParser(ICantBelieveItsBeautifulSoup):
pass
class SimplifyingSOAPParser(BeautifulSOAP):
pass

Gmail mobile

From Search Engine Watch:

If you’ve got a web-enabled mobile phone and a Gmail account, you can now access your mail on the run…

Gmail mobile detects the type of device you’re using, and formats your mail accordingly, meaning you’ll have a different view using a comparatively wide-screen Treo than a smaller Motorola handset. You’ll also be able to view attachments, including Office files and PDF documents.

Another cool feature is “call to reply.” If you have saved the phone number of an email sender in your contacts list, you have the option to reply via email or by calling them on the phone when using Gmail mobile.

To access Gmail mobile, just enter m.gmail.com on your mobile phone browser.