You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@commons.apache.org by Brant Hahn <br...@insightbb.com> on 2004/11/19 23:29:46 UTC

[HttpClient] Screen Scraping Components?

Hi, 

 

I've been using HttpClient for a few months now.  I was wondering if anyone
out there using had a recommendation on any 3rd party component for screen
scraping?  I've seen a few out there, including Jericho, but generally have
to write more code than I want to when using it.  Just curious if there was
something out there that takes-in regex Pattern objects (or just regex
pre-compiled strings) to easily get the data that I want off of any page.

 

Thanks,

Brant