You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@commons.apache.org by Brant Hahn <br...@insightbb.com> on 2004/11/19 23:29:46 UTC
[HttpClient] Screen Scraping Components?
Hi,
I've been using HttpClient for a few months now. I was wondering if anyone
out there using had a recommendation on any 3rd party component for screen
scraping? I've seen a few out there, including Jericho, but generally have
to write more code than I want to when using it. Just curious if there was
something out there that takes-in regex Pattern objects (or just regex
pre-compiled strings) to easily get the data that I want off of any page.
Thanks,
Brant