You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Juho Mäkinen <ju...@gmail.com> on 2005/06/01 10:10:18 UTC

Re: [jira] Updated: (NUTCH-54) Fetcher improvements

I'm looking very forward to get this patch into main trunk.

I have developped a single-signon framework on top of the protocol-httpclient.
This will provide an easy way to create authentication plugins to support
different form based (even with multiple different forms, GET and POST
methods and
cookie handling for session tickets) authentication frameworks and single-signon
systems.

My future patch will affect protocol-httpclient a bit, but I'm hoping
that Andrrzej's
NUTCH-54 patch will be accepted into the main trunk, so that I can easily
create my own patch agains the main trunk and not against NUTCH-54 patches.

 - Juho Mäkinen, juho.makinen@gmail.com

On 5/31/05, Andrzej Bialecki  (JIRA) <ji...@apache.org> wrote:
>      [ http://issues.apache.org/jira/browse/NUTCH-54?page=all ]
> 
> Andrzej Bialecki  updated NUTCH-54:
> -----------------------------------
> 
>     Attachment: final-plugins.zip
> 
> Final version of the new/modified plugins:
> 
> * parse-js: heuristic link extractor for JavaScript files and code snippets.
> 
> * protocol-httpclient: HTTP/HTTPS protocol handler
> 
> * parse-html: modified HTML parser, including new TagSoup-based implementation.
> 
> > Fetcher  improvements
> > ---------------------
> >
> >          Key: NUTCH-54
> >          URL: http://issues.apache.org/jira/browse/NUTCH-54
> >      Project: Nutch
> >         Type: Improvement
> >   Components: fetcher
> >     Reporter: Andrzej Bialecki
> >     Assignee: Andrzej Bialecki
> >  Attachments: 20050518.patch, ProtocolOutput.java, final-plugins.zip, final.diff, new-plugins.zip, parsestatus.patch, status.patch
> >
> > Fetcher improvements.