Posted to dev@nutch.apache.org by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/08/23 17:53:29 UTC

[jira] [Resolved] (NUTCH-1089) short compressed pages caused Exception

     [ https://issues.apache.org/jira/browse/NUTCH-1089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julien Nioche resolved NUTCH-1089.
----------------------------------

    Resolution: Fixed

1.4: committed revision 1160753.
trunk: committed revision 1160754.

Thanks Simone!

> short compressed pages caused Exception  
> -----------------------------------------
>
>                 Key: NUTCH-1089
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1089
>             Project: Nutch
>          Issue Type: Bug
>            Reporter: simone frenzel
>              Labels: patch
>         Attachments: HttpResponsePatch.patch
>
>
> Hi,
> I tested Nutch on compressed pages, and on pages with Basic Auth and compression. On short compressed pages this exception is thrown:
> 2011-08-19 17:06:55,190 ERROR httpclient.Http - java.io.IOException: unzipBestEffort returned null
> 2011-08-19 17:06:55,190 ERROR httpclient.Http - at org.apache.nutch.protocol.http.api.HttpBase.processGzipEncoded(HttpBase.java:310)
> 2011-08-19 17:06:55,191 ERROR httpclient.Http - at org.apache.nutch.protocol.httpclient.HttpResponse.<init>(HttpResponse.java:163)
> 2011-08-19 17:06:55,191 ERROR httpclient.Http - at org.apache.nutch.protocol.httpclient.Http.getResponse(Http.java:154)
> 2011-08-19 17:06:55,191 ERROR httpclient.Http - at org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:138)
> 2011-08-19 17:06:55,191 ERROR httpclient.Http - at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:628)
> In some cases Basic Auth also fails.
> It works fine with the patch.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira