You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Hudson (Jira)" <ji...@apache.org> on 2021/09/14 16:18:00 UTC

[jira] [Commented] (TIKA-3553) Improve robustness of httpfetcher

    [ https://issues.apache.org/jira/browse/TIKA-3553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415031#comment-17415031 ] 

Hudson commented on TIKA-3553:
------------------------------

SUCCESS: Integrated in Jenkins build Tika ยป tika-main-jdk8 #328 (See [https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/328/])
TIKA-3553 -- improve robustness of http fetcher (tallison: [https://github.com/apache/tika/commit/2eac95cb3221c3649eef63e40654be1c264a6913])
* (edit) tika-pipes/tika-fetchers/tika-fetcher-http/src/main/java/org/apache/tika/pipes/fetcher/http/HttpFetcher.java
* (edit) tika-pipes/tika-httpclient-commons/src/main/java/org/apache/tika/client/HttpClientFactory.java


> Improve robustness of httpfetcher
> ---------------------------------
>
>                 Key: TIKA-3553
>                 URL: https://issues.apache.org/jira/browse/TIKA-3553
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Assignee: Tim Allison
>            Priority: Major
>             Fix For: 2.1.1
>
>
> In practice, the httpclient library can return nulls in (to me) unexpected places.  We should add more null checks.  
> Also, we should try to back off and retry with a disallow compression if there was a length exception in making a request.
> Finally, users should be able to configure the UserAgent sent in the header.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)