You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Varun Thacker (JIRA)" <ji...@apache.org> on 2016/12/27 23:16:58 UTC

[jira] [Commented] (TIKA-2091) regression: Zip bomb detected! for HTML file

    [ https://issues.apache.org/jira/browse/TIKA-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15781497#comment-15781497 ] 

Varun Thacker commented on TIKA-2091:
-------------------------------------

Hi [~tallison@apache.org],

Here's some details on why Solr added it's own Mapper and doesn't use DefaultHtmlMapper : https://issues.apache.org/jira/browse/SOLR-6856?focusedCommentId=14289762&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14289762

> regression: Zip bomb detected! for HTML file
> --------------------------------------------
>
>                 Key: TIKA-2091
>                 URL: https://issues.apache.org/jira/browse/TIKA-2091
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.13
>         Environment: Debian jessie Linux, Oracle Java 8
>            Reporter: Rodrigo Rosenfeld Rosas
>
> Hi, while discussing an issue on Solr's mailing list it was suggested to me to open a ticket here. Please let me know if this is not the proper place for such ticket.
> After upgrading to latest Solr, this document is no longer indexing properly in Solr. They told me they upgraded Tika from 1.7 to 1.13 in Solr 6.2. Before the upgrade this documented was indexed as expected:
> https://www.sec.gov/Archives/edgar/data/1472033/000119380513001310/e611133_f6ef-eutelsat.htm
> I hope a fix could go on time for 1.14 ;)
> Cheers.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)