Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2022/07/27 18:55:00 UTC

[jira] [Updated] (TIKA-1484) Boilerpipe dependency is evil

     [ https://issues.apache.org/jira/browse/TIKA-1484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tim Allison updated TIKA-1484:
------------------------------
    Attachment: TIKA-1484.patch

> Boilerpipe dependency is evil
> -----------------------------
>
>                 Key: TIKA-1484
>                 URL: https://issues.apache.org/jira/browse/TIKA-1484
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.6
>            Reporter: Ben McCann
>            Priority: Major
>         Attachments: TIKA-1484.patch
>
>
> The Boilerpipe project bundles two classes from org.cyberneko.html inside it. We're already using NekoHTML in our project. Depending on which library shows up on our classpath, certain parts of our project will either work or not. I'd really love it if Boilerpipe could be fixed or replaced with some other library that is a better citizen.
> I see I'm not the first person to run into this as another Tika user has filed a bug on the Boilerpipe project: https://code.google.com/p/boilerpipe/issues/detail?id=62
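
A minimal sketch of one way to check for the kind of duplicate described above: it lists every classpath location that provides a given class file, so more than one hit means the class loaded at runtime depends on classpath order. The names DuplicateClassFinder and locationsOf are hypothetical, and the class to look up is passed as an argument because the report does not name the two bundled classes.

```java
import java.io.IOException;
import java.net.URL;
import java.util.Collections;
import java.util.List;

// Hypothetical diagnostic helper, not part of Tika or Boilerpipe.
final class DuplicateClassFinder {

    /** Returns every location visible to the loader that provides the given class file. */
    public static List<URL> locationsOf(String className, ClassLoader loader) throws IOException {
        // A class named a.b.C is stored as the resource a/b/C.class.
        String resource = className.replace('.', '/') + ".class";
        return Collections.list(loader.getResources(resource));
    }

    public static void main(String[] args) throws IOException {
        if (args.length == 0) {
            System.err.println("usage: java DuplicateClassFinder <fully.qualified.ClassName>");
            return;
        }
        List<URL> hits = locationsOf(args[0], DuplicateClassFinder.class.getClassLoader());
        for (URL url : hits) {
            System.out.println(url);
        }
        if (hits.size() > 1) {
            System.out.println("Found in " + hits.size() + " locations; load order decides which one wins.");
        }
    }
}
```

Run it with the same classpath as the affected application and pass the fully qualified name of a class under org.cyberneko.html. If both the Boilerpipe jar and the NekoHTML jar appear in the output, the conflict described in the report is present.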



--
This message was sent by Atlassian Jira
(v8.20.10#820010)