Posted to dev@tika.apache.org by "Ben McCann (JIRA)" <ji...@apache.org> on 2014/11/19 22:48:33 UTC

[jira] [Created] (TIKA-1484) Boilerpipe dependency is evil

Ben McCann created TIKA-1484:
--------------------------------

             Summary: Boilerpipe dependency is evil
                 Key: TIKA-1484
                 URL: https://issues.apache.org/jira/browse/TIKA-1484
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 1.6
            Reporter: Ben McCann


The Boilerpipe project bundles its own copies of two classes from org.cyberneko.html. We're already using NekoHTML in our project. Depending on which library's copy gets loaded from our classpath, certain parts of our project will either work or not. I'd really love it if Boilerpipe could be fixed or replaced with some other library that is a better citizen.
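
For anyone wanting to confirm the clash on their own classpath, here is a rough sketch that lists every location a given class can be loaded from. The helper class DuplicateClassFinder is hypothetical, and the default class name org.cyberneko.html.HTMLConfiguration is only an assumed example of a NekoHTML class; this report does not say which two classes Boilerpipe bundles, so pass the one you suspect as an argument.

```java
import java.io.IOException;
import java.net.URL;
import java.util.Collections;
import java.util.List;

// Hypothetical diagnostic helper, not part of Tika, Boilerpipe, or NekoHTML.
public class DuplicateClassFinder {

    // Convert a fully qualified class name to its classpath resource path.
    static String toResourcePath(String className) {
        return className.replace('.', '/') + ".class";
    }

    // Return every classpath location that provides the given class.
    static List<URL> locations(String className) throws IOException {
        ClassLoader loader = Thread.currentThread().getContextClassLoader();
        if (loader == null) {
            loader = ClassLoader.getSystemClassLoader();
        }
        return Collections.list(loader.getResources(toResourcePath(className)));
    }

    public static void main(String[] args) throws IOException {
        // Assumed example class name; replace with the class you suspect.
        String name = args.length > 0 ? args[0] : "org.cyberneko.html.HTMLConfiguration";
        List<URL> found = locations(name);
        System.out.println(name + " found in " + found.size() + " location(s):");
        for (URL url : found) {
            System.out.println("  " + url);
        }
    }
}
```

If more than one location is printed, more than one jar provides the class, and which copy wins depends on classpath order.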

I see I'm not the first person to run into this, as another Tika user has filed a bug on the Boilerpipe project: https://code.google.com/p/boilerpipe/issues/detail?id=62



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)