You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/03/19 14:25:50 UTC

[jira] Created: (NUTCH-724) Drop the JAI libraries

Drop the JAI libraries
----------------------

                 Key: NUTCH-724
                 URL: https://issues.apache.org/jira/browse/NUTCH-724
             Project: Nutch
          Issue Type: Bug
            Reporter: Jukka Zitting
            Priority: Blocker
             Fix For: 1.0.0


The PDF parser plugin contains Java Advanced Imaging (JAI) libraries (jai_core.jar and jai_codec.jar) that are licensed under the Sun Binary Code License. The license is incompatible with Apache policies, so we need to drop those libraries.

AFAIK (see PDFBOX-381) PDFBox only uses the JAI libraries for handling page rotations and tiff images, so simply dropping the JAI jars shouldn't have too much impact. A better solution would be to switch to using Apache PDFBox that has a proper workaround for this issue, but the first Apache PDFBox release has not yet been made.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (NUTCH-724) Drop the JAI libraries

Posted by "Sami Siren (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sami Siren resolved NUTCH-724.
------------------------------

    Resolution: Duplicate

> Drop the JAI libraries
> ----------------------
>
>                 Key: NUTCH-724
>                 URL: https://issues.apache.org/jira/browse/NUTCH-724
>             Project: Nutch
>          Issue Type: Bug
>            Reporter: Jukka Zitting
>            Priority: Blocker
>             Fix For: 1.0.0
>
>
> The PDF parser plugin contains Java Advanced Imaging (JAI) libraries (jai_core.jar and jai_codec.jar) that are licensed under the Sun Binary Code License. The license is incompatible with Apache policies, so we need to drop those libraries.
> AFAIK (see PDFBOX-381) PDFBox only uses the JAI libraries for handling page rotations and tiff images, so simply dropping the JAI jars shouldn't have too much impact. A better solution would be to switch to using Apache PDFBox that has a proper workaround for this issue, but the first Apache PDFBox release has not yet been made.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.