You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Luis Filipe Nassif (JIRA)" <ji...@apache.org> on 2018/04/02 13:06:00 UTC

[jira] [Commented] (TIKA-2620) Set sys property to get better rendering speed by default

    [ https://issues.apache.org/jira/browse/TIKA-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16422231#comment-16422231 ] 

Luis Filipe Nassif commented on TIKA-2620:
------------------------------------------

Hi [~tilman]. When printing PDFs to images before OCR, our default is to use 300dpi. If the image is bigger than that, it will be scaled down at the end. Reading PDFBOX-4137, I understood images will be subsampled before being decoded and not when rendering, possibly saving lots of memory, or am I wrong?

Thanks

> Set sys property to get better rendering speed by default
> ---------------------------------------------------------
>
>                 Key: TIKA-2620
>                 URL: https://issues.apache.org/jira/browse/TIKA-2620
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Trivial
>             Fix For: 1.18, 2.0.0
>
>
> After upgrading to PDFBox 2.0.9, we now get a logged warning:
> {noformat}
> INFO  To get higher rendering speed on JDK8 or later,
> INFO    use the option -Dsun.java2d.cmm=sun.java2d.cmm.kcms.KcmsServiceProvider
> INFO    or call System.setProperty("sun.java2d.cmm", "sun.java2d.cmm.kcms.KcmsServiceProvider")
> {noformat}
> Unless there are objections, I'll add a static call to the PDFParser to {{System.setProperty...}}.  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)