You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/02/22 00:34:00 UTC

[jira] [Resolved] (TIKA-2586) PDFParser documentation has incorrect DPI default

     [ https://issues.apache.org/jira/browse/TIKA-2586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tim Allison resolved TIKA-2586.
-------------------------------
    Resolution: Fixed

Thank you!

> PDFParser documentation has incorrect DPI default
> -------------------------------------------------
>
>                 Key: TIKA-2586
>                 URL: https://issues.apache.org/jira/browse/TIKA-2586
>             Project: Tika
>          Issue Type: Improvement
>          Components: documentation
>            Reporter: Ewan Mellor
>            Priority: Minor
>
> On [https://wiki.apache.org/tika/PDFParser%20%28Apache%20PDFBox%29] it says:
> {quote}This method of OCR is triggered by the ocrStrategy parameter, but users can manipulate other parameters, including the image type (see org.apache.pdfbox.rendering.ImageType for options) and the dots per inch dpi. The defaults are: gray and 200 respectively.
> {quote}
> The stated DPI default here is incorrect.  In both tika/tika-parsers/src/main/resources/org/apache/tika/parser/pdf/PDFParser.properties and tika/tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDFParserConfig.java the ocrDPI value is set to 300.
> This is an immutable wiki page (at least to me) so I can't change it.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)