You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Kevin Oberlag (JIRA)" <ji...@apache.org> on 2017/03/08 02:46:37 UTC

[jira] [Updated] (TIKA-2290) PDFParser 'ocr' properties cannot be set via headers when using Tika JAXRS

     [ https://issues.apache.org/jira/browse/TIKA-2290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kevin Oberlag updated TIKA-2290:
--------------------------------

Thanks for getting to it so quickly!




> PDFParser 'ocr' properties cannot be set via headers when using Tika JAXRS
> --------------------------------------------------------------------------
>
>                 Key: TIKA-2290
>                 URL: https://issues.apache.org/jira/browse/TIKA-2290
>             Project: Tika
>          Issue Type: Bug
>          Components: ocr, parser
>    Affects Versions: 1.13, 1.14
>            Reporter: Kevin Oberlag
>            Assignee: Tim Allison
>             Fix For: 2.0, 1.15
>
>
> I have created a stackoverflow question on this topic [here | http://stackoverflow.com/questions/42602834/x-tika-pdfocrstrategy-is-an-invalid-x-tika-ocr-header-error], but I'll reiterate the main issue. 
> I am trying to use TikaJAXRS and add headers for setting PDFParser properties. Specifically the ocrStrategy property. However, when I add the header using X-Tika-PDFocrStrategy, I get an error stating that it is an invalid X-Tika-OCR header.
> After looking into the source code, I believe the issue might be with the 'fillParseContext' method in the TikaResource.java file.
> The if statement first looks for a key that starts with the OCR header prefix, and since the PDFParser's property name contains 'ocr', it is trying to find a property named 'ocrStrategy' in the OCRParser class, which doesn't exist.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)