You are viewing a plain text version of this content. The canonical link for it is here.
Posted to oak-issues@jackrabbit.apache.org by "Davide Giannella (JIRA)" <ji...@apache.org> on 2017/07/06 15:35:06 UTC
[jira] [Closed] (OAK-6377) Text extraction with oak-run and tika
requires fake string in the command to work
[ https://issues.apache.org/jira/browse/OAK-6377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Davide Giannella closed OAK-6377.
---------------------------------
Bulk close for 1.7.3
> Text extraction with oak-run and tika requires fake string in the command to work
> ---------------------------------------------------------------------------------
>
> Key: OAK-6377
> URL: https://issues.apache.org/jira/browse/OAK-6377
> Project: Jackrabbit Oak
> Issue Type: Bug
> Components: lucene, run
> Affects Versions: 1.8, 1.7.2
> Reporter: Paul Chibulcuteanu
> Assignee: Chetan Mehrotra
> Priority: Minor
> Fix For: 1.8, 1.7.3
>
>
> According to the [text-extraction documentation| https://github.com/apache/jackrabbit-oak/blob/trunk/oak-doc/src/site/markdown/query/pre-extract-text.md#step-3---perform-the-text-extraction] there is currently no need to set a segmentstore for the extract command.
> {code}
> java -cp tika-app-1.15.jar:oak-run.jar \
> org.apache.jackrabbit.oak.run.Main tika \
> --data-file binary-stats.csv \
> --store-path ./store \
> --fds-path /path/to/datastore --extract
> {code}
> The command parser expects a string option so the workaround for this would be to provide a fake string at the end.
> e.g:
> {code}
> java -cp .......... --extract fakestore
> {code}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)