Posted to dev@tika.apache.org by "Sergey Beryozkin (JIRA)" <ji...@apache.org> on 2013/12/05 17:40:37 UTC

[jira] [Commented] (TIKA-1121) Socket server text parsing error on large text files

    [ https://issues.apache.org/jira/browse/TIKA-1121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13840252#comment-13840252 ] 

Sergey Beryozkin commented on TIKA-1121:
----------------------------------------

Can you experiment with the latest snapshots? You may have to build tika-server manually. Use multipart/form-data payloads, though regular requests may also behave better now, as I've removed a call that led to a temporary FileInputStream being created.
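
For anyone wanting to try the multipart/form-data route, here is a rough sketch of such a request using only the Python standard library. It is an illustration under assumptions, not a tested recipe against the snapshot: the URL (host, port 9998, and the /tika/form path), the form field name "upload", and the helper names build_multipart and extract_text are all hypothetical and should be adjusted to whatever the tika-server build you run actually exposes.

```python
import uuid
import urllib.request


def build_multipart(field_name, filename, payload, boundary=None):
    """Return (body, content_type) for a single-file multipart/form-data request."""
    boundary = boundary or uuid.uuid4().hex
    # One part: headers, blank line, then the raw file bytes.
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field_name}"; filename="{filename}"\r\n'
        "Content-Type: text/plain\r\n"
        "\r\n"
    ).encode("utf-8")
    # Closing delimiter marks the end of the multipart body.
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + payload + tail, f"multipart/form-data; boundary={boundary}"


def extract_text(path, url="http://localhost:9998/tika/form"):
    # ASSUMPTION: the URL and the "upload" field name are placeholders;
    # check them against the tika-server snapshot you built.
    with open(path, "rb") as fh:
        body, content_type = build_multipart("upload", path, fh.read())
    request = urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={"Content-Type": content_type, "Accept": "text/plain"},
    )
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8", errors="replace")
```

Note that this sketch reads the whole file into memory before sending it, which is fine for a quick experiment but not ideal for very large inputs; a streaming HTTP client would be a better fit there.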

> Socket server text parsing error on large text files
> ----------------------------------------------------
>
>                 Key: TIKA-1121
>                 URL: https://issues.apache.org/jira/browse/TIKA-1121
>             Project: Tika
>          Issue Type: Bug
>          Components: cli
>    Affects Versions: 1.4
>         Environment: Ubuntu 10.04, 10.10, 12.04.02
>            Reporter: Dave Meikle
>            Assignee: Dave Meikle
>
> As reported on the user list[1], when using the tika-app socket server command with the -t switch to parse text, the process hangs on large text files.
> This occurs on Ubuntu 10.04, 10.10 and 12.04.02.
> [1]http://mail-archives.apache.org/mod_mbox/tika-user/201305.mbox/%3CCAGxBzUFxSJ4h5jWdeUX9HhD2FxtTQ1vsbM7u-VfSyGE9VmrQHQ@mail.gmail.com%3E



--
This message was sent by Atlassian JIRA
(v6.1#6144)