You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@manifoldcf.apache.org by "Shinichiro Abe (JIRA)" <ji...@apache.org> on 2013/01/15 12:48:16 UTC
[jira] [Comment Edited] (CONNECTORS-613) The content of sjis file
can't be extracted
[ https://issues.apache.org/jira/browse/CONNECTORS-613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13553701#comment-13553701 ]
Shinichiro Abe edited comment on CONNECTORS-613 at 1/15/13 11:47 AM:
---------------------------------------------------------------------
It doesn't work ManifoldCF 1.0.1 and trunk. This problem occurs for Solr 4.x not for Solr 3.x. It seems the cause comes from Tika.
was (Author: shinichiro abe):
it doesn't work ManifoldCF 1.0.1 and trunk. This problem occurs for Solr 4.x not for Solr 3.0. It seems the cause comes from Tika.
> The content of sjis file can't be extracted
> -------------------------------------------
>
> Key: CONNECTORS-613
> URL: https://issues.apache.org/jira/browse/CONNECTORS-613
> Project: ManifoldCF
> Issue Type: Bug
> Components: File system connector, Lucene/SOLR connector
> Affects Versions: ManifoldCF 1.0.1, ManifoldCF 1.1
> Environment: Solr 4.x (not Solr 3.x)
> Reporter: Shinichiro Abe
> Attachments: files.zip
>
>
> When posting sjis text file by using curl, the content can be extracted.
> {noformat}
> curl "http://localhost:8983/solr/update/extract?literal.id=1&commit=true" -F "myfile=@sjis.txt"
> {noformat}
> But when posting this file by File system connector, it can't be extracted. it results empty.
> It seems that the content of utf-8 text file can be extracted.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira