Posted to dev@manifoldcf.apache.org by "Shinichiro Abe (JIRA)" <ji...@apache.org> on 2013/01/15 12:48:16 UTC

[jira] [Comment Edited] (CONNECTORS-613) The content of sjis file can't be extracted

    [ https://issues.apache.org/jira/browse/CONNECTORS-613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13553701#comment-13553701 ] 

Shinichiro Abe edited comment on CONNECTORS-613 at 1/15/13 11:47 AM:
---------------------------------------------------------------------

It doesn't work with ManifoldCF 1.0.1 or trunk. The problem occurs with Solr 4.x, not with Solr 3.x. The cause seems to be Tika.
                
      was (Author: shinichiro abe):
    it doesn't work ManifoldCF 1.0.1 and trunk. This problem occurs for Solr 4.x not for Solr 3.0. It seems the cause comes from Tika.
                  
> The content of sjis file can't be extracted
> -------------------------------------------
>
>                 Key: CONNECTORS-613
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-613
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: File system connector, Lucene/SOLR connector
>    Affects Versions: ManifoldCF 1.0.1, ManifoldCF 1.1
>         Environment: Solr 4.x (not Solr 3.x)
>            Reporter: Shinichiro Abe
>         Attachments: files.zip
>
>
> When a Shift_JIS (sjis) text file is posted with curl, its content is extracted.
> {noformat}
> curl "http://localhost:8983/solr/update/extract?literal.id=1&commit=true" -F "myfile=@sjis.txt"
> {noformat} 
> But when the same file is posted through the File system connector, the content is not extracted; the result is empty.
> The content of a UTF-8 text file, by contrast, seems to be extracted.
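
For anyone trying to reproduce the two cases, here is a minimal sketch for preparing comparable inputs, assuming Python 3 is available. The sample string, the helper names, and the file name utf8.txt are hypothetical and are not taken from the attached files.zip. It writes the same text once as Shift_JIS and once as UTF-8, then checks that the Shift_JIS bytes are not valid UTF-8. That would be consistent with an extractor misreading the file if it assumed UTF-8, but whether this is what actually happens in Tika here is not confirmed.

```python
import tempfile
from pathlib import Path

# Hypothetical sample text; the reporter's real files are in files.zip.
SAMPLE = "日本語のテキスト\n"


def write_samples(directory):
    """Write SAMPLE as Shift_JIS and as UTF-8; return both paths."""
    directory = Path(directory)
    sjis_path = directory / "sjis.txt"
    utf8_path = directory / "utf8.txt"
    sjis_path.write_bytes(SAMPLE.encode("shift_jis"))
    utf8_path.write_bytes(SAMPLE.encode("utf-8"))
    return sjis_path, utf8_path


def decodes_as(data, encoding):
    """Return True if the bytes decode cleanly with the given encoding."""
    try:
        data.decode(encoding)
        return True
    except UnicodeDecodeError:
        return False


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        sjis_path, utf8_path = write_samples(tmp)
        for path in (sjis_path, utf8_path):
            data = path.read_bytes()
            print(path.name, len(data), "bytes, valid UTF-8:",
                  decodes_as(data, "utf-8"))
```

The resulting sjis.txt can be posted with the curl command above and also crawled with the File system connector, so that both paths see byte-identical input.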
