You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Furkan KAMACI (JIRA)" <ji...@apache.org> on 2013/05/07 10:45:27 UTC

[jira] [Commented] (SOLR-4358) SolrJ, by preventing multi-part post, loses key information about file name that Tika needs

    [ https://issues.apache.org/jira/browse/SOLR-4358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13650611#comment-13650611 ] 

Furkan KAMACI commented on SOLR-4358:
-------------------------------------

This has been committed into 4.3 (http://lucene.apache.org/solr/4_3_0/changes/Changes.html#4.3.0.new_features) We can change fix version/s of this issue.
                
> SolrJ, by preventing multi-part post, loses key information about file name that Tika needs
> -------------------------------------------------------------------------------------------
>
>                 Key: SOLR-4358
>                 URL: https://issues.apache.org/jira/browse/SOLR-4358
>             Project: Solr
>          Issue Type: Bug
>          Components: clients - java
>    Affects Versions: 4.0
>            Reporter: Karl Wright
>            Assignee: Ryan McKinley
>             Fix For: 5.0, 4.4
>
>         Attachments: additional_changes.diff, SOLR-4358.patch, SOLR-4358.patch, SOLR-4358.patch
>
>
> SolrJ accepts a ContentStream, which has a name field.  Within HttpSolrServer.java, if SolrJ makes the decision to use multipart posts, this filename is transmitted as part of the form boundary information.  However, if SolrJ chooses not to use multipart post, the filename information is lost.
> This information is used by SolrCell (Tika) to make decisions about content extraction, so it is very important that it makes it into Solr in one way or another.  Either SolrJ should set appropriate equivalent headers to send the filename automatically, or it should force multipart posts when this information is present.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org