Posted to dev@lucene.apache.org by "Alexandre Rafalovitch (JIRA)" <ji...@apache.org> on 2016/10/02 01:44:20 UTC

[jira] [Closed] (SOLR-2217) Odd response format when using extractOnly option with Solr Cell

     [ https://issues.apache.org/jira/browse/SOLR-2217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alexandre Rafalovitch closed SOLR-2217.
---------------------------------------
    Resolution: Cannot Reproduce

The affected version is many releases behind for all components. If this issue is seen with a more recent Solr version, a new ticket can be opened.

> Odd response format when using extractOnly option with Solr Cell
> ----------------------------------------------------------------
>
>                 Key: SOLR-2217
>                 URL: https://issues.apache.org/jira/browse/SOLR-2217
>             Project: Solr
>          Issue Type: Bug
>          Components: contrib - Solr Cell (Tika extraction)
>    Affects Versions: 1.4.1
>         Environment: Ubuntu 10.4 LTS (Lucid), Java version "1.6.0_18" OpenJDK Runtime Environment (IcedTea6 1.8.2) (6b18-1.8.2-4ubuntu2) OpenJDK 64-Bit Server VM (build 16.0-b13, mixed mode), Tomcat 6
>            Reporter: Donovan Jimenez
>            Priority: Minor
>
> When using the extractOnly request parameter, oas.handler.extraction.ExtractingDocumentLoader uses stream.getName() for parts of the response. This name appears to be null, since the serialized response contains an unnamed string and a list named "null_metadata". It seems more appropriate to use "content" (producing a named string "content" and a list "content_metadata"), or to use whatever oas.handler.extraction.SolrContentHandler uses for the content field name (coded to "content", but mappable by request parameters).
>           // stream.getName() appears to be null here, leaving the content string unnamed
>           rsp.add(stream.getName(), writer.toString());
>           writer.close();
>           String[] names = metadata.names();
>           NamedList metadataNL = new NamedList();
>           for (int i = 0; i < names.length; i++) {
>             String[] vals = metadata.getValues(names[i]);
>             metadataNL.add(names[i], vals);
>           }
>           // a null name concatenated with "_metadata" yields the key "null_metadata"
>           rsp.add(stream.getName() + "_metadata", metadataNL);
> This is mostly to avoid having to use the odd empty-string and null_metadata identifiers in deserialized data (JSON, PHP, Ruby, etc.).
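
The fallback suggested in the report could be sketched roughly as follows. This is a minimal illustration, not Solr's actual code: the class ExtractOnlyKeys, its methods, and the use of a plain Map in place of the Solr response object are all hypothetical, and "content" is assumed as the default name because the report proposes it.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical stand-in for the response-building step; none of these
// names come from Solr itself.
class ExtractOnlyKeys {
    // Assumed fallback name, taken from the suggestion in the report.
    static final String DEFAULT_NAME = "content";

    // Mirrors the reported behavior: Java string concatenation renders a
    // null reference as the text "null", so the key becomes "null_metadata".
    static String originalMetadataKey(String streamName) {
        return streamName + "_metadata";
    }

    // Proposed behavior: fall back to a fixed name when the stream has none.
    static String contentKey(String streamName) {
        return streamName != null ? streamName : DEFAULT_NAME;
    }

    static String metadataKey(String streamName) {
        return contentKey(streamName) + "_metadata";
    }

    // Builds a response-like map with a named content entry and a
    // matching "<name>_metadata" entry.
    static Map<String, Object> buildResponse(String streamName, String text,
                                             Map<String, String[]> metadata) {
        Map<String, Object> rsp = new LinkedHashMap<>();
        rsp.put(contentKey(streamName), text);
        rsp.put(metadataKey(streamName), metadata);
        return rsp;
    }

    public static void main(String[] args) {
        Map<String, String[]> metadata = new LinkedHashMap<>();
        metadata.put("Content-Type", new String[] {"text/plain"});

        System.out.println(originalMetadataKey(null));
        System.out.println(buildResponse(null, "extracted text", metadata).keySet());
    }
}
```

With a null stream name, the sketch produces the keys "content" and "content_metadata" instead of an unnamed entry and "null_metadata". A non-null stream name is kept unchanged.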



