Posted to dev@lucene.apache.org by "John Wooden (JIRA)" <ji...@apache.org> on 2013/07/08 23:55:49 UTC

[jira] [Updated] (SOLR-1837) Reconstruct a Document (stored fields, indexed fields, payloads)

     [ https://issues.apache.org/jira/browse/SOLR-1837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

John Wooden updated SOLR-1837:
------------------------------

    Attachment: SOLR-1837_WithHandler.patch

I've updated this patch to use a request handler rather than a JSP. The patch is also confirmed working with Solr 4.2.1.

Performance is still quite slow. The SolrDocReconstructor class hasn't changed much since the prior version.

-- How to use --

1. Add the handler to your config:

<requestHandler name="/admin/docinspector" class="solr.DocumentReconstructorHandler" />

2. Sample call:

/solr/coreX/admin/docinspector?documentid=12345

3. Wait. Time required varies by size of document and index. A large document in a large index may allow enough time for a doughnut & coffee run.

4. Sample output:

<response>
   <lst name="responseHeader">
      <int name="status">0</int>
      <int name="QTime">x</int>
   </lst>
   <str name="DocumentID">12345</str>
   <lst name="Fields">
      <lst name="Stored">
         <str name="documentid">12345</str>
         <str name="isstarter.b_s">true</str>
         <str name="jerseynumber.i_is">16</str>
         <str name="schema">test</str>
         <str name="solrdt">2013-07-03T19:06:42.069Z</str>
      </lst>
      <lst name="Indexed">
         <str name="documentid">12345</str>
         <str name="dodges.i_i">28 | 0 | 0 | 0</str>
         <str name="hits.i_i">17 | 0 | 0 | 0</str>
         <str name="jerseynumber.i_is">16 | 0 | 0 | 0</str>
         <str name="schema">test</str>
         <str name="solrdt">2013-07-03T19:06:42.069Z | 2013-07-03T19:06:42.048Z | 2013-07-03T19:05:40.096Z | 2013-07-03T14:46:48.064Z | 2013-06-01T13:49:27.424Z | 2004-11-03T19:53:47.776Z | 1970-01-01T00:00:00Z | 1970-01-01T00:00:00Z</str>
      </lst>
   </lst>
</response>
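
For anyone who wants to script against the handler, here is a minimal client-side sketch in Python. It is not part of the patch. It assumes the handler path and the "documentid" parameter from the sample call above, a hypothetical Solr base URL of http://localhost:8983/solr, and that " | " in the Indexed section is only ever a separator between reconstructed terms (a term that itself contains "|" would be split incorrectly). The names build_url and parse_response are made up for illustration.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode


def build_url(base, core, document_id):
    # Handler path and "documentid" parameter follow the sample call above.
    # "base" (e.g. http://localhost:8983/solr) is an assumed deployment URL.
    query = urlencode({"documentid": document_id})
    return f"{base}/{core}/admin/docinspector?{query}"


def parse_response(xml_text):
    """Return {"Stored": {...}, "Indexed": {...}} from a docinspector response."""
    root = ET.fromstring(xml_text)
    fields = root.find("./lst[@name='Fields']")
    result = {}
    for section in fields.findall("lst"):
        entries = {}
        for node in section.findall("str"):
            text = node.text or ""
            if section.get("name") == "Indexed":
                # Assumption: " | " separates the terms reconstructed for a field.
                entries[node.get("name")] = [t.strip() for t in text.split("|")]
            else:
                entries[node.get("name")] = text
        result[section.get("name")] = entries
    return result


# Trimmed copy of the sample output above, used to exercise the parser offline.
SAMPLE = """<response>
   <lst name="Fields">
      <lst name="Stored">
         <str name="documentid">12345</str>
      </lst>
      <lst name="Indexed">
         <str name="jerseynumber.i_is">16 | 0 | 0 | 0</str>
      </lst>
   </lst>
</response>"""

if __name__ == "__main__":
    print(build_url("http://localhost:8983/solr", "coreX", 12345))
    print(parse_response(SAMPLE))
```

To run it against a live core, fetch the URL with urllib.request.urlopen and pass the decoded body to parse_response. Given the slow reconstruction noted above, a generous timeout is probably wise.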
                
> Reconstruct a Document (stored fields, indexed fields, payloads)
> ----------------------------------------------------------------
>
>                 Key: SOLR-1837
>                 URL: https://issues.apache.org/jira/browse/SOLR-1837
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis, web gui
>    Affects Versions: 1.5
>         Environment: All
>            Reporter: Trey Grainger
>            Priority: Minor
>              Labels: admin, indexed, luke, payload, reconstruct, stored
>             Fix For: 4.4
>
>         Attachments: SOLR-1837.patch, SOLR-1837_WithHandler.patch
>
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> One Solr feature I've been sorely in need of is the ability to inspect an index for any particular document.  While the analysis page is good when you have specific content and a specific field/type you want to test the analysis process for, once a document is indexed it is not currently possible to easily see what is actually sitting in the index.
> One can use the Lucene Index Browser (Luke), but this has several limitations (GUI only, doesn't understand the Solr schema, doesn't display many non-text fields in human-readable format, doesn't show payloads, some bugs lead to missing terms, exposes features dangerous to use in a production Solr environment, slow or difficult to check from a remote location, etc.).  The document reconstruction feature of Luke, however, provides the base for what can become a much more powerful tool when coupled with Solr's understanding of a schema.

