You are viewing a plain text version of this content. The canonical link for it is here.

Posted to solr-commits@lucene.apache.org by Apache Wiki <wi...@apache.org> on 2008/12/31 13:46:04 UTC

[Solr Wiki] Update of "UpdateRichDocuments" by GrantIngersoll

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The following page has been changed by GrantIngersoll:
http://wiki.apache.org/solr/UpdateRichDocuments

------------------------------------------------------------------------------
  = Updating a Solr Index with Rich Documents such as PDF and MS Office =
  
  <!> NOTE: This page covers the !RichDocumentHandler as created by Eric Pugh and Chris Harris.  Solr's Tika integration, which will replace the !RichDocumentHandler is described at ExtractingRequestHandler. This page is being preserved here for those users who currently use the !RichDocumentHandler. <!>
+ 
+ <!> NOTE 2 from Eric: As the ExtractingRequestHandler is completed, users of !RichDocumentHandler should move to the native Solr solution.
+ 
+ <!>  DEPRECATED.  This code will not be committed to Solr.
  
  Solr has an extensible DocumentHandler architecture that allows you to feed it XML and CSV documents.  There is now a patch file available as part of [https://issues.apache.org/jira/browse/SOLR-284 SOLR-284] that adds support for parsing rich binary formats.
  
@@ -10, +14 @@

  
  Any additions, enhancements, or questions, please email me at epugh at opensourceconnections.com, or just update the JIRA issue!
  
- <!> NOTE 2 from Eric: As the ExtractingRequestHandler is completed, users of !RichDocumentHandler should move to the native Solr solution.
+ 
  
  
  [[TableOfContents]]