You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Jan Høydahl (JIRA)" <ji...@apache.org> on 2016/05/28 21:38:12 UTC

[jira] [Closed] (SOLR-1763) Integrate Solr Cell/Tika as an UpdateRequestProcessor

     [ https://issues.apache.org/jira/browse/SOLR-1763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jan Høydahl closed SOLR-1763.
-----------------------------
    Resolution: Won't Fix

Resolving as won't fix due to lack of interest both from my self and others. And besides I don't think it is a great idea anymore :)

> Integrate Solr Cell/Tika as an UpdateRequestProcessor
> -----------------------------------------------------
>
>                 Key: SOLR-1763
>                 URL: https://issues.apache.org/jira/browse/SOLR-1763
>             Project: Solr
>          Issue Type: New Feature
>          Components: update
>            Reporter: Jan Høydahl
>              Labels: extracting_request_handler, solr_cell, tika, update_request_handler
>
> From Chris Hostetter's original post in solr-dev:
> As someone with very little knowledge of Solr Cell and/or Tika, I find myself wondering if ExtractingRequestHandler would make more sense as an extractingUpdateProcessor -- where it could be configured to take take either binary fields (or string fields containing URLs) out of the Documents, parse them with tika, and add the various XPath matching hunks of text back into the document as new fields.
> Then ExtractingRequestHandler just becomes a handler that slurps up it's ContentStreams and adds them as binary data fields and adds the other literal params as fields.
> Wouldn't that make things like SOLR-1358, and using Tika with URLs/filepaths in XML and CSV based updates fairly trivial?
> -Hoss
> I couldn't agree more, so I decided to add it as an issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org