Posted to dev@lucene.apache.org by "Tomás Fernández Löbbe (JIRA)" <ji...@apache.org> on 2011/01/15 04:23:47 UTC

[jira] Updated: (SOLR-1526) Client Side Tika integration

     [ https://issues.apache.org/jira/browse/SOLR-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tomás Fernández Löbbe updated SOLR-1526:
----------------------------------------

    Attachment: clientextraction.tar.gz

I'm uploading the code I mentioned a couple of days ago in case somebody wants it. I added it as a new contrib, which is why I'm uploading a tar file instead of a patch. It uses the same libraries as the extraction contrib. It still doesn't work with dates and has lots of pending items, but I think we should decide how to implement this feature before I continue coding.
You will see that the "lib" directory is empty. That's because:
1) I can't upload a file with all the jars because its size would exceed 10 MB, the maximum upload size in Jira.
2) It uses the same jars as the "extraction" contrib, so to use clientextraction, simply copy the jars from the extraction contrib.
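
The jar copy described above could be sketched roughly as follows. This is only an illustration: the function name copy_extraction_jars is hypothetical, and the directory paths in the usage line are assumptions about where the two contribs sit in a checkout, so adjust them to wherever the tarball was unpacked.

```shell
# Sketch: copy the jars from the extraction contrib into the empty
# "lib" directory of the clientextraction contrib.
# Both directory arguments are supplied by the caller; nothing here
# is taken from the tarball itself.
copy_extraction_jars() {
  src="$1"    # lib directory of the extraction contrib
  dest="$2"   # lib directory of the clientextraction contrib
  mkdir -p "$dest" || return 1
  # Expand the jar list; if the glob matched nothing, report and fail.
  set -- "$src"/*.jar
  [ -e "$1" ] || { echo "no jars found in $src" >&2; return 1; }
  cp "$@" "$dest"/
}

# Usage, from the Solr source root (paths are assumed, not confirmed):
# copy_extraction_jars contrib/extraction/lib contrib/clientextraction/lib
```

Only files ending in .jar are copied, and the function returns a non-zero status if the source directory contains none.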

> Client Side Tika integration
> ----------------------------
>
>                 Key: SOLR-1526
>                 URL: https://issues.apache.org/jira/browse/SOLR-1526
>             Project: Solr
>          Issue Type: New Feature
>          Components: clients - java
>            Reporter: Grant Ingersoll
>            Priority: Minor
>             Fix For: Next
>
>         Attachments: clientextraction.tar.gz
>
>
> Oftentimes it is cost-prohibitive to send full, rich documents over the wire. The contrib/extraction library has server-side integration with Tika, but it would be nice to have a client-side implementation as well. It should support either both metadata and content, or just metadata.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

