You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@manifoldcf.apache.org by "Karl Wright (JIRA)" <ji...@apache.org> on 2014/06/10 01:25:03 UTC

[jira] [Commented] (CONNECTORS-954) Amazon Cloud Search connector's use of Tika should be revisited after pipelines are added

    [ https://issues.apache.org/jira/browse/CONNECTORS-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14025925#comment-14025925 ] 

Karl Wright commented on CONNECTORS-954:
----------------------------------------

Pipelines have now been added, so it's time to look at this seriously.

> Amazon Cloud Search connector's use of Tika should be revisited after pipelines are added
> -----------------------------------------------------------------------------------------
>
>                 Key: CONNECTORS-954
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-954
>             Project: ManifoldCF
>          Issue Type: Task
>          Components: Amazon CloudSearch output connector
>    Affects Versions: ManifoldCF 1.7
>            Reporter: Karl Wright
>            Assignee: Karl Wright
>             Fix For: ManifoldCF 1.7
>
>
> Amazon Cloud Search connector uses Tika to extract content from binaries.
> When the pipeline support in CONNECTORS-946 is committed to trunk, we should do two things:
> (a) Create a Transformation Connection that extracts binary data into metadata, and
> (b) Remove the Tika dependency from the Amazon connector



--
This message was sent by Atlassian JIRA
(v6.2#6252)