You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Daniel Ciborowski (JIRA)" <ji...@apache.org> on 2013/09/09 20:11:52 UTC

[jira] [Issue Comment Deleted] (NUTCH-1517) CloudSearch indexer

     [ https://issues.apache.org/jira/browse/NUTCH-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Daniel Ciborowski updated NUTCH-1517:
-------------------------------------

    Comment: was deleted

(was: Does this process work with the data stored in hdfs? or does it have to be stored on local file system? Still not able to get nutch to save segments though... But when I tried to use the index on my previously crawled data I am still getting the matched 0 files errors.
)
    
> CloudSearch indexer
> -------------------
>
>                 Key: NUTCH-1517
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1517
>             Project: Nutch
>          Issue Type: New Feature
>          Components: indexer
>            Reporter: Julien Nioche
>             Fix For: 1.9
>
>         Attachments: 0023883254_1377197869_indexer-cloudsearch.patch
>
>
> Once we have made the indexers pluggable, we should add a plugin for Amazon CloudSearch. See http://aws.amazon.com/cloudsearch/. Apparently it uses a JSON based representation Search Data Format (SDF), which we could reuse for a file based indexer.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira