You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/08/09 22:22:27 UTC

[jira] [Closed] (NUTCH-296) Image Search

     [ https://issues.apache.org/jira/browse/NUTCH-296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lewis John McGibbney closed NUTCH-296.
--------------------------------------

    Resolution: Won't Fix
      Assignee: Lewis John McGibbney

As there has been no progress made with this issue for years, and that it deviates from the direction in which Nutch branch-1.4 and trunk 2.0 are moving it is being closed. This may not have been the case if there had been some contribution from the GSoC suggestions, however this has sadly not been the case.

Due to no objections we are closing this issue.

> Image Search
> ------------
>
>                 Key: NUTCH-296
>                 URL: https://issues.apache.org/jira/browse/NUTCH-296
>             Project: Nutch
>          Issue Type: New Feature
>            Reporter: Thomas Delnoij
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>
> Per the discussion in the Nutch-User mailing list, there is a wish for an "Image Search" add-on component that will index images.
> Must have:
> - retrieve outlinks to image files from fetched pages
> - generate thumbnails from images
> - thumbnails are stored in the segments as ImageWritable that contains the compressed binary data and some meta data 
> Should have:
> - implemented as hadoop map reduce job
> - should be seperate from main Nutch codeline as it breaks general Nutch logic of one url == one index document.
> Could  have:
> - store the original image in the segments
> Would like to have:
> - search interface for image index
> - parameterizable thumbnail generation (width, height, quality)

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira