Posted to user@nutch.apache.org by Ake Tangkananond <ia...@gmail.com> on 2012/08/08 09:54:19 UTC

Nutch plugins/feed

Hi,

I see there is an RSS parser under src/plugins, but it isn't included in the
deployment profile in src/plugins/build.xml. Is there a substitute for
this parser now? Which one should I use, or should I port the previous
RSS parser to Nutch 2.x myself?

Thank you.
 

Regards,
Ake Tangkananond



Re: Nutch plugins/feed

Posted by Julien Nioche <li...@gmail.com>.
IIRC the Tika parser should handle RSS feeds. The one in src/plugins
probably hasn't been ported to 2.x yet, as it generates X sub-documents from
a single source, which parsers in Nutch 2.x can't do at the moment.
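
To make the "X sub-documents from a single source" point concrete, here is a
minimal sketch of what splitting a feed means. It is not the feed plugin's code
and does not use Nutch's or Tika's APIs. It only uses the JDK's built-in XML
parser, and the class name FeedSplitter, the method split, and the sample feed
are all hypothetical names chosen for illustration.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical illustration only: NOT Nutch's Parser API or the feed plugin.
class FeedSplitter {

    // Turns one fetched RSS source into one {link, title} pair per <item>.
    static List<String[]> split(String rssXml) {
        try {
            DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
            Document feed = builder.parse(new InputSource(new StringReader(rssXml)));

            List<String[]> subDocs = new ArrayList<>();
            NodeList items = feed.getElementsByTagName("item");
            for (int i = 0; i < items.getLength(); i++) {
                Element item = (Element) items.item(i);
                subDocs.add(new String[] { text(item, "link"), text(item, "title") });
            }
            return subDocs;
        } catch (Exception e) {
            // Covers parser configuration, I/O, and malformed-XML errors.
            throw new IllegalArgumentException("Could not parse feed", e);
        }
    }

    // Text of the first descendant element with the given tag, or "" if absent.
    private static String text(Element parent, String tag) {
        NodeList nodes = parent.getElementsByTagName(tag);
        return nodes.getLength() == 0 ? "" : nodes.item(0).getTextContent().trim();
    }

    public static void main(String[] args) {
        // Made-up sample feed with two entries.
        String rss = "<rss version=\"2.0\"><channel><title>Example</title>"
            + "<item><title>First</title><link>http://example.com/1</link></item>"
            + "<item><title>Second</title><link>http://example.com/2</link></item>"
            + "</channel></rss>";
        for (String[] doc : split(rss)) {
            System.out.println(doc[0] + " -> " + doc[1]);
        }
    }
}
```

A single input string yields a list with one entry per item, each with its own
URL. That one-to-many output is the part that, per the reply above, a 2.x
parser cannot currently express.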



-- 
Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com
http://twitter.com/digitalpebble