You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Elwin <ma...@gmail.com> on 2006/02/22 16:09:53 UTC

Why Perl5 regular expressions?

Why the url filter of nutch use Perl5 regular expressions? Any benefits?

--
《盖世豪侠》好评如潮,让无线收视居高不下,
无线高兴之余,仍未重用。周星驰岂是池中物,
喜剧天分既然崭露,当然不甘心受冷落,于是
转投电影界,在大银幕上一展风采。无线既得
千里马,又失千里马,当然后悔莫及。

Re: Why Perl5 regular expressions?

Posted by Stefan Groschupf <sg...@media-style.com>.
I guess this it is a historically reason.
I remember a discussion to replace it but didn't remember the details  
may you find something in the mail archive (developer list).

Am 22.02.2006 um 16:09 schrieb Elwin:

> Why the url filter of nutch use Perl5 regular expressions? Any  
> benefits?
>
> --
> 《盖世豪侠》好评如潮,让无线收视居高不下,
> 无线高兴之余,仍未重用。周星驰岂是池中物,
> 喜剧天分既然崭露,当然不甘心受冷落,于是
> 转投电影界,在大银幕上一展风采。无线既得
> 千里马,又失千里马,当然后悔莫及。

---------------------------------------------------------------
company:        http://www.media-style.com
forum:        http://www.text-mining.org
blog:            http://www.find23.net



Re: Why Perl5 regular expressions?

Posted by Jérôme Charron <je...@gmail.com>.
> Why the url filter of nutch use Perl5 regular expressions? Any benefits?

In the trunk, the RegexURLFilter no more use the Oro Perl5 regular
expressions but the Java ones
(http://svn.apache.org/viewcvs.cgi?rev=367408&view=rev)

Jérôme

--
http://motrech.free.fr/
http://www.frutch.org/