You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/06/28 15:10:17 UTC

[jira] [Commented] (NUTCH-1021) Migrate OutlinkExtractor from Apache ORO to java.util.regex

    [ https://issues.apache.org/jira/browse/NUTCH-1021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13056487#comment-13056487 ] 

Markus Jelsma commented on NUTCH-1021:
--------------------------------------

Hm, the class o.a.n.parse.OutlinkExtractor seems to be only in use by parse-ext and parse-tika, for which in the latter it is only used as a fallback in case parse-tika cannot find outlinks itself.  Note to self: this may be useful for NUTCH-961.

> Migrate OutlinkExtractor from Apache ORO to java.util.regex 
> ------------------------------------------------------------
>
>                 Key: NUTCH-1021
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1021
>             Project: Nutch
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.3
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>             Fix For: 1.4, 2.0
>
>
> Migrate from deprecated ORO to Java util regex.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira