Posted to dev@nutch.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/10/15 18:38:05 UTC

[jira] [Commented] (NUTCH-2139) Basic plugin to index inlinks and outlinks

    [ https://issues.apache.org/jira/browse/NUTCH-2139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14959186#comment-14959186 ] 

ASF GitHub Bot commented on NUTCH-2139:
---------------------------------------

GitHub user jorgelbg opened a pull request:

    https://github.com/apache/nutch/pull/78

    Fix for NUTCH-2139 contributed by jorgelbg

    Basic indexing capabilities for inlinks and outlinks. 

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/jorgelbg/nutch NUTCH-2139

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nutch/pull/78.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #78
    
----
commit f1d16ac509146aada0817d58d40bbcbfd0bad44d
Author: Jorge Luis Betancourt <be...@gmail.com>
Date:   2015-10-15T16:34:37Z

    Fix for NUTCH-2139 contributed by jorgelbg

----


> Basic plugin to index inlinks and outlinks
> ------------------------------------------
>
>                 Key: NUTCH-2139
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2139
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer, plugin
>            Reporter: Jorge Luis Betancourt Gonzalez
>            Priority: Minor
>              Labels: link, plugin
>             Fix For: 1.11
>
>
> Basic plugin that allows indexing the inlinks and outlinks of web pages. This could be very useful for analytics purposes, including neat visualizations using d3.js, for instance.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)