You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/04/10 07:21:12 UTC

[jira] [Resolved] (NUTCH-1944) Add raw content to indexes

     [ https://issues.apache.org/jira/browse/NUTCH-1944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann resolved NUTCH-1944.
--------------------------------------
    Resolution: Fixed

Committed the pull request (#8) from meabed:

{noformat}
[chipotle:~/tmp/nutch2.x] mattmann% svn commit -m "fix for NUTCH-1944 Index HTML raw content contributed by meabed this closes #8."
Sending        CHANGES.txt
Sending        conf/schema.xml
Adding         src/plugin/index-html
Adding         src/plugin/index-html/build.xml
Adding         src/plugin/index-html/ivy.xml
Adding         src/plugin/index-html/plugin.xml
Adding         src/plugin/index-html/src
Adding         src/plugin/index-html/src/java
Adding         src/plugin/index-html/src/java/org
Adding         src/plugin/index-html/src/java/org/apache
Adding         src/plugin/index-html/src/java/org/apache/nutch
Adding         src/plugin/index-html/src/java/org/apache/nutch/indexer
Adding         src/plugin/index-html/src/java/org/apache/nutch/indexer/html
Adding         src/plugin/index-html/src/java/org/apache/nutch/indexer/html/HtmlIndexingFilter.java
Adding         src/plugin/index-html/src/java/org/apache/nutch/indexer/html/README.md
Adding         src/plugin/index-html/src/java/org/apache/nutch/indexer/html/package.html
Transmitting file data ........
Committed revision 1672542.
{noformat}

Has been sitting for a while and it's a good start. We can build off this with Seb's comments. Thanks meabed!

> Add raw content to indexes
> --------------------------
>
>                 Key: NUTCH-1944
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1944
>             Project: Nutch
>          Issue Type: New Feature
>          Components: indexer, plugin
>            Reporter: Lewis John McGibbney
>            Assignee: Chris A. Mattmann
>             Fix For: 2.4
>
>
> The issues is described very well here
> https://github.com/Meabed/nutch2-index-html



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)