You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/04/10 07:21:12 UTC
[jira] [Resolved] (NUTCH-1944) Add raw content to indexes
[ https://issues.apache.org/jira/browse/NUTCH-1944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Chris A. Mattmann resolved NUTCH-1944.
--------------------------------------
Resolution: Fixed
Committed the pull request (#8) from meabed:
{noformat}
[chipotle:~/tmp/nutch2.x] mattmann% svn commit -m "fix for NUTCH-1944 Index HTML raw content contributed by meabed this closes #8."
Sending CHANGES.txt
Sending conf/schema.xml
Adding src/plugin/index-html
Adding src/plugin/index-html/build.xml
Adding src/plugin/index-html/ivy.xml
Adding src/plugin/index-html/plugin.xml
Adding src/plugin/index-html/src
Adding src/plugin/index-html/src/java
Adding src/plugin/index-html/src/java/org
Adding src/plugin/index-html/src/java/org/apache
Adding src/plugin/index-html/src/java/org/apache/nutch
Adding src/plugin/index-html/src/java/org/apache/nutch/indexer
Adding src/plugin/index-html/src/java/org/apache/nutch/indexer/html
Adding src/plugin/index-html/src/java/org/apache/nutch/indexer/html/HtmlIndexingFilter.java
Adding src/plugin/index-html/src/java/org/apache/nutch/indexer/html/README.md
Adding src/plugin/index-html/src/java/org/apache/nutch/indexer/html/package.html
Transmitting file data ........
Committed revision 1672542.
{noformat}
Has been sitting for a while and it's a good start. We can build off this with Seb's comments. Thanks meabed!
> Add raw content to indexes
> --------------------------
>
> Key: NUTCH-1944
> URL: https://issues.apache.org/jira/browse/NUTCH-1944
> Project: Nutch
> Issue Type: New Feature
> Components: indexer, plugin
> Reporter: Lewis John McGibbney
> Assignee: Chris A. Mattmann
> Fix For: 2.4
>
>
> The issues is described very well here
> https://github.com/Meabed/nutch2-index-html
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)