You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by derevo <da...@inbox.ru> on 2007/04/21 01:16:59 UTC
Plugin to index categories by url rules
Hi,
Maybe some one know how to add this plugin to nutch 9.0
http://issues.apache.org/jira/browse/NUTCH-386
http://issues.apache.org/jira/browse/NUTCH-386
MY ERRORR FROM LOG FILE
2007-04-21 03:12:16,929 INFO indexer.IndexingFilters - Adding
org.b2b.nutch.indexer.UrlCategoryIndexFilter
2007-04-21 03:12:16,938 INFO indexer.IndexingFilters - Adding
org.apache.nutch.indexer.more.MoreIndexingFilter
2007-04-21 03:12:17,202 WARN mapred.LocalJobRunner - job_q411un
java.lang.AbstractMethodError:
org.b2b.nutch.indexer.UrlCategoryIndexFilter.filter(Lorg/apache/lucene/document/Document;Lorg/apache/nutch/parse/Parse;Lorg/apache/hadoop/io/Text;Lorg/apache/nutch/crawl/CrawlDatum;Lorg/apache/nutch/crawl/Inlinks;)Lorg/apache/lucene/document/Document;
at
org.apache.nutch.indexer.IndexingFilters.filter(IndexingFilters.java:110)
at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:215)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:313)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:155)
2007-04-21 03:12:17,296 FATAL indexer.Indexer - Indexer:
java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
at org.apache.nutch.indexer.Indexer.index(Indexer.java:273)
at org.apache.nutch.indexer.Indexer.run(Indexer.java:295)
at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
at org.apache.nutch.indexer.Indexer.main(Indexer.java:278)
Any help would be appreciated. Thanks!
--
View this message in context: http://www.nabble.com/Plugin-to-index-categories-by-url-rules-tf3621139.html#a10111892
Sent from the Nutch - User mailing list archive at Nabble.com.
Re: Plugin to index categories by url rules
Posted by derevo <da...@inbox.ru>.
please help me
--
View this message in context: http://www.nabble.com/Plugin-to-index-categories-by-url-rules-tf3621139.html#a10175998
Sent from the Nutch - User mailing list archive at Nabble.com.
Re: Plugin to index categories by url rules
Posted by Doğacan Güney <do...@gmail.com>.
On 4/21/07, derevo <da...@inbox.ru> wrote:
>
> hmm,
> the same error when try add anothe plugin
> http://issues.apache.org/jira/browse/NUTCH-422
> http://issues.apache.org/jira/browse/NUTCH-422
>
>
> 2007-04-21 05:37:38,214 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.extra.ExtraIndexingFilter
> 2007-04-21 05:37:38,224 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2007-04-21 05:37:38,561 WARN mapred.LocalJobRunner - job_lctrxl
> java.lang.AbstractMethodError:
It throws a AbstractMethodError. Normally such errors should be caught
during compilation. So, I know that this is probably not very helpful
but, can you try doing ant clean;ant?
> org.apache.nutch.indexer.extra.ExtraIndexingFilter.filter(Lorg/apache/lucene/document/Document;Lorg/apache/nutch/parse/Parse;Lorg/apache/hadoop/io/Text;Lorg/apache/nutch/crawl/CrawlDatum;Lorg/apache/nutch/crawl/Inlinks;)Lorg/apache/lucene/document/Document;
> at
> org.apache.nutch.indexer.IndexingFilters.filter(IndexingFilters.java:110)
> at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:215)
> at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:313)
> at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:155)
> 2007-04-21 05:37:38,866 FATAL indexer.Indexer - Indexer:
> java.io.IOException: Job failed!
> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
> at org.apache.nutch.indexer.Indexer.index(Indexer.java:273)
> at org.apache.nutch.indexer.Indexer.run(Indexer.java:295)
> at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
> at org.apache.nutch.indexer.Indexer.main(Indexer.java:278)
>
>
>
>
>
> derevo wrote:
> >
> > Hi,
> > Maybe some one know how to add this plugin to nutch 9.0
> > http://issues.apache.org/jira/browse/NUTCH-386
> > http://issues.apache.org/jira/browse/NUTCH-386
> >
> > MY ERRORR FROM LOG FILE
> >
> > 2007-04-21 03:12:16,929 INFO indexer.IndexingFilters - Adding
> > org.b2b.nutch.indexer.UrlCategoryIndexFilter
> > 2007-04-21 03:12:16,938 INFO indexer.IndexingFilters - Adding
> > org.apache.nutch.indexer.more.MoreIndexingFilter
> > 2007-04-21 03:12:17,202 WARN mapred.LocalJobRunner - job_q411un
> > java.lang.AbstractMethodError:
> > org.b2b.nutch.indexer.UrlCategoryIndexFilter.filter(Lorg/apache/lucene/document/Document;Lorg/apache/nutch/parse/Parse;Lorg/apache/hadoop/io/Text;Lorg/apache/nutch/crawl/CrawlDatum;Lorg/apache/nutch/crawl/Inlinks;)Lorg/apache/lucene/document/Document;
> > at
> > org.apache.nutch.indexer.IndexingFilters.filter(IndexingFilters.java:110)
> > at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:215)
> > at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:313)
> > at
> > org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:155)
> > 2007-04-21 03:12:17,296 FATAL indexer.Indexer - Indexer:
> > java.io.IOException: Job failed!
> > at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
> > at org.apache.nutch.indexer.Indexer.index(Indexer.java:273)
> > at org.apache.nutch.indexer.Indexer.run(Indexer.java:295)
> > at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
> > at org.apache.nutch.indexer.Indexer.main(Indexer.java:278)
> >
> >
> >
> >
> > Any help would be appreciated. Thanks!
> >
>
> --
> View this message in context: http://www.nabble.com/Plugin-to-index-categories-by-url-rules-tf3621139.html#a10112854
> Sent from the Nutch - User mailing list archive at Nabble.com.
>
>
--
Doğacan Güney
Re: Plugin to index categories by url rules
Posted by derevo <da...@inbox.ru>.
hmm,
the same error when try add anothe plugin
http://issues.apache.org/jira/browse/NUTCH-422
http://issues.apache.org/jira/browse/NUTCH-422
2007-04-21 05:37:38,214 INFO indexer.IndexingFilters - Adding
org.apache.nutch.indexer.extra.ExtraIndexingFilter
2007-04-21 05:37:38,224 INFO indexer.IndexingFilters - Adding
org.apache.nutch.indexer.basic.BasicIndexingFilter
2007-04-21 05:37:38,561 WARN mapred.LocalJobRunner - job_lctrxl
java.lang.AbstractMethodError:
org.apache.nutch.indexer.extra.ExtraIndexingFilter.filter(Lorg/apache/lucene/document/Document;Lorg/apache/nutch/parse/Parse;Lorg/apache/hadoop/io/Text;Lorg/apache/nutch/crawl/CrawlDatum;Lorg/apache/nutch/crawl/Inlinks;)Lorg/apache/lucene/document/Document;
at
org.apache.nutch.indexer.IndexingFilters.filter(IndexingFilters.java:110)
at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:215)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:313)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:155)
2007-04-21 05:37:38,866 FATAL indexer.Indexer - Indexer:
java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
at org.apache.nutch.indexer.Indexer.index(Indexer.java:273)
at org.apache.nutch.indexer.Indexer.run(Indexer.java:295)
at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
at org.apache.nutch.indexer.Indexer.main(Indexer.java:278)
derevo wrote:
>
> Hi,
> Maybe some one know how to add this plugin to nutch 9.0
> http://issues.apache.org/jira/browse/NUTCH-386
> http://issues.apache.org/jira/browse/NUTCH-386
>
> MY ERRORR FROM LOG FILE
>
> 2007-04-21 03:12:16,929 INFO indexer.IndexingFilters - Adding
> org.b2b.nutch.indexer.UrlCategoryIndexFilter
> 2007-04-21 03:12:16,938 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.more.MoreIndexingFilter
> 2007-04-21 03:12:17,202 WARN mapred.LocalJobRunner - job_q411un
> java.lang.AbstractMethodError:
> org.b2b.nutch.indexer.UrlCategoryIndexFilter.filter(Lorg/apache/lucene/document/Document;Lorg/apache/nutch/parse/Parse;Lorg/apache/hadoop/io/Text;Lorg/apache/nutch/crawl/CrawlDatum;Lorg/apache/nutch/crawl/Inlinks;)Lorg/apache/lucene/document/Document;
> at
> org.apache.nutch.indexer.IndexingFilters.filter(IndexingFilters.java:110)
> at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:215)
> at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:313)
> at
> org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:155)
> 2007-04-21 03:12:17,296 FATAL indexer.Indexer - Indexer:
> java.io.IOException: Job failed!
> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
> at org.apache.nutch.indexer.Indexer.index(Indexer.java:273)
> at org.apache.nutch.indexer.Indexer.run(Indexer.java:295)
> at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
> at org.apache.nutch.indexer.Indexer.main(Indexer.java:278)
>
>
>
>
> Any help would be appreciated. Thanks!
>
--
View this message in context: http://www.nabble.com/Plugin-to-index-categories-by-url-rules-tf3621139.html#a10112854
Sent from the Nutch - User mailing list archive at Nabble.com.
Re: Plugin to index categories by url rules
Posted by derevo <da...@inbox.ru>.
But it work in nutch-0.8.1 , i cant understand what trubble in 9.0
--
View this message in context: http://www.nabble.com/Plugin-to-index-categories-by-url-rules-tf3621139.html#a10119053
Sent from the Nutch - User mailing list archive at Nabble.com.