You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by derevo <da...@inbox.ru> on 2007/04/21 01:16:59 UTC

Plugin to index categories by url rules

Hi, 
Maybe some one know how to add this plugin to nutch 9.0 
http://issues.apache.org/jira/browse/NUTCH-386
http://issues.apache.org/jira/browse/NUTCH-386 

MY ERRORR FROM LOG FILE

2007-04-21 03:12:16,929 INFO  indexer.IndexingFilters - Adding
org.b2b.nutch.indexer.UrlCategoryIndexFilter
2007-04-21 03:12:16,938 INFO  indexer.IndexingFilters - Adding
org.apache.nutch.indexer.more.MoreIndexingFilter
2007-04-21 03:12:17,202 WARN  mapred.LocalJobRunner - job_q411un
java.lang.AbstractMethodError:
org.b2b.nutch.indexer.UrlCategoryIndexFilter.filter(Lorg/apache/lucene/document/Document;Lorg/apache/nutch/parse/Parse;Lorg/apache/hadoop/io/Text;Lorg/apache/nutch/crawl/CrawlDatum;Lorg/apache/nutch/crawl/Inlinks;)Lorg/apache/lucene/document/Document;
	at
org.apache.nutch.indexer.IndexingFilters.filter(IndexingFilters.java:110)
	at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:215)
	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:313)
	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:155)
2007-04-21 03:12:17,296 FATAL indexer.Indexer - Indexer:
java.io.IOException: Job failed!
	at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
	at org.apache.nutch.indexer.Indexer.index(Indexer.java:273)
	at org.apache.nutch.indexer.Indexer.run(Indexer.java:295)
	at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
	at org.apache.nutch.indexer.Indexer.main(Indexer.java:278)




Any help would be appreciated. Thanks!
-- 
View this message in context: http://www.nabble.com/Plugin-to-index-categories-by-url-rules-tf3621139.html#a10111892
Sent from the Nutch - User mailing list archive at Nabble.com.


Re: Plugin to index categories by url rules

Posted by derevo <da...@inbox.ru>.
please help me
-- 
View this message in context: http://www.nabble.com/Plugin-to-index-categories-by-url-rules-tf3621139.html#a10175998
Sent from the Nutch - User mailing list archive at Nabble.com.


Re: Plugin to index categories by url rules

Posted by Doğacan Güney <do...@gmail.com>.
On 4/21/07, derevo <da...@inbox.ru> wrote:
>
> hmm,
> the same error when try add anothe plugin
> http://issues.apache.org/jira/browse/NUTCH-422
> http://issues.apache.org/jira/browse/NUTCH-422
>
>
> 2007-04-21 05:37:38,214 INFO  indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.extra.ExtraIndexingFilter
> 2007-04-21 05:37:38,224 INFO  indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2007-04-21 05:37:38,561 WARN  mapred.LocalJobRunner - job_lctrxl
> java.lang.AbstractMethodError:

It throws a AbstractMethodError. Normally such errors should be caught
during compilation. So, I know that this is probably not very helpful
but, can you try doing ant clean;ant?

> org.apache.nutch.indexer.extra.ExtraIndexingFilter.filter(Lorg/apache/lucene/document/Document;Lorg/apache/nutch/parse/Parse;Lorg/apache/hadoop/io/Text;Lorg/apache/nutch/crawl/CrawlDatum;Lorg/apache/nutch/crawl/Inlinks;)Lorg/apache/lucene/document/Document;
>         at
> org.apache.nutch.indexer.IndexingFilters.filter(IndexingFilters.java:110)
>         at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:215)
>         at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:313)
>         at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:155)
> 2007-04-21 05:37:38,866 FATAL indexer.Indexer - Indexer:
> java.io.IOException: Job failed!
>         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
>         at org.apache.nutch.indexer.Indexer.index(Indexer.java:273)
>         at org.apache.nutch.indexer.Indexer.run(Indexer.java:295)
>         at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
>         at org.apache.nutch.indexer.Indexer.main(Indexer.java:278)
>
>
>
>
>
> derevo wrote:
> >
> > Hi,
> > Maybe some one know how to add this plugin to nutch 9.0
> >  http://issues.apache.org/jira/browse/NUTCH-386
> > http://issues.apache.org/jira/browse/NUTCH-386
> >
> > MY ERRORR FROM LOG FILE
> >
> > 2007-04-21 03:12:16,929 INFO  indexer.IndexingFilters - Adding
> > org.b2b.nutch.indexer.UrlCategoryIndexFilter
> > 2007-04-21 03:12:16,938 INFO  indexer.IndexingFilters - Adding
> > org.apache.nutch.indexer.more.MoreIndexingFilter
> > 2007-04-21 03:12:17,202 WARN  mapred.LocalJobRunner - job_q411un
> > java.lang.AbstractMethodError:
> > org.b2b.nutch.indexer.UrlCategoryIndexFilter.filter(Lorg/apache/lucene/document/Document;Lorg/apache/nutch/parse/Parse;Lorg/apache/hadoop/io/Text;Lorg/apache/nutch/crawl/CrawlDatum;Lorg/apache/nutch/crawl/Inlinks;)Lorg/apache/lucene/document/Document;
> >       at
> > org.apache.nutch.indexer.IndexingFilters.filter(IndexingFilters.java:110)
> >       at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:215)
> >       at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:313)
> >       at
> > org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:155)
> > 2007-04-21 03:12:17,296 FATAL indexer.Indexer - Indexer:
> > java.io.IOException: Job failed!
> >       at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
> >       at org.apache.nutch.indexer.Indexer.index(Indexer.java:273)
> >       at org.apache.nutch.indexer.Indexer.run(Indexer.java:295)
> >       at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
> >       at org.apache.nutch.indexer.Indexer.main(Indexer.java:278)
> >
> >
> >
> >
> > Any help would be appreciated. Thanks!
> >
>
> --
> View this message in context: http://www.nabble.com/Plugin-to-index-categories-by-url-rules-tf3621139.html#a10112854
> Sent from the Nutch - User mailing list archive at Nabble.com.
>
>


-- 
Doğacan Güney

Re: Plugin to index categories by url rules

Posted by derevo <da...@inbox.ru>.
hmm,
the same error when try add anothe plugin
http://issues.apache.org/jira/browse/NUTCH-422
http://issues.apache.org/jira/browse/NUTCH-422 


2007-04-21 05:37:38,214 INFO  indexer.IndexingFilters - Adding
org.apache.nutch.indexer.extra.ExtraIndexingFilter
2007-04-21 05:37:38,224 INFO  indexer.IndexingFilters - Adding
org.apache.nutch.indexer.basic.BasicIndexingFilter
2007-04-21 05:37:38,561 WARN  mapred.LocalJobRunner - job_lctrxl
java.lang.AbstractMethodError:
org.apache.nutch.indexer.extra.ExtraIndexingFilter.filter(Lorg/apache/lucene/document/Document;Lorg/apache/nutch/parse/Parse;Lorg/apache/hadoop/io/Text;Lorg/apache/nutch/crawl/CrawlDatum;Lorg/apache/nutch/crawl/Inlinks;)Lorg/apache/lucene/document/Document;
	at
org.apache.nutch.indexer.IndexingFilters.filter(IndexingFilters.java:110)
	at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:215)
	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:313)
	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:155)
2007-04-21 05:37:38,866 FATAL indexer.Indexer - Indexer:
java.io.IOException: Job failed!
	at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
	at org.apache.nutch.indexer.Indexer.index(Indexer.java:273)
	at org.apache.nutch.indexer.Indexer.run(Indexer.java:295)
	at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
	at org.apache.nutch.indexer.Indexer.main(Indexer.java:278)





derevo wrote:
> 
> Hi, 
> Maybe some one know how to add this plugin to nutch 9.0 
>  http://issues.apache.org/jira/browse/NUTCH-386
> http://issues.apache.org/jira/browse/NUTCH-386 
> 
> MY ERRORR FROM LOG FILE
> 
> 2007-04-21 03:12:16,929 INFO  indexer.IndexingFilters - Adding
> org.b2b.nutch.indexer.UrlCategoryIndexFilter
> 2007-04-21 03:12:16,938 INFO  indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.more.MoreIndexingFilter
> 2007-04-21 03:12:17,202 WARN  mapred.LocalJobRunner - job_q411un
> java.lang.AbstractMethodError:
> org.b2b.nutch.indexer.UrlCategoryIndexFilter.filter(Lorg/apache/lucene/document/Document;Lorg/apache/nutch/parse/Parse;Lorg/apache/hadoop/io/Text;Lorg/apache/nutch/crawl/CrawlDatum;Lorg/apache/nutch/crawl/Inlinks;)Lorg/apache/lucene/document/Document;
> 	at
> org.apache.nutch.indexer.IndexingFilters.filter(IndexingFilters.java:110)
> 	at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:215)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:313)
> 	at
> org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:155)
> 2007-04-21 03:12:17,296 FATAL indexer.Indexer - Indexer:
> java.io.IOException: Job failed!
> 	at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
> 	at org.apache.nutch.indexer.Indexer.index(Indexer.java:273)
> 	at org.apache.nutch.indexer.Indexer.run(Indexer.java:295)
> 	at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
> 	at org.apache.nutch.indexer.Indexer.main(Indexer.java:278)
> 
> 
> 
> 
> Any help would be appreciated. Thanks!
> 

-- 
View this message in context: http://www.nabble.com/Plugin-to-index-categories-by-url-rules-tf3621139.html#a10112854
Sent from the Nutch - User mailing list archive at Nabble.com.


Re: Plugin to index categories by url rules

Posted by derevo <da...@inbox.ru>.
But it work in nutch-0.8.1 , i cant understand what trubble in 9.0
-- 
View this message in context: http://www.nabble.com/Plugin-to-index-categories-by-url-rules-tf3621139.html#a10119053
Sent from the Nutch - User mailing list archive at Nabble.com.