You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by eyal edri <ey...@gmail.com> on 2007/09/10 17:17:22 UTC

Injector: java.lang.IllegalStateException (at nutch fetch stage)

Hi,

I'm running nutch 0.9 on a fedora core 7 i368 machine (actually it's a
VMWARE), to testing.
while trying to fetch a single URL ("http://www.ynet.co.il") it takes ages
and then throws the following:

[eyale@localhost nutch-0.9]$ bin/nutch inject crawl/crawldb SMALL
Injector: starting
Injector: crawlDb: crawl/crawldb
Injector: urlDir: SMALL
Injector: Converting injected urls to crawl db entries.
Injector: java.lang.IllegalStateException
   at java.nio.charset.CharsetEncoder.encode(libgcj.so.8rh)
   at org.apache.hadoop.io.Text.encode(Text.java:375)
   at org.apache.hadoop.io.Text.encode(Text.java:356)
   at org.apache.hadoop.io.Text.writeString(Text.java:396)
   at org.apache.hadoop.mapred.JobClient$RawSplit.write(JobClient.java:428)
   at org.apache.hadoop.mapred.JobClient.writeSplitsFile(JobClient.java:457)
   at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:358)
   at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:543)
   at org.apache.nutch.crawl.Injector.inject(Injector.java:162)
   at org.apache.nutch.crawl.Injector.run(Injector.java:192)
   at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
   at org.apache.nutch.crawl.Injector.main(Injector.java:182)

Java ver:
[eyale@localhost nutch-0.9]$ java -version
java version "1.5.0"
gij (GNU libgcj) version 4.1.2 20070502 (Red Hat 4.1.2-12)

ANT build was successful

anyone can help?

-- 
Eyal Edri

Re: Injector: java.lang.IllegalStateException (at nutch fetch stage)

Posted by Andrzej Bialecki <ab...@getopt.org>.
eyal edri wrote:
> Hi,
> 
> I'm running nutch 0.9 on a fedora core 7 i368 machine (actually it's a
> VMWARE), to testing.
> while trying to fetch a single URL ("http://www.ynet.co.il") it takes ages
> and then throws the following:
> 
> [eyale@localhost nutch-0.9]$ bin/nutch inject crawl/crawldb SMALL
> Injector: starting
> Injector: crawlDb: crawl/crawldb
> Injector: urlDir: SMALL
> Injector: Converting injected urls to crawl db entries.
> Injector: java.lang.IllegalStateException
>    at java.nio.charset.CharsetEncoder.encode(libgcj.so.8rh)
>    at org.apache.hadoop.io.Text.encode(Text.java:375)
>    at org.apache.hadoop.io.Text.encode(Text.java:356)
>    at org.apache.hadoop.io.Text.writeString(Text.java:396)
>    at org.apache.hadoop.mapred.JobClient$RawSplit.write(JobClient.java:428)
>    at org.apache.hadoop.mapred.JobClient.writeSplitsFile(JobClient.java:457)
>    at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:358)
>    at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:543)
>    at org.apache.nutch.crawl.Injector.inject(Injector.java:162)
>    at org.apache.nutch.crawl.Injector.run(Injector.java:192)
>    at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
>    at org.apache.nutch.crawl.Injector.main(Injector.java:182)
> 
> Java ver:
> [eyale@localhost nutch-0.9]$ java -version
> java version "1.5.0"
> gij (GNU libgcj) version 4.1.2 20070502 (Red Hat 4.1.2-12)
> 
> ANT build was successful
> 
> anyone can help?
> 

Currently Nutch works only with certified JVM implementations, such as 
Sun or IBM or BEA. GCJ is not supported.


-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com