Posted to dev@nutch.apache.org by "Roman P (JIRA)" <ji...@apache.org> on 2015/05/31 18:34:17 UTC

[jira] [Created] (NUTCH-2028) java.lang.IllegalArgumentException: can't serialize class org.apache.avro.util.Utf8

Roman P created NUTCH-2028:
------------------------------

             Summary: java.lang.IllegalArgumentException: can't serialize class org.apache.avro.util.Utf8
                 Key: NUTCH-2028
                 URL: https://issues.apache.org/jira/browse/NUTCH-2028
             Project: Nutch
          Issue Type: Bug
          Components: fetcher
    Affects Versions: 2.3
         Environment: Mac OS X 10.10.3, Nutch 2.3
            Reporter: Roman P


I compiled Nutch 2.3 with MongoDB as the persistence backend and get the exception below when fetching. Searching for similar errors online, I found that this issue was addressed in Gora 0.6. I tried recompiling with Gora 0.6, but then I get a different exception; it seems that Gora 0.6 is incompatible with Hadoop 1.2.0. I tried different versions of Hadoop with no luck.

FetcherJob: starting at 2015-05-31 09:29:04
FetcherJob: batchId: all
FetcherJob: threads: 10
FetcherJob: parsing: false
FetcherJob: resuming: false
FetcherJob : timelimit set for : -1
java.lang.IllegalArgumentException: can't serialize class org.apache.avro.util.Utf8
	at org.bson.BasicBSONEncoder._putObjectField(BasicBSONEncoder.java:284)
	at org.bson.BasicBSONEncoder.putObject(BasicBSONEncoder.java:185)
	at org.bson.BasicBSONEncoder.putObject(BasicBSONEncoder.java:131)
	at com.mongodb.DefaultDBEncoder.writeObject(DefaultDBEncoder.java:33)
	at com.mongodb.OutMessage.putObject(OutMessage.java:289)
	at com.mongodb.OutMessage.writeQuery(OutMessage.java:211)
	at com.mongodb.OutMessage.query(OutMessage.java:86)
	at com.mongodb.DBCollectionImpl.find(DBCollectionImpl.java:81)
	at com.mongodb.DBCollectionImpl.find(DBCollectionImpl.java:66)
	at com.mongodb.DBCursor._check(DBCursor.java:458)
	at com.mongodb.DBCursor._hasNext(DBCursor.java:546)
	at com.mongodb.DBCursor.hasNext(DBCursor.java:571)
	at org.apache.gora.mongodb.query.MongoDBResult.nextInner(MongoDBResult.java:69)
	at org.apache.gora.query.impl.ResultBase.next(ResultBase.java:114)
	at org.apache.gora.mapreduce.GoraRecordReader.nextKeyValue(GoraRecordReader.java:119)
	at org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.nextKeyValue(MapTask.java:531)
	at org.apache.hadoop.mapreduce.MapContext.nextKeyValue(MapContext.java:67)
	at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
	at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:364)
	at org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:223)
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
	at java.lang.Thread.run(Thread.java:745)
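
For context, the trace shows the exception being raised in the BSON encoder while DBCollectionImpl.find writes the query, which suggests that an Avro Utf8 value ended up inside the query document. Utf8 implements CharSequence, so one general way to avoid this class of error is to convert such values to String before they reach the encoder. The sketch below illustrates that idea only; it is not the actual Gora fix. The class name Utf8QuerySketch, the helpers toBsonSafe and normalize, and the "retries" field are hypothetical, a plain Map stands in for the Mongo query object, and a StringBuilder stands in for Utf8 so that the example runs without Avro or the MongoDB driver.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class Utf8QuerySketch {

    // Hypothetical helper: turn any non-String CharSequence (Avro's Utf8 is one)
    // into a java.lang.String, which a BSON encoder knows how to serialize.
    static Object toBsonSafe(Object value) {
        if (value instanceof CharSequence && !(value instanceof String)) {
            return value.toString();
        }
        return value;
    }

    // Hypothetical helper: copy a flat query map, normalizing each value.
    // Nested documents and lists are not handled in this sketch.
    static Map<String, Object> normalize(Map<String, Object> query) {
        Map<String, Object> out = new LinkedHashMap<>();
        for (Map.Entry<String, Object> e : query.entrySet()) {
            out.put(e.getKey(), toBsonSafe(e.getValue()));
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, Object> query = new LinkedHashMap<>();
        // StringBuilder is a stand-in for org.apache.avro.util.Utf8 here.
        query.put("batchId", new StringBuilder("all"));
        query.put("retries", 3); // assumed field, for illustration only

        Map<String, Object> safe = normalize(query);
        System.out.println(safe.get("batchId").getClass().getName());
        System.out.println(safe);
    }
}
```

In a real setup the same conversion would have to be applied wherever the query filter is built from Avro-backed fields, which is an assumption about where the Utf8 value comes from rather than something the trace confirms.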



