You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Roman P (JIRA)" <ji...@apache.org> on 2015/05/31 18:34:17 UTC
[jira] [Created] (NUTCH-2028) java.lang.IllegalArgumentException:
can't serialize class org.apache.avro.util.Utf8
Roman P created NUTCH-2028:
------------------------------
Summary: java.lang.IllegalArgumentException: can't serialize class org.apache.avro.util.Utf8
Key: NUTCH-2028
URL: https://issues.apache.org/jira/browse/NUTCH-2028
Project: Nutch
Issue Type: Bug
Components: fetcher
Affects Versions: 2.3
Environment: Mac 10.10.3 Nutch 2.3
Reporter: Roman P
Compiled Nutch 2.3 with MongoDB as a persistence. Getting exception when fetching. Searched for similar errors online, noticed that this issue was addressed in gora 0.6. Tried recompiling with 0.6 but then getting different exception, seems that it's incompatible with hadoop 1.2.0. Tried different versions of hadoop with no luck.
FetcherJob: starting at 2015-05-31 09:29:04
FetcherJob: batchId: all
FetcherJob: threads: 10
FetcherJob: parsing: false
FetcherJob: resuming: false
FetcherJob : timelimit set for : -1
java.lang.IllegalArgumentException: can't serialize class org.apache.avro.util.Utf8
at org.bson.BasicBSONEncoder._putObjectField(BasicBSONEncoder.java:284)
at org.bson.BasicBSONEncoder.putObject(BasicBSONEncoder.java:185)
at org.bson.BasicBSONEncoder.putObject(BasicBSONEncoder.java:131)
at com.mongodb.DefaultDBEncoder.writeObject(DefaultDBEncoder.java:33)
at com.mongodb.OutMessage.putObject(OutMessage.java:289)
at com.mongodb.OutMessage.writeQuery(OutMessage.java:211)
at com.mongodb.OutMessage.query(OutMessage.java:86)
at com.mongodb.DBCollectionImpl.find(DBCollectionImpl.java:81)
at com.mongodb.DBCollectionImpl.find(DBCollectionImpl.java:66)
at com.mongodb.DBCursor._check(DBCursor.java:458)
at com.mongodb.DBCursor._hasNext(DBCursor.java:546)
at com.mongodb.DBCursor.hasNext(DBCursor.java:571)
at org.apache.gora.mongodb.query.MongoDBResult.nextInner(MongoDBResult.java:69)
at org.apache.gora.query.impl.ResultBase.next(ResultBase.java:114)
at org.apache.gora.mapreduce.GoraRecordReader.nextKeyValue(GoraRecordReader.java:119)
at org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.nextKeyValue(MapTask.java:531)
at org.apache.hadoop.mapreduce.MapContext.nextKeyValue(MapContext.java:67)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:364)
at org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:223)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)