You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Mathieu Raffinot <ra...@liafa.univ-paris-diderot.fr> on 2014/09/04 11:21:32 UTC

trouble nutch parse with Tika

Dear all,

I am trying to play with nutch-2.2.1 + gora+ hbase-0.90.4 in a 
runtime/local environnement. After a long time (had to change of hbase 
version twice), eventually, the following seems to run:

+ nutch inserts
+ nutch generate -all
+ nutch fetch -all also.

but I have trouble with:

-------------------------------------------------------------------------------------------------------------------
:~/apache-nutch-2.2.1/runtime/local$ bin/nutch parse -all
ParserJob: starting
ParserJob: resuming:    false
ParserJob: forced reparse:      false
ParserJob: parsing all
Exception in thread "main" java.util.ServiceConfigurationError: 
org.apache.tika.parser.Parser: Provider org.gagravarr.tika.FlacParser 
could not be instantiated
         at java.util.ServiceLoader.fail(ServiceLoader.java:224)
         at java.util.ServiceLoader.access$100(ServiceLoader.java:181)
         at 
java.util.ServiceLoader$LazyIterator.next(ServiceLoader.java:377)
         at java.util.ServiceLoader$1.next(ServiceLoader.java:445)
         at 
org.apache.nutch.parse.tika.TikaConfig.<init>(TikaConfig.java:148)
         at 
org.apache.nutch.parse.tika.TikaConfig.getDefaultConfig(TikaConfig.java:210)
         at 
org.apache.nutch.parse.tika.TikaParser.setConf(TikaParser.java:205)
         at 
org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java:160)
         at 
org.apache.nutch.parse.ParserFactory.getFields(ParserFactory.java:209)
         at org.apache.nutch.parse.ParserJob.getFields(ParserJob.java:195)
         at org.apache.nutch.parse.ParserJob.run(ParserJob.java:247)
         at org.apache.nutch.parse.ParserJob.parse(ParserJob.java:261)
         at org.apache.nutch.parse.ParserJob.run(ParserJob.java:304)
         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
         at org.apache.nutch.parse.ParserJob.main(ParserJob.java:308)
Caused by: java.lang.NoClassDefFoundError: 
org/gagravarr/vorbis/VorbisComments
         at java.lang.Class.getDeclaredConstructors0(Native Method)
         at java.lang.Class.privateGetDeclaredConstructors(Class.java:2493)
         at java.lang.Class.getConstructor0(Class.java:2803)
         at java.lang.Class.newInstance(Class.java:345)
         at 
java.util.ServiceLoader$LazyIterator.next(ServiceLoader.java:373)
         ... 12 more
Caused by: java.lang.ClassNotFoundException: 
org.gagravarr.vorbis.VorbisComments
         at java.net.URLClassLoader$1.run(URLClassLoader.java:366)
         at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
         at java.security.AccessController.doPrivileged(Native Method)
         at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
         at java.lang.ClassLoader.loadClass(ClassLoader.java:425)
         at java.lang.ClassLoader.loadClass(ClassLoader.java:358)
         ... 17 more

---------------------------------------------------------------------------------
and I could not figure out where it comes from. Did one of you face the 
same problem ?

Thanks,
Mathieu