You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mahout.apache.org by Chris Harrington <ch...@heystaks.com> on 2013/02/13 15:49:28 UTC
Russian Analyzer error
Hi all,
I'm having some trouble using a RussianAnalyzer in Mahout, I thought this would suffice
RussianAnalyzer analyzer = new RussianAnalyzer(Version.LUCENE_31);
DocumentProcessor.tokenizeDocuments(new Path(inputDir), analyzer.getClass().asSubclass(Analyzer.class), tokenizedPath, conf);
but it throws this error.
java.lang.IllegalStateException: java.lang.NoSuchMethodException: org.apache.lucene.analysis.ru.RussianAnalyzer.<init>()
at org.apache.mahout.common.ClassUtils.instantiateAs(ClassUtils.java:68)
at org.apache.mahout.common.ClassUtils.instantiateAs(ClassUtils.java:28)
at org.apache.mahout.vectorizer.document.SequenceFileTokenizerMapper.setup(SequenceFileTokenizerMapper.java:58)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
Caused by: java.lang.NoSuchMethodException: org.apache.lucene.analysis.ru.RussianAnalyzer.<init>()
at java.lang.Class.getConstructor0(Class.java:2706)
at java.lang.Class.getConstructor(Class.java:1657)
at org.apache.mahout.common.ClassUtils.instantiateAs(ClassUtils.java:62)
... 6 more
Since this happens with various other analyzers (FrenchAnalyzer , DutchAnalyzer, etc) I figure I must be going about this in the wrong way.
Any advice?