Posted to user@mahout.apache.org by Chris Harrington <ch...@heystaks.com> on 2013/02/13 15:49:28 UTC

Russian Analyzer error

Hi all,

I'm having some trouble using a RussianAnalyzer in Mahout. I thought this would suffice:

RussianAnalyzer analyzer = new RussianAnalyzer(Version.LUCENE_31); 
DocumentProcessor.tokenizeDocuments(new Path(inputDir), analyzer.getClass().asSubclass(Analyzer.class), tokenizedPath, conf);

but it throws this error:

java.lang.IllegalStateException: java.lang.NoSuchMethodException: org.apache.lucene.analysis.ru.RussianAnalyzer.<init>()
	at org.apache.mahout.common.ClassUtils.instantiateAs(ClassUtils.java:68)
	at org.apache.mahout.common.ClassUtils.instantiateAs(ClassUtils.java:28)
	at org.apache.mahout.vectorizer.document.SequenceFileTokenizerMapper.setup(SequenceFileTokenizerMapper.java:58)
	at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
	at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
Caused by: java.lang.NoSuchMethodException: org.apache.lucene.analysis.ru.RussianAnalyzer.<init>()
	at java.lang.Class.getConstructor0(Class.java:2706)
	at java.lang.Class.getConstructor(Class.java:1657)
	at org.apache.mahout.common.ClassUtils.instantiateAs(ClassUtils.java:62)
	... 6 more


Since this happens with various other analyzers (FrenchAnalyzer, DutchAnalyzer, etc.), I figure I must be going about this the wrong way.
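
To narrow it down, here is a minimal sketch with no Mahout or Lucene involved. Going by the trace, ClassUtils.instantiateAs calls Class.getConstructor() with no arguments, so only the analyzer's class reaches the mapper and the instance I built with Version.LUCENE_31 is never used. VersionedAnalyzer and DefaultVersionedAnalyzer below are hypothetical stand-ins, not real Lucene or Mahout classes, and the wrapper idea is only my guess at a workaround, not something I have confirmed against Mahout.

```java
public class NoArgConstructorDemo {

    // Hypothetical stand-in for an analyzer like RussianAnalyzer:
    // its only public constructor requires an argument.
    public static class VersionedAnalyzer {
        private final String version;

        public VersionedAnalyzer(String version) {
            this.version = version;
        }

        public String describe() {
            return "analyzer for " + version;
        }
    }

    // Hypothetical wrapper: fixes the argument internally and exposes
    // a public no-arg constructor that reflection can find.
    public static class DefaultVersionedAnalyzer {
        private final VersionedAnalyzer delegate = new VersionedAnalyzer("LUCENE_31");

        public DefaultVersionedAnalyzer() {
        }

        public String describe() {
            return delegate.describe();
        }
    }

    // Mimics instantiating from a Class object alone: look up the public
    // no-arg constructor and call it. Returns false when no such
    // constructor exists, which is the NoSuchMethodException case.
    public static boolean canInstantiateByClass(Class<?> clazz) {
        try {
            clazz.getConstructor().newInstance();
            return true;
        } catch (NoSuchMethodException e) {
            return false;
        } catch (ReflectiveOperationException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // No no-arg constructor, so the lookup fails: prints false
        System.out.println(canInstantiateByClass(VersionedAnalyzer.class));
        // The wrapper has one, so it can be created: prints true
        System.out.println(canInstantiateByClass(DefaultVersionedAnalyzer.class));
    }
}
```

If that reading is right, passing a class that has a public no-arg constructor and delegates to the real analyzer might get past the error, but I have not tried it in a real job.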

Any advice?