Posted to dev@mahout.apache.org by "Chris Birchall (JIRA)" <ji...@apache.org> on 2012/12/07 03:15:21 UTC

[jira] [Commented] (MAHOUT-1123) Support Lucene 3.6 analyzers for vectorization

    [ https://issues.apache.org/jira/browse/MAHOUT-1123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13526087#comment-13526087 ] 

Chris Birchall commented on MAHOUT-1123:
----------------------------------------

Sorry, I forgot to 'svn add' the most important file! I've updated the patch.
                
> Support Lucene 3.6 analyzers for vectorization
> ----------------------------------------------
>
>                 Key: MAHOUT-1123
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1123
>             Project: Mahout
>          Issue Type: Improvement
>    Affects Versions: 0.7
>            Reporter: Chris Birchall
>         Attachments: support-lucene36-analyzers.patch
>
>
> Passing a Lucene analyzer class name to Mahout (e.g. seq2sparse --analyzerName) results in failure with the error shown below.
> This is caused by Mahout trying to instantiate the analyzer using a zero-argument constructor. The zero-arg constructors were removed from the standard Lucene analyzers in Lucene 3.6 (if I recall correctly).
> This patch adds support for the new one-arg constructors, as well as keeping support for the legacy analyzers.
> =====
> Exception in thread "main" java.lang.IllegalStateException: java.lang.NoSuchMethodException: org.apache.lucene.analysis.standard.StandardAnalyzer.<init>()
>         at org.apache.mahout.common.ClassUtils.instantiateAs(ClassUtils.java:68)
>         at org.apache.mahout.vectorizer.SparseVectorsFromSequenceFiles.run(SparseVectorsFromSequenceFiles.java:204)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
>         at org.apache.mahout.vectorizer.SparseVectorsFromSequenceFiles.main(SparseVectorsFromSequenceFiles.java:55)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>         at java.lang.reflect.Method.invoke(Method.java:597)
>         at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
>         at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
>         at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
> Caused by: java.lang.NoSuchMethodException: org.apache.lucene.analysis.standard.StandardAnalyzer.<init>()
>         at java.lang.Class.getConstructor0(Class.java:2706)
>         at java.lang.Class.getConstructor(Class.java:1657)
>         at org.apache.mahout.common.ClassUtils.instantiateAs(ClassUtils.java:62)
>         ... 11 more
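
For readers without the patch to hand, the fallback described in the issue (try the zero-argument constructor first, then a one-argument constructor) might look roughly like the sketch below. This is not the code from support-lucene36-analyzers.patch. To stay self-contained it does not depend on Lucene or Mahout: Version, Analyzer, LegacyAnalyzer, VersionedAnalyzer and createAnalyzer are hypothetical stand-ins invented for illustration, and the sketch assumes the one-argument constructor takes a version value.

```java
import java.lang.reflect.Constructor;

public class AnalyzerFactorySketch {

    // Hypothetical stand-in for a library version enum (assumption, not Lucene's class).
    public enum Version { LUCENE_36 }

    // Hypothetical stand-in for an analyzer base type.
    public interface Analyzer { String describe(); }

    // Legacy style: exposes a public zero-argument constructor.
    public static class LegacyAnalyzer implements Analyzer {
        public LegacyAnalyzer() {}
        public String describe() { return "legacy"; }
    }

    // Newer style: only a one-argument (Version) constructor exists.
    public static class VersionedAnalyzer implements Analyzer {
        private final Version version;
        public VersionedAnalyzer(Version version) { this.version = version; }
        public String describe() { return "versioned:" + version; }
    }

    // Try the zero-arg constructor first; if the class does not declare one,
    // fall back to the one-arg constructor that takes a Version.
    public static Analyzer createAnalyzer(String className)
            throws ReflectiveOperationException {
        Class<? extends Analyzer> clazz =
                Class.forName(className).asSubclass(Analyzer.class);
        try {
            return clazz.getConstructor().newInstance();
        } catch (NoSuchMethodException e) {
            Constructor<? extends Analyzer> ctor = clazz.getConstructor(Version.class);
            return ctor.newInstance(Version.LUCENE_36);
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(createAnalyzer(LegacyAnalyzer.class.getName()).describe());
        System.out.println(createAnalyzer(VersionedAnalyzer.class.getName()).describe());
    }
}
```

Trying the zero-argument constructor first keeps legacy analyzers working unchanged. A class that offers neither constructor still fails with NoSuchMethodException, as in the trace above.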

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira