You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Stephen McGill (JIRA)" <ji...@apache.org> on 2010/08/06 22:40:16 UTC
[jira] Commented: (MAHOUT-459) Reading an Index from Lucene/Solr
4.0-dev
[ https://issues.apache.org/jira/browse/MAHOUT-459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12896151#action_12896151 ]
Stephen McGill commented on MAHOUT-459:
---------------------------------------
One *important* fix that is needed right now is the ability to grab all documents from Lucene. On Line 171 of the diff, it reads:
+ String a = new String("press");
which grabs all documents with the word "press" - not the intended goal. I have some commented code that might fix this, but I am unable to try it today.
Also, I do not think this is included in this posted diff, but the DefaultAnalyzer class should be deleted.
> Reading an Index from Lucene/Solr 4.0-dev
> -----------------------------------------
>
> Key: MAHOUT-459
> URL: https://issues.apache.org/jira/browse/MAHOUT-459
> Project: Mahout
> Issue Type: Improvement
> Components: Utils
> Affects Versions: 0.4
> Environment: Windows Server 2008 R2 Standard, Cygwin, Solr-trunk, Mahout-trunk
> Reporter: Stephen McGill
> Priority: Minor
> Fix For: 0.4
>
> Attachments: Mahout-Importing-Vectors-Lucene-Solr-4-dev.diff
>
>
> It is not possible to read indexes created by Lucene/Solr 4.0-dev (the trunk development) with the Lucene libraries that are included with Mahout-dev. When adding the new Lucene/Solr 4.0-dev, there are API changes that do not allow Mahout to compile.
> By adapting mahout-utils to fit Lucene/Solr 4.0-dev's API changes, it is possible to read its index.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.