You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Terry Blankers (JIRA)" <ji...@apache.org> on 2014/04/21 18:29:14 UTC

[jira] [Created] (MAHOUT-1521) lucene2seq - Error trying to load data from stored field (when non-indexed)

Terry Blankers created MAHOUT-1521:
--------------------------------------

             Summary: lucene2seq - Error trying to  load data from stored field (when non-indexed)
                 Key: MAHOUT-1521
                 URL: https://issues.apache.org/jira/browse/MAHOUT-1521
             Project: Mahout
          Issue Type: Bug
    Affects Versions: 0.9
            Reporter: Terry Blankers
             Fix For: 1.0


When using lucene2seq to load data from a field that is stored but not indexed I receive the following error:

{noformat}IllegalArgumentException: Field 'body' does not exist in the index{noformat}

Field is described in schema.xml as:

{noformat}<fieldname="body"type="string" stored="true" indexed="false"/>{noformat}

BTW,  field is copied to 'content' field for searching, schema.xml snippet:

{noformat}<copyField source="body" dest="content" />{noformat}

Copy field is described in schema.xml as:

{noformat}<fieldname="content" type="text" stored="false" indexed="true" multiValued="true"/>{noformat}

If I try to load data from the copy field, lucene2seq runs with no errors but I receive empty data for each key/doc:

{noformat}Key class: class org.apache.hadoop.io.Text Value Class: class org.apache.hadoop.io.Text
Key: 96C4C76CF9D7449C724CA77CB8F650EAFD33E31C: Value:
Key: D6842B81B8D09733B50BEDB4767C2A5C49E43B20: Value:{noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)