You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Suneel Marthi (JIRA)" <ji...@apache.org> on 2015/04/21 05:17:59 UTC

[jira] [Created] (MAHOUT-1693) FunctionalMatrixView materializes row vectors in scala shell

Suneel Marthi created MAHOUT-1693:
-------------------------------------

             Summary: FunctionalMatrixView materializes row vectors in scala shell
                 Key: MAHOUT-1693
                 URL: https://issues.apache.org/jira/browse/MAHOUT-1693
             Project: Mahout
          Issue Type: Bug
          Components: Mahout spark shell, Math
    Affects Versions: 0.10.0
            Reporter: Suneel Marthi
            Assignee: Andrew Palumbo
            Priority: Blocker
             Fix For: 0.10.1


FunctionalMatrixView materializes row vectors in scala shell.

Problem first reported by Michael Alton, Intel.

"When I first tried to make a large matrix, I got an out of Java heap space error. I increased the memory incrementally until I got it to work. “export MAHOUT_HEAPSIZE=8000” didn’t work, but “export MAHOUT_HEAPSIZE=64000” did. The question is why do we need so much memory? A 5000x5000 matrix of doubles should only take up ~200MB of space?"

Problem has been narrowed down to not override toString() method in FunctionalMatrixView which causes it to materialize all of the row vectors when run in Mahout Spark Shell.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)