You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Suneel Marthi (JIRA)" <ji...@apache.org> on 2015/04/21 05:17:59 UTC
[jira] [Created] (MAHOUT-1693) FunctionalMatrixView materializes
row vectors in scala shell
Suneel Marthi created MAHOUT-1693:
-------------------------------------
Summary: FunctionalMatrixView materializes row vectors in scala shell
Key: MAHOUT-1693
URL: https://issues.apache.org/jira/browse/MAHOUT-1693
Project: Mahout
Issue Type: Bug
Components: Mahout spark shell, Math
Affects Versions: 0.10.0
Reporter: Suneel Marthi
Assignee: Andrew Palumbo
Priority: Blocker
Fix For: 0.10.1
FunctionalMatrixView materializes row vectors in scala shell.
Problem first reported by Michael Alton, Intel.
"When I first tried to make a large matrix, I got an out of Java heap space error. I increased the memory incrementally until I got it to work. “export MAHOUT_HEAPSIZE=8000” didn’t work, but “export MAHOUT_HEAPSIZE=64000” did. The question is why do we need so much memory? A 5000x5000 matrix of doubles should only take up ~200MB of space?"
Problem has been narrowed down to not override toString() method in FunctionalMatrixView which causes it to materialize all of the row vectors when run in Mahout Spark Shell.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)