You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Pat Ferrel (JIRA)" <ji...@apache.org> on 2015/04/07 19:14:13 UTC

[jira] [Resolved] (MAHOUT-1674) A'A fails getting with an index out of range for a row vector

     [ https://issues.apache.org/jira/browse/MAHOUT-1674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Pat Ferrel resolved MAHOUT-1674.
--------------------------------
    Resolution: Fixed
      Assignee: Pat Ferrel  (was: Dmitriy Lyubimov)

Made change to blas that catch this case, passes one user's test that I was able to reporduce.

> A'A fails getting with an index out of range for a row vector
> -------------------------------------------------------------
>
>                 Key: MAHOUT-1674
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1674
>             Project: Mahout
>          Issue Type: Bug
>          Components: s
>    Affects Versions: 0.10.0
>            Reporter: Pat Ferrel
>            Assignee: Pat Ferrel
>            Priority: Critical
>             Fix For: 0.10.0
>
>
> A'A and possibly A'B can fail with an index out of bounds on the row vector. This seems related to partitioning where some partitions may be empty.
> This can be reproduce with the attached data as input into spark-itemsimilarity. This is only A data and the one large csv will complete correctly but passing in the directory of part files will exhibit the error. The data is identical except in the number of files that are used to contain the data.
> The error occurs using the local raw filesystem and with master = local and is pretty fast to reach. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)