Posted to dev@mahout.apache.org by "Erlend Hamnaberg (JIRA)" <ji...@apache.org> on 2015/02/14 00:00:12 UTC
[jira] [Created] (MAHOUT-1641) Add conversion from a RDD[(String, String)] to a Drm[Int]
Erlend Hamnaberg created MAHOUT-1641:
----------------------------------------
Summary: Add conversion from a RDD[(String, String)] to a Drm[Int]
Key: MAHOUT-1641
URL: https://issues.apache.org/jira/browse/MAHOUT-1641
Project: Mahout
Issue Type: Question
Components: spark
Affects Versions: 1.0
Reporter: Erlend Hamnaberg
Hi.
We are using the cooccurrence part of Mahout as a library. We get our data from other sources, for instance Cassandra. We don't want to write that data to disk and read it back, since we already have the data on each slave.
I have created some conversion functions based on one of the IndexedDatasetSpark readers; I can't remember which one at the moment.
Is there interest in the community for this kind of feature? I can probably clean it up and submit it as a GitHub pull request.
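To make the idea concrete, here is a minimal sketch of what such a conversion might look like. It is not the actual conversion functions mentioned above. Plain Scala collections stand in for the RDD, and a Map[Int, Set[Int]] stands in for the sparse rows of a Drm[Int]. The names PairsToIndexedRows, Indexed, buildDictionary and convert are hypothetical and not part of Mahout.

```scala
// Simplified sketch: a Seq stands in for the Spark RDD, and a
// Map[Int, Set[Int]] stands in for the Int-keyed sparse rows of a DRM.
// All names here are illustrative assumptions, not Mahout API.
object PairsToIndexedRows {

  // Result of the conversion: the Int-keyed rows plus both dictionaries,
  // so that results can be mapped back to the original string IDs.
  case class Indexed(
    rows: Map[Int, Set[Int]],
    rowDict: Map[String, Int],
    colDict: Map[String, Int]
  )

  // Assign each distinct string a dense Int index, in order of first appearance.
  def buildDictionary(ids: Seq[String]): Map[String, Int] =
    ids.distinct.zipWithIndex.toMap

  // Convert (rowId, columnId) string pairs into Int-keyed sparse rows.
  def convert(pairs: Seq[(String, String)]): Indexed = {
    val rowDict = buildDictionary(pairs.map(_._1))
    val colDict = buildDictionary(pairs.map(_._2))
    val rows = pairs
      .map { case (r, c) => (rowDict(r), colDict(c)) }
      .groupBy(_._1)
      .map { case (rowIdx, cells) => rowIdx -> cells.map(_._2).toSet }
    Indexed(rows, rowDict, colDict)
  }
}

// Example input: (user, item) interactions as string pairs.
val pairs = Seq(("u1", "a"), ("u2", "b"), ("u1", "b"))
val indexed = PairsToIndexedRows.convert(pairs)
```

On Spark the same steps would presumably run on the RDD itself, with the rows built as Mahout vectors rather than sets; that part is left out here because it depends on the Mahout version in use.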
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)