You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Andrew Palumbo (JIRA)" <ji...@apache.org> on 2015/04/01 00:56:54 UTC

[jira] [Commented] (MAHOUT-1655) Refactor module dependencies

    [ https://issues.apache.org/jira/browse/MAHOUT-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389570#comment-14389570 ] 

Andrew Palumbo commented on MAHOUT-1655:
----------------------------------------

[~pferrel] I'm actually getting an error in mahout spark-testnb (sorry- I'd only tested spark-trainnb before on this branch)

it looks like t-digest jars are not being picked up in the dependency-reduced jar?... I'm not sure if its from this refactoring.  maybe best to adress after you're done with this? 

{code}
 Exception in thread "main" java.lang.NoClassDefFoundError: com/tdunning/math/stats/TDigest
        at org.apache.mahout.classifier.stats.ResultAnalyzer.<init>(ClassifierStats.scala:64)
        at org.apache.mahout.classifier.naivebayes.NaiveBayes$class.test(NaiveBayes.scala:303)
        at org.apache.mahout.classifier.naivebayes.NaiveBayes$.test(NaiveBayes.scala:336)
        at org.apache.mahout.drivers.TestNBDriver$.process(TestNBDriver.scala:105)
        at org.apache.mahout.drivers.TestNBDriver$$anonfun$main$1.apply(TestNBDriver.scala:77)
        at org.apache.mahout.drivers.TestNBDriver$$anonfun$main$1.apply(TestNBDriver.scala:75)
        at scala.Option.map(Option.scala:145)
...
{code}

> Refactor module dependencies
> ----------------------------
>
>                 Key: MAHOUT-1655
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1655
>             Project: Mahout
>          Issue Type: Improvement
>          Components: mrlegacy
>    Affects Versions: 0.9
>            Reporter: Pat Ferrel
>            Assignee: Andrew Musselman
>            Priority: Critical
>             Fix For: 0.10.0
>
>
> Make a new module, call it mahout-hadoop. Move anything there that is currently in mrlegacy but used in math-scala or spark. Remove dependencies on mrlegacy altogether if possible by using other core classes.
> The goal is to have math-scala and spark module depend on math, and a small module called mahout-hadoop (much smaller than mrlegacy). 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)