You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@systemml.apache.org by "Mike Dusenberry (JIRA)" <ji...@apache.org> on 2017/05/23 19:01:04 UTC

[jira] [Closed] (SYSTEMML-1566) Possible regression from 0.13 -> 0.14 for MNIST LeNet script

     [ https://issues.apache.org/jira/browse/SYSTEMML-1566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mike Dusenberry closed SYSTEMML-1566.
-------------------------------------

> Possible regression from 0.13 -> 0.14 for MNIST LeNet script
> ------------------------------------------------------------
>
>                 Key: SYSTEMML-1566
>                 URL: https://issues.apache.org/jira/browse/SYSTEMML-1566
>             Project: SystemML
>          Issue Type: Bug
>            Reporter: Mike Dusenberry
>            Assignee: Mike Dusenberry
>             Fix For: SystemML 1.0
>
>         Attachments: explain.txt, stats.txt
>
>
> For the 0.14 release testing, I tried out the [MNIST LeNet example | https://github.com/apache/incubator-systemml/blob/master/scripts/nn/examples/mnist_lenet-train.dml] on both 0.13 and 0.14 and noticed a possible regression.  Basically, on 0.14 the script took longer to run and had 2513 Spark instructions executed, while on 0.13 only 864 Spark instructions were executed.  This was run locally on a laptop using the 2 instructions at the top of the script (and copied below).  I've also attached the stats and runtime explain logs.
> 1. Download data
> {code}
> nn/examples/get_mnist_data.sh
> {code}
> 2. Execute from the {{scripts}} directory.
> {code}
> spark-submit --master local[*] --driver-memory 10G --conf spark.driver.maxResultSize=0 --conf spark.rpc.message.maxSize=128 SystemML.jar -f nn/examples/mnist_lenet-train.dml -stats -explain -nvargs train=nn/examples/data/mnist/mnist_train.csv test=nn/examples/data/mnist/mnist_test.csv C=1 Hin=28 Win=28 epochs=1 out_dir=nn/examples/model/mnist_lenet
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)