Posted to user@spark.apache.org by FireFly <zh...@bankofamerica.com> on 2018/04/03 19:56:58 UTC

Re: Issue with using Generalized Linear Regression for Logistic Regression modeling

It turns out that the weights were too large (mean around 5000, standard
deviation around 8000), which caused an overflow. After scaling the weights
down to, for example, values between 0 and 1, the model converged nicely.
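
For anyone who hits the same thing, here is a minimal sketch of one way to do
that kind of rescaling, in plain Python rather than Spark. The helper name
scale_weights and the sample values are made up for illustration, and dividing
by the maximum is only one possible choice; it is not necessarily the exact
transformation we used. In Spark the same idea would be applied to whichever
column is passed as the weight column before fitting.

```python
def scale_weights(weights):
    """Divide every weight by the largest one so all values fall in [0, 1].

    Assumes the weights are non-negative, as observation weights normally are.
    """
    if not weights:
        return []
    largest = max(weights)
    if largest <= 0:
        raise ValueError("expected at least one positive weight")
    return [w / largest for w in weights]


# Hypothetical weights on roughly the scale described above.
raw = [5000.0, 13000.0, 250.0, 8000.0]
scaled = scale_weights(raw)
print(scaled)
```

Dividing by a single constant keeps the ratios between observations intact,
so the relative importance of each row stays the same.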

Spark did not report the overflow. We found it by running the same data set
through R.


