You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Apache Spark (Jira)" <ji...@apache.org> on 2020/05/11 02:27:00 UTC

[jira] [Commented] (SPARK-31668) Saving and loading HashingTF leads to hash function changed

    [ https://issues.apache.org/jira/browse/SPARK-31668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17104009#comment-17104009 ] 

Apache Spark commented on SPARK-31668:
--------------------------------------

User 'WeichenXu123' has created a pull request for this issue:
https://github.com/apache/spark/pull/28413

> Saving and loading HashingTF leads to hash function changed
> -----------------------------------------------------------
>
>                 Key: SPARK-31668
>                 URL: https://issues.apache.org/jira/browse/SPARK-31668
>             Project: Spark
>          Issue Type: Bug
>          Components: ML
>    Affects Versions: 3.0.0, 3.0.1, 3.1.0
>            Reporter: Weichen Xu
>            Assignee: Weichen Xu
>            Priority: Blocker
>
> If we use spark 2.x save HashingTF, and then use spark 3.0 load it, and then use spark 3.0 to save it again, and then use spark 3.0 to load it again, the hash function will be changed.
> This bug is hard to debug, we need to fix it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org