You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Yan Facai (颜发才 JIRA)" <ji...@apache.org> on 2017/05/18 08:45:04 UTC

[jira] [Commented] (SPARK-20768) PySpark FPGrowth does not expose numPartitions (expert) param

    [ https://issues.apache.org/jira/browse/SPARK-20768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16015437#comment-16015437 ] 

Yan Facai (颜发才) commented on SPARK-20768:
-----------------------------------------

Hi, I'm newbie.
`numPartitions` is found in pyspark code, could you explain more details?
thanks.

```python
    def __init__(self, minSupport=0.3, minConfidence=0.8, itemsCol="items",
                 predictionCol="prediction", numPartitions=None):
```

> PySpark FPGrowth does not expose numPartitions (expert)  param
> --------------------------------------------------------------
>
>                 Key: SPARK-20768
>                 URL: https://issues.apache.org/jira/browse/SPARK-20768
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML, PySpark
>    Affects Versions: 2.2.0
>            Reporter: Nick Pentreath
>            Priority: Minor
>
> The PySpark API for {{FPGrowth}} does not expose the {{numPartitions}} param. While it is an "expert" param, the general approach elsewhere is to expose these on the Python side (e.g. {{aggregationDepth}} and intermediate storage params in {{ALS}})



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org