You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Wenchen Fan (JIRA)" <ji...@apache.org> on 2018/07/04 12:06:00 UTC

[jira] [Resolved] (SPARK-24727) The cache 100 in CodeGenerator is too small for streaming

     [ https://issues.apache.org/jira/browse/SPARK-24727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wenchen Fan resolved SPARK-24727.
---------------------------------
       Resolution: Fixed
    Fix Version/s: 2.4.0

Issue resolved by pull request 21705
[https://github.com/apache/spark/pull/21705]

> The cache 100 in CodeGenerator is too small for streaming
> ---------------------------------------------------------
>
>                 Key: SPARK-24727
>                 URL: https://issues.apache.org/jira/browse/SPARK-24727
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 2.3.1
>            Reporter: ant_nebula
>            Assignee: Takeshi Yamamuro
>            Priority: Major
>             Fix For: 2.4.0
>
>
> {code:java}
> org.apache.spark.sql.catalyst.expressions.codegen.CodeGenerator 
> private val cache = CacheBuilder.newBuilder().maximumSize(100).build{code}
> The cache 100 in CodeGenerator is too small for realtime streaming calculation, although is ok for offline calculation. Because realtime streaming calculation is mostly more complex in one driver, and performance sensitive.
> I suggest spark support configging for user with default 100, such as spark.codegen.cache=1000
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org