You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2019/05/21 04:20:13 UTC

[jira] [Updated] (SPARK-12178) Expose reporting of StreamInputInfo for custom made streams

     [ https://issues.apache.org/jira/browse/SPARK-12178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hyukjin Kwon updated SPARK-12178:
---------------------------------
    Labels: bulk-closed  (was: )

> Expose reporting of StreamInputInfo for custom made streams
> -----------------------------------------------------------
>
>                 Key: SPARK-12178
>                 URL: https://issues.apache.org/jira/browse/SPARK-12178
>             Project: Spark
>          Issue Type: Improvement
>          Components: DStreams
>            Reporter: Rodrigo Boavida
>            Priority: Minor
>              Labels: bulk-closed
>
> For custom made direct streams, the Spark Streaming context needs to be informed of the RDD count per batch execution. This is not exposed by the InputDStream abstract class. 
> The suggestion is to create a method in the InputDStream class that reports to the streaming context and make that available to child classes of InputDStream.
> Signature example:
> def reportInfo(validTime : org.apache.spark.streaming.Time, inputInfo : org.apache.spark.streaming.scheduler.StreamInputInfo)
> I have already done this on my own private branch. I can merge that change in if approval is given.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org