Posted to reviews@spark.apache.org by jose-torres <gi...@git.apache.org> on 2018/02/01 23:40:48 UTC

[GitHub] spark pull request #20445: [SPARK-23092][SQL] Migrate MemoryStream to DataSo...

Github user jose-torres commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20445#discussion_r165222931
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/MicroBatchExecution.scala ---
    @@ -270,16 +270,17 @@ class MicroBatchExecution(
                 }
               case s: MicroBatchReader =>
                 updateStatusMessage(s"Getting offsets from $s")
    -            reportTimeTaken("getOffset") {
    -            // Once v1 streaming source execution is gone, we can refactor this away.
    -            // For now, we set the range here to get the source to infer the available end offset,
    -            // get that offset, and then set the range again when we later execute.
    -            s.setOffsetRange(
    -              toJava(availableOffsets.get(s).map(off => s.deserializeOffset(off.json))),
    -              Optional.empty())
    -
    -              (s, Some(s.getEndOffset))
    +            reportTimeTaken("setOffsetRange") {
    --- End diff --
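    
    For context on the removed inline comment: it describes a two-phase use of the
    v2 MicroBatchReader API, where the offset range is first set with an empty end
    so the source infers the latest available offset, which is then read back. A
    minimal sketch of that pattern, assuming the Spark 2.3-era MicroBatchReader
    interface; the helper name and its parameters are illustrative, not from the PR:
    
        import java.util.Optional
        import org.apache.spark.sql.sources.v2.reader.streaming.{MicroBatchReader, Offset}
    
        // Illustrative helper (not in the PR): ask a v2 source for its latest
        // available offset by setting a range with an empty end, then reading it back.
        def inferLatestOffset(reader: MicroBatchReader, lastSeen: Option[Offset]): Offset = {
          reader.setOffsetRange(
            Optional.ofNullable(lastSeen.orNull), // resume from the last known offset, if any
            Optional.empty[Offset]())             // empty end => source infers the latest offset
          // Read back the inferred end; the range is set again when the batch executes.
          reader.getEndOffset()
        }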
    
    I agree that the old metric names don't make much sense anymore, but I worry about changing externally visible behavior (the metric names users see in query progress) as part of an API migration.
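    
    The worry is concrete because these timer labels are not purely internal: each
    reportTimeTaken name surfaces as a key in the durationMs map of
    StreamingQueryProgress. A hedged sketch of monitoring code that would break on
    the rename; the query itself is hypothetical:
    
        import org.apache.spark.sql.streaming.StreamingQuery
    
        // Hypothetical monitoring snippet written against the old metric name.
        // After the rename, the "getOffset" key disappears and this returns null.
        val query: StreamingQuery = ??? // some running streaming query
        val getOffsetMillis: java.lang.Long = query.lastProgress.durationMs.get("getOffset")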

