Posted to commits@hudi.apache.org by "Alexey Kudinkin (Jira)" <ji...@apache.org> on 2022/04/06 21:34:00 UTC

[jira] [Created] (HUDI-3811) Restore Spark metrics for Hudi own Relations

Alexey Kudinkin created HUDI-3811:
-------------------------------------

             Summary: Restore Spark metrics for Hudi own Relations
                 Key: HUDI-3811
                 URL: https://issues.apache.org/jira/browse/HUDI-3811
             Project: Apache Hudi
          Issue Type: Bug
            Reporter: Alexey Kudinkin
            Assignee: Alexey Kudinkin
             Fix For: 0.12.0
         Attachments: Screen Shot 2022-04-06 at 2.20.29 PM.png

After rebasing Hudi away from `HadoopFsRelation` onto its own bespoke `HoodieBaseRelation`, we lost all of the useful metrics that Spark was providing for it. This happens primarily because Spark checks whether the relation is a `HadoopFsRelation` to decide which scan it will execute, `FileScan` or `DataScan` (only `FileScan` carries file-specific info, while `DataScan` does not):

!Screen Shot 2022-04-06 at 2.20.29 PM.png!
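
To make the cause concrete, here is a minimal, self-contained sketch of the dispatch described above. It is not Spark's or Hudi's actual code: the types below are hypothetical stand-ins that only borrow the names used in this ticket, and `planScan` is an assumed simplification of Spark's planning logic.

```java
public class ScanPlanningSketch {
    // Hypothetical stand-ins; these are NOT the real Spark/Hudi classes.
    interface BaseRelation {}
    static final class HadoopFsRelation implements BaseRelation {}
    static final class HoodieBaseRelation implements BaseRelation {}

    // Assumed simplification: the planner picks the scan purely by relation type.
    static String planScan(BaseRelation relation) {
        if (relation instanceof HadoopFsRelation) {
            // Only this branch yields the scan that carries file-specific info.
            return "FileScan";
        }
        // Any other relation, including Hudi's own, falls through to the generic scan.
        return "DataScan";
    }

    public static void main(String[] args) {
        System.out.println(planScan(new HadoopFsRelation()));
        System.out.println(planScan(new HoodieBaseRelation()));
    }
}
```

Under this model, any relation that is not a `HadoopFsRelation` ends up on the `DataScan` path, which is why the file-specific metrics disappear once Hudi supplies its own relation.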


