You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Alexey Kudinkin (Jira)" <ji...@apache.org> on 2022/04/06 21:34:00 UTC
[jira] [Created] (HUDI-3811) Restore Spark metrics for Hudi own Relations
Alexey Kudinkin created HUDI-3811:
-------------------------------------
Summary: Restore Spark metrics for Hudi own Relations
Key: HUDI-3811
URL: https://issues.apache.org/jira/browse/HUDI-3811
Project: Apache Hudi
Issue Type: Bug
Reporter: Alexey Kudinkin
Assignee: Alexey Kudinkin
Fix For: 0.12.0
Attachments: Screen Shot 2022-04-06 at 2.20.29 PM.png
After rebasing Hudi away from `HadoopFsRelation` onto its own bespoke `HoodieBaseRelation`, we now lost all good metrics that Spark was providing for it, which occurred primarily due to the fact that Spark is predicating on `HadoopFsRelation` to decide which task it will be executing `FileScan` or `DataScan` (only `FileScan` has file specific info, while `DataScan` does not):
!Screen Shot 2022-04-06 at 2.20.29 PM.png!
--
This message was sent by Atlassian Jira
(v8.20.1#820001)