You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Matt Pouttu (Jira)" <ji...@apache.org> on 2021/05/01 00:16:00 UTC
[jira] [Created] (HUDI-1873) collect() call causing issues with
very large upserts
Matt Pouttu created HUDI-1873:
---------------------------------
Summary: collect() call causing issues with very large upserts
Key: HUDI-1873
URL: https://issues.apache.org/jira/browse/HUDI-1873
Project: Apache Hudi
Issue Type: Bug
Components: Spark Integration
Affects Versions: 0.8.0, 0.7.0
Environment: EMR 5.28 Spark 11
Reporter: Matt Pouttu
Fix For: 0.9.0
A collect call causes resource issues with very large upserts, and is only used for reporting error messages that are already in the spark task logs. I replaced it with a .isEmpty() call and amended the error message to direct the user to the task logs.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)