You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/09/07 19:29:39 UTC

[GitHub] [beam] KevinGG commented on pull request #22587: WIP: Dataframe API ML preprocessing notebook

KevinGG commented on PR #22587:
URL: https://github.com/apache/beam/pull/22587#issuecomment-1239789553

   > > > I think the "No producer" error is a bug with PCollection pruning in interactive beam: #21430
   > > > CC: @KevinGG
   > > 
   > > 
   > > Commented in #21430, we can disable pruning for dataframe like what we did for TestStream: https://github.com/apache/beam/blob/master/sdks/python/apache_beam/runners/interactive/pipeline_fragment.py#L219
   > 
   > Is there any update on this or a potential workaround for merging Deferred dataframes?
   
   Just sent out https://github.com/apache/beam/pull/23069, this should mitigate the unintended pruning issues.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org