You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Ryan Thompson (Jira)" <ji...@apache.org> on 2022/05/31 15:51:00 UTC

[jira] [Created] (BEAM-14535) Add support for Pandas Dataframes to sklearn RunInference Implementation

Ryan Thompson created BEAM-14535:
------------------------------------

             Summary: Add support for Pandas Dataframes to sklearn RunInference Implementation
                 Key: BEAM-14535
                 URL: https://issues.apache.org/jira/browse/BEAM-14535
             Project: Beam
          Issue Type: Sub-task
          Components: sdk-py-core
            Reporter: Ryan Thompson
            Assignee: Ryan Thompson


Sklearn pipelines are often set up to take pandas dataframes.

 

Our current implementation only supports numpy arrays.

 

This FR allows the sklearn implementation to autodetect pandas dataframes or numpy arrays and then combine them (via concat).

 

In the case of a pandas dataframe that value will be passed through to the pipeline.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)