You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@beam.apache.org by OrielResearch Eila Arich-Landkof <ei...@orielresearch.org> on 2018/02/23 17:02:45 UTC

Processing genomics data from bigQuery prior to training model

Hi all,

I am looking for a good reference for processing data prior to training a
model using APACHE BEAM
*Phase1:*
 30K+ columns of features, partitioned between big query tables - each of
10K, and 100K+ rows.

*Phase 2:*
more columns and more rows

any reference is highly appreciated.

Thank you,
Eila

-- 
Eila
www.orielresearch.org
https://www.meetup.com/Deep-Learning-In-Production/