Posted to dev@beam.apache.org by Pablo Estrada <pa...@google.com> on 2018/08/20 17:14:44 UTC

Should we mention TF Transform in Beam site?

Other projects mention their ML / graph / misc tooling libraries on their
websites.
It may be good for Beam to direct people to TensorFlow Transform[1] if
they want to use Beam for ML.
What do people think?

Best
-P.

[1] https://www.tensorflow.org/tfx/transform/

Re: Should we mention TF Transform in Beam site?

Posted by Matthias Feys <ma...@ml6.eu>.
I also published a minimal boilerplate example on GitHub for using
tf.Transform with Apache Beam and ML Engine:
https://github.com/Fematich/tftransform-demo
with an accompanying blog post:
https://cloud.google.com/blog/products/ai-machine-learning/pre-processing-tensorflow-pipelines-tftransform-google-cloud

Happy to share it elsewhere as well, so that more people can start using
tf.Transform :-)

On Thu, Sep 6, 2018 at 19:39, David Cavazos <dc...@google.com> wrote:

> There are several examples on using TFT on the cloudml-samples repo
> <https://github.com/GoogleCloudPlatform/cloudml-samples>. Particularly,
> I've been working on an end-to-end sample
> <https://github.com/GoogleCloudPlatform/cloudml-samples/tree/master/molecules>
> using Beam+TFT to do the preprocessing. I recently added another Beam
> pipeline to do both batch predictions and streaming predictions. It's not
> merged yet, but here's the PR
> <https://github.com/GoogleCloudPlatform/cloudml-samples/pull/240> for the
> new version if anyone is interested.
>
> I'll be updating the README to include links to our existing samples as
> well as external samples.
>
> On Wed, Sep 5, 2018 at 1:59 AM Alexey Romanenko <ar...@gmail.com>
> wrote:
>
>> +1
>> Perhaps, it would be also helpful to add a small “get started” example to
>> attract user attention.
>>
>>
>> On 5 Sep 2018, at 02:16, Ahmet Altay <al...@google.com> wrote:
>>
>> +1
>>
>> It would be nice to showcase other tools built on top of Beam.
>>
>> On Mon, Aug 20, 2018 at 1:02 PM, Ankur Goenka <go...@google.com> wrote:
>>
>>> +1
>>> Adding to Thomas's suggestion, we can also add the future plans to keep
>>> people excited about what's in store.
>>>
>>>
>>> On Mon, Aug 20, 2018 at 11:09 AM Thomas Weise <th...@apache.org> wrote:
>>>
>>>> +1
>>>>
>>>> It would also be helpful to mention important current restrictions wrt
>>>> availability on runners, etc.
>>>>
>>>> On Mon, Aug 20, 2018 at 10:45 AM Pascal Gula <pa...@plantix.net>
>>>> wrote:
>>>>
>>>>> fully agree!
>>>>>
>>>>> On Mon, Aug 20, 2018 at 7:23 PM, Rui Wang <ru...@google.com> wrote:
>>>>>
>>>>>> +1 to add it on Beam website.
>>>>>>
>>>>>>
>>>>>>
>>>>>> -Rui
>>>>>>
>>>>>> On Mon, Aug 20, 2018 at 10:15 AM Pablo Estrada <pa...@google.com>
>>>>>> wrote:
>>>>>>
>>>>>>> Other projects mention their ML / Graph / misc tooling libraries in
>>>>>>> their websites.
>>>>>>> It may be good for Beam to direct people to use Tensorflow
>>>>>>> Transform[1] if they want to use beam for ML?
>>>>>>> What do people think?
>>>>>>>
>>>>>>> Best
>>>>>>> -P.
>>>>>>>
>>>>>>> [1] https://www.tensorflow.org/tfx/transform/
>>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>> --
>>>>>
>>>>> Pascal Gula
>>>>> Senior Data Engineer / Scientist
>>>>> +49 (0)176 34232684
>>>>> www.plantix.net <http://plantix.net/>
>>>>> PEAT GmbH
>>>>> Kastanienallee 4
>>>>> 10435 Berlin // Germany
>>>>> Download the App! <https://play.google.com/store/apps/details?id=com.peat.GartenBank>
>>>>>
>>>>>
>>
>> --

Matthias Feys
ML6 Gent

Re: Should we mention TF Transform in Beam site?

Posted by David Cavazos <dc...@google.com>.
There are several examples of using TFT in the cloudml-samples repo
<https://github.com/GoogleCloudPlatform/cloudml-samples>. Particularly,
I've been working on an end-to-end sample
<https://github.com/GoogleCloudPlatform/cloudml-samples/tree/master/molecules>
using Beam+TFT to do the preprocessing. I recently added another Beam
pipeline to do both batch predictions and streaming predictions. It's not
merged yet, but here's the PR
<https://github.com/GoogleCloudPlatform/cloudml-samples/pull/240> for the
new version if anyone is interested.

I'll be updating the README to include links to our existing samples as
well as external samples.

On Wed, Sep 5, 2018 at 1:59 AM Alexey Romanenko <ar...@gmail.com>
wrote:

> +1
> Perhaps, it would be also helpful to add a small “get started” example to
> attract user attention.
>
>
> On 5 Sep 2018, at 02:16, Ahmet Altay <al...@google.com> wrote:
>
> +1
>
> It would be nice to showcase other tools built on top of Beam.
>
> On Mon, Aug 20, 2018 at 1:02 PM, Ankur Goenka <go...@google.com> wrote:
>
>> +1
>> Adding to Thomas's suggestion, we can also add the future plans to keep
>> people excited about what's in store.
>>
>>
>> On Mon, Aug 20, 2018 at 11:09 AM Thomas Weise <th...@apache.org> wrote:
>>
>>> +1
>>>
>>> It would also be helpful to mention important current restrictions wrt
>>> availability on runners, etc.
>>>
>>> On Mon, Aug 20, 2018 at 10:45 AM Pascal Gula <pa...@plantix.net> wrote:
>>>
>>>> fully agree!
>>>>
>>>> On Mon, Aug 20, 2018 at 7:23 PM, Rui Wang <ru...@google.com> wrote:
>>>>
>>>>> +1 to add it on Beam website.
>>>>>
>>>>>
>>>>>
>>>>> -Rui
>>>>>
>>>>> On Mon, Aug 20, 2018 at 10:15 AM Pablo Estrada <pa...@google.com>
>>>>> wrote:
>>>>>
>>>>>> Other projects mention their ML / Graph / misc tooling libraries in
>>>>>> their websites.
>>>>>> It may be good for Beam to direct people to use Tensorflow
>>>>>> Transform[1] if they want to use beam for ML?
>>>>>> What do people think?
>>>>>>
>>>>>> Best
>>>>>> -P.
>>>>>>
>>>>>> [1] https://www.tensorflow.org/tfx/transform/
>>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>>
>>>> Pascal Gula
>>>> Senior Data Engineer / Scientist
>>>> +49 (0)176 34232684
>>>> www.plantix.net <http://plantix.net/>
>>>> PEAT GmbH
>>>> Kastanienallee 4
>>>> 10435 Berlin // Germany
>>>> Download the App! <https://play.google.com/store/apps/details?id=com.peat.GartenBank>
>>>>
>>>>
>
>

Re: Should we mention TF Transform in Beam site?

Posted by Alexey Romanenko <ar...@gmail.com>.
+1
Perhaps it would also be helpful to add a small “get started” example to attract user attention.

> On 5 Sep 2018, at 02:16, Ahmet Altay <al...@google.com> wrote:
> 
> +1
> 
> It would be nice to showcase other tools built on top of Beam.
> 
> On Mon, Aug 20, 2018 at 1:02 PM, Ankur Goenka <goenka@google.com> wrote:
> +1
> Adding to Thomas's suggestion, we can also add the future plans to keep people excited about what's in store.
> 
> 
> On Mon, Aug 20, 2018 at 11:09 AM Thomas Weise <thw@apache.org> wrote:
> +1
> 
> It would also be helpful to mention important current restrictions wrt availability on runners, etc.
> 
> On Mon, Aug 20, 2018 at 10:45 AM Pascal Gula <pascal@plantix.net> wrote:
> fully agree!
> 
> On Mon, Aug 20, 2018 at 7:23 PM, Rui Wang <ruwang@google.com> wrote:
> +1 to add it on Beam website.
> 
> 
> 
> -Rui
> 
> On Mon, Aug 20, 2018 at 10:15 AM Pablo Estrada <pabloem@google.com> wrote:
> Other projects mention their ML / Graph / misc tooling libraries in their websites.
> It may be good for Beam to direct people to use Tensorflow Transform[1] if they want to use beam for ML?
> What do people think?
> 
> Best
> -P.
> 
> [1] https://www.tensorflow.org/tfx/transform/
> 
> 
> -- 
> Pascal Gula
> Senior Data Engineer / Scientist
> +49 (0)176 34232684
> www.plantix.net <http://plantix.net/>
>  
> PEAT GmbH
> Kastanienallee 4
> 10435 Berlin // Germany
> 
>  <https://play.google.com/store/apps/details?id=com.peat.GartenBank>
> Download the App! <https://play.google.com/store/apps/details?id=com.peat.GartenBank>


Re: Should we mention TF Transform in Beam site?

Posted by Ahmet Altay <al...@google.com>.
+1

It would be nice to showcase other tools built on top of Beam.

On Mon, Aug 20, 2018 at 1:02 PM, Ankur Goenka <go...@google.com> wrote:

> +1
> Adding to Thomas's suggestion, we can also add the future plans to keep
> people excited about what's in store.
>
>
> On Mon, Aug 20, 2018 at 11:09 AM Thomas Weise <th...@apache.org> wrote:
>
>> +1
>>
>> It would also be helpful to mention important current restrictions wrt
>> availability on runners, etc.
>>
>> On Mon, Aug 20, 2018 at 10:45 AM Pascal Gula <pa...@plantix.net> wrote:
>>
>>> fully agree!
>>>
>>> On Mon, Aug 20, 2018 at 7:23 PM, Rui Wang <ru...@google.com> wrote:
>>>
>>>> +1 to add it on Beam website.
>>>>
>>>>
>>>>
>>>> -Rui
>>>>
>>>> On Mon, Aug 20, 2018 at 10:15 AM Pablo Estrada <pa...@google.com>
>>>> wrote:
>>>>
>>>>> Other projects mention their ML / Graph / misc tooling libraries in
>>>>> their websites.
>>>>> It may be good for Beam to direct people to use Tensorflow
>>>>> Transform[1] if they want to use beam for ML?
>>>>> What do people think?
>>>>>
>>>>> Best
>>>>> -P.
>>>>>
>>>>> [1] https://www.tensorflow.org/tfx/transform/
>>>>>
>>>>
>>>
>>>
>>> --
>>>
>>> Pascal Gula
>>> Senior Data Engineer / Scientist
>>> +49 (0)176 34232684
>>> www.plantix.net <http://plantix.net/>
>>> PEAT GmbH
>>> Kastanienallee 4
>>> 10435 Berlin // Germany
>>> Download the App! <https://play.google.com/store/apps/details?id=com.peat.GartenBank>
>>>
>>>

Re: Should we mention TF Transform in Beam site?

Posted by Ankur Goenka <go...@google.com>.
+1
Adding to Thomas's suggestion, we can also add future plans to keep
people excited about what's in store.


On Mon, Aug 20, 2018 at 11:09 AM Thomas Weise <th...@apache.org> wrote:

> +1
>
> It would also be helpful to mention important current restrictions wrt
> availability on runners, etc.
>
> On Mon, Aug 20, 2018 at 10:45 AM Pascal Gula <pa...@plantix.net> wrote:
>
>> fully agree!
>>
>> On Mon, Aug 20, 2018 at 7:23 PM, Rui Wang <ru...@google.com> wrote:
>>
>>> +1 to add it on Beam website.
>>>
>>>
>>>
>>> -Rui
>>>
>>> On Mon, Aug 20, 2018 at 10:15 AM Pablo Estrada <pa...@google.com>
>>> wrote:
>>>
>>>> Other projects mention their ML / Graph / misc tooling libraries in
>>>> their websites.
>>>> It may be good for Beam to direct people to use Tensorflow Transform[1]
>>>> if they want to use beam for ML?
>>>> What do people think?
>>>>
>>>> Best
>>>> -P.
>>>>
>>>> [1] https://www.tensorflow.org/tfx/transform/
>>>>
>>>
>>
>>
>> --
>>
>> Pascal Gula
>> Senior Data Engineer / Scientist
>> +49 (0)176 34232684
>> www.plantix.net <http://plantix.net/>
>> PEAT GmbH
>> Kastanienallee 4
>> 10435 Berlin // Germany
>> Download the App! <https://play.google.com/store/apps/details?id=com.peat.GartenBank>
>>
>>

Re: Should we mention TF Transform in Beam site?

Posted by Thomas Weise <th...@apache.org>.
+1

It would also be helpful to mention important current restrictions wrt
availability on runners, etc.

On Mon, Aug 20, 2018 at 10:45 AM Pascal Gula <pa...@plantix.net> wrote:

> fully agree!
>
> On Mon, Aug 20, 2018 at 7:23 PM, Rui Wang <ru...@google.com> wrote:
>
>> +1 to add it on Beam website.
>>
>>
>>
>> -Rui
>>
>> On Mon, Aug 20, 2018 at 10:15 AM Pablo Estrada <pa...@google.com>
>> wrote:
>>
>>> Other projects mention their ML / Graph / misc tooling libraries in
>>> their websites.
>>> It may be good for Beam to direct people to use Tensorflow Transform[1]
>>> if they want to use beam for ML?
>>> What do people think?
>>>
>>> Best
>>> -P.
>>>
>>> [1] https://www.tensorflow.org/tfx/transform/
>>>
>>
>
>
> --
>
> Pascal Gula
> Senior Data Engineer / Scientist
> +49 (0)176 34232684
> www.plantix.net <http://plantix.net/>
> PEAT GmbH
> Kastanienallee 4
> 10435 Berlin // Germany
> Download the App! <https://play.google.com/store/apps/details?id=com.peat.GartenBank>
>
>

Re: Should we mention TF Transform in Beam site?

Posted by Pascal Gula <pa...@plantix.net>.
fully agree!

On Mon, Aug 20, 2018 at 7:23 PM, Rui Wang <ru...@google.com> wrote:

> +1 to add it on Beam website.
>
>
>
> -Rui
>
> On Mon, Aug 20, 2018 at 10:15 AM Pablo Estrada <pa...@google.com> wrote:
>
>> Other projects mention their ML / Graph / misc tooling libraries in their
>> websites.
>> It may be good for Beam to direct people to use Tensorflow Transform[1]
>> if they want to use beam for ML?
>> What do people think?
>>
>> Best
>> -P.
>>
>> [1] https://www.tensorflow.org/tfx/transform/
>>
>


-- 

Pascal Gula
Senior Data Engineer / Scientist
+49 (0)176 34232684
www.plantix.net <http://plantix.net/>
PEAT GmbH
Kastanienallee 4
10435 Berlin // Germany
Download the App! <https://play.google.com/store/apps/details?id=com.peat.GartenBank>

Re: Should we mention TF Transform in Beam site?

Posted by Rui Wang <ru...@google.com>.
+1 to add it on Beam website.



-Rui

On Mon, Aug 20, 2018 at 10:15 AM Pablo Estrada <pa...@google.com> wrote:

> Other projects mention their ML / Graph / misc tooling libraries in their
> websites.
> It may be good for Beam to direct people to use Tensorflow Transform[1] if
> they want to use beam for ML?
> What do people think?
>
> Best
> -P.
>
> [1] https://www.tensorflow.org/tfx/transform/
>