You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by gi...@apache.org on 2022/08/18 22:17:02 UTC

[beam] branch asf-site updated: Publishing website 2022/08/18 22:16:53 at commit 062a0d2

This is an automated email from the ASF dual-hosted git repository.

git-site-role pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/asf-site by this push:
     new 1f44beb4c24 Publishing website 2022/08/18 22:16:53 at commit 062a0d2
1f44beb4c24 is described below

commit 1f44beb4c2405795ebf1e3546afd87ffd4e261fc
Author: jenkins <bu...@apache.org>
AuthorDate: Thu Aug 18 22:16:53 2022 +0000

    Publishing website 2022/08/18 22:16:53 at commit 062a0d2
---
 .../get-started/from-spark/index.html              |  4 +-
 website/generated-content/get-started/index.xml    | 48 ++++++++++------------
 website/generated-content/sitemap.xml              |  2 +-
 3 files changed, 25 insertions(+), 29 deletions(-)

diff --git a/website/generated-content/get-started/from-spark/index.html b/website/generated-content/get-started/from-spark/index.html
index ee66d4b22aa..1acd6de5c8b 100644
--- a/website/generated-content/get-started/from-spark/index.html
+++ b/website/generated-content/get-started/from-spark/index.html
@@ -69,7 +69,7 @@ This serves both as comments and makes your pipeline easier to debug.</p><p>This
         <span class=o>|</span> <span class=s1>&#39;Multiply by two&#39;</span> <span class=o>&gt;&gt;</span> <span class=n>beam</span><span class=o>.</span><span class=n>Map</span><span class=p>(</span><span class=k>lambda</span> <span class=n>x</span><span class=p>:</span> <span class=n>x</span> <span class=o>*</span> <span class=mi>2</span><span class=p>)</span>
         <span class=o>|</span> <span class=s1>&#39;Sum everything&#39;</span> <span class=o>&gt;&gt;</span> <span class=n>beam</span><span class=o>.</span><span class=n>CombineGlobally</span><span class=p>(</span><span class=nb>sum</span><span class=p>)</span>
         <span class=o>|</span> <span class=s1>&#39;Print results&#39;</span> <span class=o>&gt;&gt;</span> <span class=n>beam</span><span class=o>.</span><span class=n>Map</span><span class=p>(</span><span class=k>print</span><span class=p>)</span>
-    <span class=p>)</span></code></pre></div></div></div><h2 id=setup>Setup</h2><p>Here&rsquo;s a comparison on how to get started both in PySpark and Beam.</p><div class=table-wrapper><table><tr><th></th><th>PySpark</th><th>Beam</th></tr><tr><td><b>Install</b></td><td><code>$ pip install pyspark</code></td><td><code>$ pip install apache-beam</code></td></tr><tr><td><b>Imports</b></td><td><code>import pyspark</code></td><td><code>import apache_beam as beam</code></td></tr><tr><td><b>Crea [...]
+    <span class=p>)</span></code></pre></div></div></div><h2 id=setup>Setup</h2><p>Here&rsquo;s a comparison on how to get started both in PySpark and Beam.</p><div class=table-wrapper><table style=width:100%><tr><th style=width:20%></th><th style=width:40%>PySpark</th><th style=width:40%>Beam</th></tr><tr><td><b>Install</b></td><td><code>$ pip install pyspark</code></td><td><code>$ pip install apache-beam</code></td></tr><tr><td><b>Imports</b></td><td><code>import pyspark</code></td><td [...]
 <a href=/documentation/transforms/python/overview>Python transform gallery</a>.</p></blockquote><h2 id=using-calculated-values>Using calculated values</h2><p>Since we are working in potentially distributed environments,
 we can&rsquo;t guarantee that the results we&rsquo;ve calculated are available at any given machine.</p><p>In PySpark, we can get a result from a collection of elements (RDD) by using
 <code>data.collect()</code>, or other aggregations such as <code>reduce()</code>, <code>count()</code>, and more.</p><p>Here&rsquo;s an example to scale numbers into a range between zero and one.</p><div class="language-py snippet"><div class="notebook-skip code-snippet"><a class=copy type=button data-bs-toggle=tooltip data-bs-placement=bottom title="Copy to clipboard"><img src=/images/copy-icon.svg></a><div class=highlight><pre class=chroma><code class=language-py data-lang=py><span cla [...]
@@ -108,7 +108,7 @@ and access them as an <a href=https://docs.python.org/3/glossary.html#term-itera
     <span class=n>scaled_values</span> <span class=o>|</span> <span class=n>beam</span><span class=o>.</span><span class=n>Map</span><span class=p>(</span><span class=k>print</span><span class=p>)</span></code></pre></div></div></div><blockquote><p>ℹ️ In Beam we need to pass a side input explicitly, but we get the
 benefit that a reduction or aggregation does <em>not</em> have to fit into memory.
 Lazily computing side inputs also allows us to compute <code>values</code> only once,
-rather than for each distinct reduction (or requiring explicit caching of the RDD).</p></blockquote><h2 id=next-steps>Next Steps</h2><ul><li>Take a look at all the available transforms in the <a href=/documentation/transforms/python/overview>Python transform gallery</a>.</li><li>Learn how to read from and write to files in the <a href=/documentation/programming-guide/#pipeline-io><em>Pipeline I/O</em> section of the <em>Programming guide</em></a></li><li>Walk through additional WordCount [...]
+rather than for each distinct reduction (or requiring explicit caching of the RDD).</p></blockquote><h2 id=next-steps>Next Steps</h2><ul><li>Take a look at all the available transforms in the <a href=/documentation/transforms/python/overview>Python transform gallery</a>.</li><li>Learn how to read from and write to files in the <a href=/documentation/programming-guide/#pipeline-io><em>Pipeline I/O</em> section of the <em>Programming guide</em></a></li><li>Walk through additional WordCount [...]
 <a href=http://www.apache.org>The Apache Software Foundation</a>
 | <a href=/privacy_policy>Privacy Policy</a>
 | <a href=/feed.xml>RSS Feed</a><br><br>Apache Beam, Apache, Beam, the Beam logo, and the Apache feather logo are either registered trademarks or trademarks of The Apache Software Foundation. All other products or name brands are trademarks of their respective holders, including The Apache Software Foundation.</div></div><div class="footer__cols__col footer__cols__col__logos"><div class=footer__cols__col--group><div class=footer__cols__col__logo><a href=https://github.com/apache/beam><im [...]
\ No newline at end of file
diff --git a/website/generated-content/get-started/index.xml b/website/generated-content/get-started/index.xml
index 618f639b594..c9afd0482c5 100644
--- a/website/generated-content/get-started/index.xml
+++ b/website/generated-content/get-started/index.xml
@@ -4415,11 +4415,11 @@ This serves both as comments and makes your pipeline easier to debug.&lt;/p>
 &lt;/div>
 &lt;h2 id="setup">Setup&lt;/h2>
 &lt;p>Here&amp;rsquo;s a comparison on how to get started both in PySpark and Beam.&lt;/p>
-&lt;div class="table-wrapper">&lt;table>
+&lt;div class="table-wrapper">&lt;table style="width:100%">
 &lt;tr>
-&lt;th>&lt;/th>
-&lt;th>PySpark&lt;/th>
-&lt;th>Beam&lt;/th>
+&lt;th style="width:20%">&lt;/th>
+&lt;th style="width:40%">PySpark&lt;/th>
+&lt;th style="width:40%">Beam&lt;/th>
 &lt;/tr>
 &lt;tr>
 &lt;td>&lt;b>Install&lt;/b>&lt;/td>
@@ -4472,86 +4472,82 @@ This serves both as comments and makes your pipeline easier to debug.&lt;/p>
 &lt;/table>&lt;/div>
 &lt;h2 id="transforms">Transforms&lt;/h2>
 &lt;p>Here are the equivalents of some common transforms in both PySpark and Beam.&lt;/p>
-&lt;div class="table-wrapper">&lt;table>
-&lt;thead>
+&lt;div class="table-wrapper">&lt;table style="width:100%">
 &lt;tr>
-&lt;th>&lt;/th>
-&lt;th>PySpark&lt;/th>
-&lt;th>Beam&lt;/th>
+&lt;th style="width:20%">&lt;/th>
+&lt;th style="width:40%">PySpark&lt;/th>
+&lt;th style="width:40%">Beam&lt;/th>
 &lt;/tr>
-&lt;/thead>
-&lt;tbody>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/elementwise/map/">&lt;strong>Map&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/elementwise/map/">Map&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.map(lambda x: x * 2)&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.Map(lambda x: x * 2)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/elementwise/filter/">&lt;strong>Filter&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/elementwise/filter/">Filter&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.filter(lambda x: x % 2 == 0)&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.Filter(lambda x: x % 2 == 0)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/elementwise/flatmap/">&lt;strong>FlatMap&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/elementwise/flatmap/">FlatMap&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.flatMap(lambda x: range(x))&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.FlatMap(lambda x: range(x))&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/aggregation/groupbykey/">&lt;strong>Group by key&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/aggregation/groupbykey/">Group by key&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>pairs.groupByKey()&lt;/code>&lt;/td>
 &lt;td>&lt;code>pairs | beam.GroupByKey()&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/aggregation/combineglobally/">&lt;strong>Reduce&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/aggregation/combineglobally/">Reduce&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.reduce(lambda x, y: x+y)&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.CombineGlobally(sum)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/aggregation/combineperkey/">&lt;strong>Reduce by key&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/aggregation/combineperkey/">Reduce by key&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>pairs.reduceByKey(lambda x, y: x+y)&lt;/code>&lt;/td>
 &lt;td>&lt;code>pairs | beam.CombinePerKey(sum)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/aggregation/distinct/">&lt;strong>Distinct&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/aggregation/distinct/">Distinct&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.distinct()&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.Distinct()&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/aggregation/count/">&lt;strong>Count&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/aggregation/count/">Count&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.count()&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.combiners.Count.Globally()&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/aggregation/count/">&lt;strong>Count by key&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/aggregation/count/">Count by key&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>pairs.countByKey()&lt;/code>&lt;/td>
 &lt;td>&lt;code>pairs | beam.combiners.Count.PerKey()&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/aggregation/top/">&lt;strong>Take smallest&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/aggregation/top/">Take smallest&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.takeOrdered(3)&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.combiners.Top.Smallest(3)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/aggregation/top/">&lt;strong>Take largest&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/aggregation/top/">Take largest&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.takeOrdered(3, lambda x: -x)&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.combiners.Top.Largest(3)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/aggregation/sample/">&lt;strong>Random sample&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/aggregation/sample/">Random sample&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.takeSample(False, 3)&lt;/code>&lt;/td>
 &lt;td>&lt;code>values | beam.combiners.Sample.FixedSizeGlobally(3)&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/other/flatten/">&lt;strong>Union&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/other/flatten/">Union&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>values.union(otherValues)&lt;/code>&lt;/td>
 &lt;td>&lt;code>(values, otherValues) | beam.Flatten()&lt;/code>&lt;/td>
 &lt;/tr>
 &lt;tr>
-&lt;td>&lt;a href="/documentation/transforms/python/aggregation/cogroupbykey/">&lt;strong>Co-group&lt;/strong>&lt;/a>&lt;/td>
+&lt;td>&lt;b>&lt;a href="/documentation/transforms/python/aggregation/cogroupbykey/">Co-group&lt;/a>&lt;/b>&lt;/td>
 &lt;td>&lt;code>pairs.cogroup(otherPairs)&lt;/code>&lt;/td>
 &lt;td>&lt;code>{'Xs': pairs, 'Ys': otherPairs} | beam.CoGroupByKey()&lt;/code>&lt;/td>
 &lt;/tr>
-&lt;/tbody>
 &lt;/table>&lt;/div>
 &lt;blockquote>
 &lt;p>ℹ️ To learn more about the transforms available in Beam, check the
diff --git a/website/generated-content/sitemap.xml b/website/generated-content/sitemap.xml
index babe8bdfd75..93820975b1a 100644
--- a/website/generated-content/sitemap.xml
+++ b/website/generated-content/sitemap.xml
@@ -1 +1 @@
-<?xml version="1.0" encoding="utf-8" standalone="yes"?><urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9" xmlns:xhtml="http://www.w3.org/1999/xhtml"><url><loc>/case-studies/intuit/</loc><lastmod>2022-08-18T01:27:08+06:00</lastmod></url><url><loc>/blog/go-2.40/</loc><lastmod>2022-07-06T14:03:32-04:00</lastmod></url><url><loc>/categories/blog/</loc><lastmod>2022-07-06T14:03:32-04:00</lastmod></url><url><loc>/blog/</loc><lastmod>2022-07-06T14:03:32-04:00</lastmod></url><url><loc>/c [...]
\ No newline at end of file
+<?xml version="1.0" encoding="utf-8" standalone="yes"?><urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9" xmlns:xhtml="http://www.w3.org/1999/xhtml"><url><loc>/case-studies/intuit/</loc><lastmod>2022-08-18T01:27:08+06:00</lastmod></url><url><loc>/blog/go-2.40/</loc><lastmod>2022-07-06T14:03:32-04:00</lastmod></url><url><loc>/categories/blog/</loc><lastmod>2022-07-06T14:03:32-04:00</lastmod></url><url><loc>/blog/</loc><lastmod>2022-07-06T14:03:32-04:00</lastmod></url><url><loc>/c [...]
\ No newline at end of file