You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@samza.apache.org by cr...@apache.org on 2013/08/02 18:55:30 UTC
svn commit: r1509774 - in /incubator/samza/site/startup: download/index.html
hello-samza/0.7.0/index.html
Author: criccomini
Date: Fri Aug 2 16:55:30 2013
New Revision: 1509774
URL: http://svn.apache.org/r1509774
Log:
updating naming in a few spots
Modified:
incubator/samza/site/startup/download/index.html
incubator/samza/site/startup/hello-samza/0.7.0/index.html
Modified: incubator/samza/site/startup/download/index.html
URL: http://svn.apache.org/viewvc/incubator/samza/site/startup/download/index.html?rev=1509774&r1=1509773&r2=1509774&view=diff
==============================================================================
--- incubator/samza/site/startup/download/index.html (original)
+++ incubator/samza/site/startup/download/index.html Fri Aug 2 16:55:30 2013
@@ -65,8 +65,6 @@
<div class="body">
<h2>Download</h2>
-<!-- TODO update github link when we have an apache repo. -->
-
<!-- TODO update maven dependency versions appropriately -->
<p>If you want to play around with Samza for the first time, go to <a href="/startup/hello-samza/0.7.0">Hello Samza</a>.</p>
@@ -81,32 +79,32 @@ All Samza JARs are published through Mav
A Samza project that runs with Kafka and YARN should depend on the following artifacts.
<dependency>
- <groupId>samsa</groupId>
- <artifactId>samsa-api</artifactId>
+ <groupId>samza</groupId>
+ <artifactId>samza-api</artifactId>
<version>0.7.0</version>
</dependency>
<dependency>
- <groupId>samsa</groupId>
- <artifactId>samsa-core_2.9.2</artifactId>
+ <groupId>samza</groupId>
+ <artifactId>samza-core_2.9.2</artifactId>
<version>0.7.0</version>
<scope>runtime</scope>
</dependency>
<dependency>
- <groupId>samsa</groupId>
- <artifactId>samsa-serializers_2.9.2</artifactId>
+ <groupId>samza</groupId>
+ <artifactId>samza-serializers_2.9.2</artifactId>
<version>0.7.0</version>
<scope>runtime</scope>
</dependency>
<dependency>
- <groupId>samsa</groupId>
- <artifactId>samsa-yarn_2.9.2</artifactId>
+ <groupId>samza</groupId>
+ <artifactId>samza-yarn_2.9.2</artifactId>
<version>0.7.0</version>
<classifier>yarn-2.0.5-alpha</classifier>
<scope>runtime</scope>
</dependency>
<dependency>
- <groupId>samsa</groupId>
- <artifactId>samsa-kafka_2.9.2</artifactId>
+ <groupId>samza</groupId>
+ <artifactId>samza-kafka_2.9.2</artifactId>
<version>0.7.0</version>
<scope>runtime</scope>
</dependency>
Modified: incubator/samza/site/startup/hello-samza/0.7.0/index.html
URL: http://svn.apache.org/viewvc/incubator/samza/site/startup/hello-samza/0.7.0/index.html?rev=1509774&r1=1509773&r2=1509774&view=diff
==============================================================================
--- incubator/samza/site/startup/hello-samza/0.7.0/index.html (original)
+++ incubator/samza/site/startup/hello-samza/0.7.0/index.html Fri Aug 2 16:55:30 2013
@@ -65,9 +65,7 @@
<div class="body">
<h2>Hello Samza</h2>
-<!-- TODO Update github link when we have am apache project. -->
-
-<!-- TODO Rename samsa to samza in package name, when we do the project rename. -->
+<!-- TODO Make github project public when code is up. -->
<p>The <a href="http://github.com/linkedin/hello-samza">hello-samza</a> project is a stand-alone project designed to help you run your first Samza job.</p>
@@ -97,7 +95,7 @@ tar -xvf ./samza-job-package/target/samz
<h3>Run a Samza Job</h3>
<p>After you've built your Samza package, you can start a job on the grid using the run-job.sh script.</p>
-<div class="highlight"><pre><code class="text language-text" data-lang="text">deploy/samza/bin/run-job.sh --config-factory=samsa.config.factories.PropertiesConfigFactory --config-path=file://$PWD/deploy/samza/config/wikipedia-feed.properties
+<div class="highlight"><pre><code class="text language-text" data-lang="text">deploy/samza/bin/run-job.sh --config-factory=samza.config.factories.PropertiesConfigFactory --config-path=file://$PWD/deploy/samza/config/wikipedia-feed.properties
</code></pre></div>
<p>The job will consume a feed of real-time edits from Wikipedia, and produce them to a Kafka topic called "wikipedia-raw". Give the job a minute to startup, and then tail the Kafka topic:</p>
<div class="highlight"><pre><code class="text language-text" data-lang="text">deploy/kafka/bin/kafka-console-consumer.sh --zookeeper localhost:2181 --topic wikipedia-raw
@@ -107,8 +105,8 @@ tar -xvf ./samza-job-package/target/samz
<h3>Generate Wikipedia Statistics</h3>
<p>Let's calculate some statistics based on the messages in the wikipedia-raw topic. Start two more jobs:</p>
-<div class="highlight"><pre><code class="text language-text" data-lang="text">deploy/samza/bin/run-job.sh --config-factory=samsa.config.factories.PropertiesConfigFactory --config-path=file://$PWD/deploy/samza/config/wikipedia-parser.properties
-deploy/samza/bin/run-job.sh --config-factory=samsa.config.factories.PropertiesConfigFactory --config-path=file://$PWD/deploy/samza/config/wikipedia-stats.properties
+<div class="highlight"><pre><code class="text language-text" data-lang="text">deploy/samza/bin/run-job.sh --config-factory=samza.config.factories.PropertiesConfigFactory --config-path=file://$PWD/deploy/samza/config/wikipedia-parser.properties
+deploy/samza/bin/run-job.sh --config-factory=samza.config.factories.PropertiesConfigFactory --config-path=file://$PWD/deploy/samza/config/wikipedia-stats.properties
</code></pre></div>
<p>The first job (wikipedia-parser) parses the messages in wikipedia-raw, and extracts information about the size of the edit, who made the change, etc. You can take a look at its output with:</p>
<div class="highlight"><pre><code class="text language-text" data-lang="text">deploy/kafka/bin/kafka-console-consumer.sh --zookeeper localhost:2181 --topic wikipedia-edits