You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@fluo.apache.org by mw...@apache.org on 2018/03/13 21:07:56 UTC

[fluo-website] branch gh-pages updated: Added troubleshooting documentation (#142)

This is an automated email from the ASF dual-hosted git repository.

mwalch pushed a commit to branch gh-pages
in repository https://gitbox.apache.org/repos/asf/fluo-website.git


The following commit(s) were added to refs/heads/gh-pages by this push:
     new 16b6f7f  Added troubleshooting documentation (#142)
16b6f7f is described below

commit 16b6f7f71ea0acd03dac6001a99c717bd2f2e78f
Author: Mike Walch <mw...@apache.org>
AuthorDate: Tue Mar 13 17:07:54 2018 -0400

    Added troubleshooting documentation (#142)
---
 _fluo-1-2/administration/troubleshooting.md | 56 +++++++++++++++++++++++++++++
 1 file changed, 56 insertions(+)

diff --git a/_fluo-1-2/administration/troubleshooting.md b/_fluo-1-2/administration/troubleshooting.md
new file mode 100644
index 0000000..47dd5e4
--- /dev/null
+++ b/_fluo-1-2/administration/troubleshooting.md
@@ -0,0 +1,56 @@
+---
+title: Troubleshooting
+category: administration
+order: 7
+---
+
+Steps for troubleshooting problems with Fluo applications.
+
+## Fluo application stops processing data
+
+1. Confirm that your application is running with the expected number of workers. 
+    ```bash
+    $ fluo list
+    Fluo instance (localhost/fluo) contains 1 application(s)
+
+    Application     Status     # Workers
+    -----------     ------     ---------
+    webindex        RUNNING        3
+    ```
+   Look for errors in the logs of any oracle or worker that has died.
+
+1. Run the `fluo wait` command to see if you application is processing notifications. 
+    ```bash
+    $ fluo wait -a webindex
+    [command.FluoWait] INFO : The wait command will exit when all notifications are processed
+    [command.FluoWait] INFO : 140 notifications are still outstanding.  Will try again in 10 seconds...
+    [command.FluoWait] INFO : 140 notifications are still outstanding.  Will try again in 10 seconds...
+    [command.FluoWait] INFO : 140 notifications are still outstanding.  Will try again in 10 seconds...
+    [command.FluoWait] INFO : 96 notifications are still outstanding.  Will try again in 10 seconds...
+    [command.FluoWait] INFO : 70 notifications are still outstanding.  Will try again in 10 seconds...
+    [command.FluoWait] INFO : 31 notifications are still outstanding.  Will try again in 10 seconds...
+    [command.FluoWait] INFO : All processing has finished!
+    ```
+   The number of notifications will increase as data is added to the application but they should eventually decrease
+   to zero and processing should finish.
+
+1. Look for errors or exceptions in the logs of all oracle and worker processes. Processing can stop if all threads
+   in a worker process were consumed by exceptions thrown in Fluo application's observer code. These exceptions
+   are often due to parsing issues or corner cases not seen during development or using small data sets.
+
+1. If you are using a cluster manager (i.e Marathon, YARN etc) to run your Fluo application, look for errors in the logs of
+   your cluster manager or application manager.  Below are some common errors: 
+
+    * Cluster managers sometimes fail to start all process of Fluo application due to lack of container slots or resources (CPU, memory, etc).
+      This can be fixed by giving more resources to your cluster manager or decrease the number/resources of Fluo workers.
+    * Cluster managers can kill Fluo processes if they use too much memory. This can be fixed by allocating more memory to your workers.
+
+1. Run [jstack] to get stack traces of threads in your Fluo application processes and look for any stuck threads.
+
+1. Consider configuring your Fluo application to [report metrics][metrics] so that they are viewable in Grafana/InfluxDB. Metrics
+   can are helpfu in debugging performance issues.
+
+If you are still having trouble, feel free to email `dev@fluo.apache.org` for help.
+
+[jstack]: https://docs.oracle.com/javase/8/docs/technotes/tools/unix/jstack.html
+[metrics]: {{ page.docs_base }}/administration/metrics

-- 
To stop receiving notification emails like this one, please contact
mwalch@apache.org.