Posted to user@spark.apache.org by tencas <di...@gmail.com> on 2017/04/11 14:35:03 UTC

Spark Streaming. Real-time save data and visualize on dashboard

I've developed an application using Apache Spark Streaming that reads simple
data from plane sensors, such as acceleration, over TCP sockets in JSON
format and analyses it.

I'd like to persist this data for each "flight" in real time while it is
also shown on a responsive dashboard.

I don't know whether it is possible to use a NoSQL database such as MongoDB
or Cassandra together with a monitoring tool such as Grafana or Kibana, or
whether that even makes sense.



--
View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Spark-Streaming-Real-time-save-data-and-visualize-on-dashboard-tp28587.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscribe@spark.apache.org


Re: Spark Streaming. Real-time save data and visualize on dashboard

Posted by Pierce Lamb <ri...@gmail.com>.
Hi,

It is possible to use Mongo or Cassandra to persist results from Spark. In
fact, a wide variety of data stores can be used with Spark, and many are
aimed at serving queries for dashboard visualizations. I cannot comment on
which work well with Grafana or Kibana. However, I've listed (with links)
most of the data stores that have an existing connector or integration
with Spark here:

http://stackoverflow.com/a/39753976/3723346
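
As a rough illustration of the persistence step, here is a minimal Python
sketch of the common "one connection per partition" pattern for writing a
DStream to an external store. It is not taken from any particular connector.
The field names (flight_id, timestamp, acceleration) and the `connect`
factory with its write_many/close methods are assumptions made up for this
example; a real MongoDB or Cassandra client would have to be wrapped behind
that interface, or replaced by the store's own Spark connector.

```python
import json

# Hypothetical field names: the thread only says the sensors send JSON
# containing values such as acceleration.
REQUIRED_FIELDS = ("flight_id", "timestamp", "acceleration")


def parse_reading(line):
    """Parse one JSON line from the socket; return None if it is unusable."""
    try:
        reading = json.loads(line)
    except ValueError:
        return None
    if not isinstance(reading, dict):
        return None
    if any(field not in reading for field in REQUIRED_FIELDS):
        return None
    return reading


def make_partition_writer(connect, batch_size=500):
    """Build a function suitable for RDD.foreachPartition.

    `connect` is a hypothetical zero-argument factory returning an object
    with write_many(list_of_dicts) and close().
    """
    def write_partition(readings):
        sink = connect()  # one connection per partition, opened on the executor
        try:
            batch = []
            for reading in readings:
                batch.append(reading)
                if len(batch) >= batch_size:
                    sink.write_many(batch)
                    batch = []
            if batch:
                sink.write_many(batch)  # flush the final partial batch
        finally:
            sink.close()
    return write_partition


def attach_sink(ssc, host, port, connect):
    """Wire a socket text stream to the sink.

    `ssc` is an already created pyspark.streaming.StreamingContext.
    """
    write_partition = make_partition_writer(connect)
    readings = (ssc.socketTextStream(host, port)
                .map(parse_reading)
                .filter(lambda reading: reading is not None))
    readings.foreachRDD(lambda rdd: rdd.foreachPartition(write_partition))
    return readings
```

Opening the connection inside the partition function avoids having to
serialize a client object from the driver to the executors, and batching the
writes keeps the number of round trips to the store down.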

Hope this helps,

Pierce


Re: Spark Streaming. Real-time save data and visualize on dashboard

Posted by Sam Elamin <hu...@gmail.com>.
Hi

To be honest, there are a variety of options, but it all comes down to who
will be querying these dashboards.

If the end user is an engineer, then the ELK stack is fine, and I can attest
to Kibana's ease of use since I have used it quite heavily.

On the other hand, in my experience it isn't the engineers who are in charge
of reporting. If the end user is a data analyst or data scientist, they are
most comfortable using SQL and would be somewhat averse to learning the
nuances of creating dashboards and using Elasticsearch. Trust me, no matter
how much you try, these folks are more comfortable using SQL and
Tableau-like platforms. So you will have to educate them, not to mention
that any new hire will have to undergo the same training to be productive.

My suggestion for that is to push your data to Google BigQuery
<https://cloud.google.com/bigquery/>. It really is simple to use, and people
can just focus on writing their queries. It also returns within seconds for
queries over terabytes of data. The caveat here is that you are paying per
query, but it's $5 per 1 TB of data queried, which is peanuts really. It's a
managed service, so there are no setup or management costs compared to the
other services. I suppose in the end you are paying to abstract that
knowledge away.
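
To make the pay-per-query caveat concrete, here is a small back-of-the-envelope
sketch in Python. It uses the $5 per TB figure quoted above. The refresh
interval, the amount of data scanned per query, and the 1 TB = 1024 GB
conversion are assumptions chosen only for illustration, so check the current
pricing before relying on the numbers.

```python
GB_PER_TB = 1024  # assumption: binary terabytes


def query_cost_usd(scanned_gb, price_per_tb=5.0):
    """Cost of one query that scans `scanned_gb` gigabytes."""
    return scanned_gb / GB_PER_TB * price_per_tb


def monthly_dashboard_cost_usd(scanned_gb, refreshes_per_day, days=30,
                               price_per_tb=5.0):
    """Cost of a dashboard panel that re-runs the same query on a schedule."""
    return query_cost_usd(scanned_gb, price_per_tb) * refreshes_per_day * days


# Hypothetical panel: scans 2 GB per refresh, refreshed every 5 minutes.
refreshes_per_day = 24 * 60 // 5
monthly = monthly_dashboard_cost_usd(2, refreshes_per_day)
print("refreshes per day:", refreshes_per_day)
print("estimated monthly cost (USD):", monthly)
```

A single query is cheap, but a live dashboard re-runs its queries on every
refresh, so the amount of data scanned per query matters more than the
headline price.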

Happy to answer any questions you might have

Kind Regards
Sam




On Wed, 12 Apr 2017 at 09:36, tencas <di...@gmail.com> wrote:

> Hi Gaurav1809,
>
> I was thinking about using Elasticsearch + Kibana too (I actually don't
> know the difference between ELK and Elasticsearch).
> I was wondering about the pros and cons of using a document indexer vs. a
> NoSQL database.

Re: Spark Streaming. Real-time save data and visualize on dashboard

Posted by tencas <di...@gmail.com>.
Hi Gaurav1809,

I was thinking about using Elasticsearch + Kibana too (I actually don't
know the difference between ELK and Elasticsearch).
I was wondering about the pros and cons of using a document indexer vs. a
NoSQL database.





Re: Spark Streaming. Real-time save data and visualize on dashboard

Posted by Gaurav1809 <ga...@gmail.com>.
Maybe you can ingest your data into ELK and use Kibana for live reporting.
Of course, there may be a better way of doing this. Waiting for others to
share their opinions. Thanks.
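
For anyone wanting a feel for the ingestion side, below is a minimal Python
sketch that turns a batch of sensor readings into the newline-delimited JSON
body expected by the Elasticsearch bulk endpoint (`_bulk`). The index name
and the reading fields are hypothetical, and sending the payload over HTTP is
left out. Older Elasticsearch releases may also expect a `_type` in the
action line, so treat this as a starting point rather than a drop-in
implementation.

```python
import json


def to_bulk_payload(readings, index_name="flight-readings"):
    """Build an Elasticsearch bulk request body from a list of dicts.

    Each document becomes two lines: an action line and the document itself.
    `index_name` is a hypothetical default.
    """
    lines = []
    for reading in readings:
        lines.append(json.dumps({"index": {"_index": index_name}}))
        lines.append(json.dumps(reading))
    if not lines:
        return ""
    # The bulk body must be terminated by a newline.
    return "\n".join(lines) + "\n"


# Hypothetical readings; a date field lets Kibana plot them over time.
sample = [
    {"flight_id": "F1", "@timestamp": "2017-04-11T14:35:03Z",
     "acceleration": 0.5},
    {"flight_id": "F1", "@timestamp": "2017-04-11T14:35:04Z",
     "acceleration": 0.7},
]
print(to_bulk_payload(sample), end="")
```

The payload would be POSTed to the cluster's `_bulk` endpoint with the
`application/x-ndjson` content type, for example from inside a per-partition
write function in the Spark job.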


