You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@ignite.apache.org by dkarachentsev <dk...@gridgain.com> on 2016/12/12 13:00:55 UTC

Re: How can I save the job result which was ran on server which lost the connection.

Hi,

Try job checkpointing [1].

[1] https://apacheignite.readme.io/docs/checkpointing



--
View this message in context: http://apache-ignite-users.70518.x6.nabble.com/How-can-I-save-the-job-result-which-was-ran-on-server-which-lost-the-connection-tp9462p9479.html
Sent from the Apache Ignite Users mailing list archive at Nabble.com.

Re: How can I save the job result which was ran on server which lost the connection.

Posted by vkulichenko <va...@gmail.com>.




--
View this message in context: http://apache-ignite-users.70518.x6.nabble.com/How-can-I-save-the-job-result-which-was-ran-on-server-which-lost-the-connection-tp9462p9610.html
Sent from the Apache Ignite Users mailing list archive at Nabble.com.

Re: How can I save the job result which was ran on server which lost the connection.

Posted by vkulichenko <va...@gmail.com>.

Hi,

Please properly subscribe to the mailing list so that the community can
receive email notifications for your messages. To subscribe, send empty
email to user-subscribe@ignite.apache.org and follow simple instructions in
the reply.


vahan wrote
> Imagine we have cluster of nodes with replicated cache. Each node is
> writing something to that cache.
> Then suddenly the connection of some node is lost, that node is still
> running and continuing to write a data to his own copy of the cache. When
> connection will back to online again what would happen? Will the locally
> written data be synched up with other nodes' cashe (will the information
> written during connection lost available in the replicated cashe) ?

Ignite does not allow to merge clusters after split-brain, because this will
most likely cause data inconsistency. You will end up with two separate
clusters and will have to manually stop and restart one of them. It will
then join other cluster (assuming that network is restored) and rebalance
the data. Updates happened on the stopped cluster within this period of time
will be lost.

BTW, GridGain provides better support for segmentation handling as a part of
their paid solution [1]. Basically, they allow to detect segmentation
automatically and stop/restart one of the clusters right away, so there are
no time frame with two clusters and therefore no lost updates.

[1] https://gridgain.readme.io/docs/network-segmentation

-Val



--
View this message in context: http://apache-ignite-users.70518.x6.nabble.com/How-can-I-save-the-job-result-which-was-ran-on-server-which-lost-the-connection-tp9462p9612.html
Sent from the Apache Ignite Users mailing list archive at Nabble.com.

Re: How can I save the job result which was ran on server which lost the connection.

Posted by Pavel Tupitsyn <pt...@apache.org>.

There is no checkpointing support in .NET yet, but we certainly have plans
for it:
https://issues.apache.org/jira/browse/IGNITE-4427

On Wed, Dec 14, 2016 at 1:42 AM, vkulichenko <va...@gmail.com>
wrote:

> No, unfortunately not. Probably someone closer to this topic can tell if
> there are any plans to add this support.
>
> -Val
>
>
>
> --
> View this message in context: http://apache-ignite-users.
> 70518.x6.nabble.com/How-can-I-save-the-job-result-which-was-
> ran-on-server-which-lost-the-connection-tp9462p9514.html
> Sent from the Apache Ignite Users mailing list archive at Nabble.com.
>

Re: How can I save the job result which was ran on server which lost the connection.

Posted by vkulichenko <va...@gmail.com>.

No, unfortunately not. Probably someone closer to this topic can tell if
there are any plans to add this support.

-Val



--
View this message in context: http://apache-ignite-users.70518.x6.nabble.com/How-can-I-save-the-job-result-which-was-ran-on-server-which-lost-the-connection-tp9462p9514.html
Sent from the Apache Ignite Users mailing list archive at Nabble.com.