Posted to user@ignite.apache.org by Chris Berry <ch...@gmail.com> on 2016/12/20 16:48:36 UTC

Ignite Design questions

Greetings,

I am hoping for some design advice. I am new to Ignite and don’t want to start off on the wrong foot.

At my day job we have a “Rates Engine”. It computes the current Rate for a particular User request. It is a high volume/low latency system. And currently it lives within a Java Servlet Engine (jetty), but is moving to Netty.

This is how it works, from 50,000 ft:

In the Foreground:

User X requests a Rate (current price) for Product Y.  (Actually it is for Products[X,Y,Z,…n], where n <= 1000)
The request is routed to “Shard P” based on Y.
The Rate is computed based on the cached RateInfo (essentially, a concurrent HashMap) for Y, found in Shard P. The computation is relatively expensive CPU-wise.
The Rate is returned to User X. 


In the Background:

The system listens to “Rate change events” for all Products.
Rate changes are read and are aggregated over some period of time. 
Periodically, the appropriate cached RateInfo is updated, as required, in the appropriate Shard.

Simple. And it works well.

Except that the system relies on static host bindings, and it is showing its age.
Moreover, we are moving all of this to AWS.

So, since we have to refactor anyway, we decided to look at better ways. And Ignite seems a perfect fit.
Ignite clustering, sharding, and replication are all quite appealing.

So my question. How would y’all model this system?

I would like to use Ignite as a standalone cluster (not in-memory within the Rates Engine),
basically dividing the system into its component pieces: a “Rates Engine”, a Distributed HashMap, and a “Rates Injector”.
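To make that concrete, here is roughly the cache configuration I am imagining (a sketch only; “rates” and RateInfo are our own names, and I may well be holding the Ignite API wrong):

```java
// Partitioned cache for Rates, with one backup copy per partition.
CacheConfiguration<String, RateInfo> rates = new CacheConfiguration<>("rates");
rates.setCacheMode(CacheMode.PARTITIONED);          // shard data across the cluster
rates.setBackups(1);                                // keep one replica of each partition
rates.setAtomicityMode(CacheAtomicityMode.ATOMIC);  // no transactions needed for read-mostly Rates
```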

The things I find most puzzling are:

*  How does a Clustered & Partitioned IgniteCache handle a `cache.getAll(Set<> keys)` behind the scenes ??
Will it transparently fan those requests out to the appropriate Partition for me (from the Client – or is it a “double hop” — once to the Cluster, another to the appropriate shard)??
How does it handle individual failures?? I can’t seem to find much detail about how all of the internal routing is handled in Ignite.

* There is currently ~30GB of Rates cached. What is an optimal “shard size”? I’ve found in other sharded systems that shard <= 5GB is a good operational rule of thumb: when data has to start moving about for maintenance, etc., anything larger than 5GB is cumbersome. Is this true for Ignite? I am trying to figure out how big a cluster to build out.

* Would you build your own “artificial sharding” on top of the Rates? In other words: apply “affinity collocation” using, say, Geo info; something/anything so that the Rates collocate by some auxiliary key and form reasonably sized shards?? This is probably a given.

* Is it possible to configure Ignite to, essentially, “always serve data”, even if it split-brains?? In other words: since my RatesEngine is read-only and really doesn’t care (Rates are assumed to be eventually consistent), it will be happy as long as it can find Rates. Only the Rates Injector is really concerned about fidelity, and it may blow up. And as we piece the world back together, the Rates Engine can keep chugging along somehow. Does that make sense?? Perhaps another way to ask this is: what are the failure scenarios for an Ignite Cluster??

Sorry if these questions are too broad. I am a total newbie.
I am looking to start a POC, and need a place to begin.

Thanks much,
-- Chris

Re: Ignite Design questions

Posted by vkulichenko <va...@gmail.com>.
Hi Chris,

See my comments below...


Chris Berry wrote
> *  How does a Clustered & Partitioned IgniteCache handle a
> `cache.getAll(Set<> keys)` behind the scenes ??
> Will it transparently fan those requests out to the appropriate Partition
> for me (from the Client – or is it a “double hop” — once to the Cluster,
> another to the appropriate shard)??
> How does it handle individual failures?? I can’t seem to find much detail
> about how all of the internal routing is handled in Ignite.

Ignite automatically routes any request to the proper node. getAll() will be
split into several requests accordingly, i.e. if the set of keys includes 3
subsets routed to 3 different nodes, there will be 3 parallel round trips. In
case of failure you can get a CachePartialUpdateException with the list of
keys that failed (note that this is true for an ATOMIC cache only; in
case of a TRANSACTIONAL cache the transaction will either be committed or rolled
back, i.e. partial failure is not possible).
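The fan-out can be pictured with a small plain-Java sketch. This is not Ignite's actual code: Ignite uses a rendezvous affinity function, whereas this illustration stands in a simple floorMod mapping; the names (GetAllFanOut, groupByNode, PARTITIONS) are all made up for the example. The point is just that the client groups keys by owning node and issues one batched request per group, in parallel.

```java
import java.util.*;
import java.util.stream.Collectors;

public class GetAllFanOut {
    static final int PARTITIONS = 1024;

    // Map a key to a partition. Ignite's real affinity function is a
    // rendezvous hash; floorMod is used here only for illustration.
    static int partition(Object key) {
        return Math.floorMod(key.hashCode(), PARTITIONS);
    }

    // Group keys by the node that owns their partition. Each group
    // becomes one batched request, and the requests run in parallel.
    static Map<Integer, List<String>> groupByNode(Collection<String> keys, int nodeCount) {
        return keys.stream()
            .collect(Collectors.groupingBy(k -> partition(k) % nodeCount));
    }

    public static void main(String[] args) {
        Map<Integer, List<String>> batches =
            groupByNode(List.of("rateA", "rateB", "rateC", "rateD"), 3);
        // One parallel round trip per entry in 'batches'.
        System.out.println("round trips: " + batches.size());
    }
}
```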


Chris Berry wrote
> * Would you build your own “artificial sharding” on top of the Rates. In
> other words; apply “affinity collocation” and use say Geo info.
> Something/anything so that the Rates collocate by some auxiliary key. That
> builds reasonably sized shards?? This is probably a given.

Hard to tell, this depends on how your data model is organized. Collocation
is generally used to minimize network traffic. For example, if there is a
computation that involves multiple entries that are related to each other,
it's a good idea to have them all on one node.
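As one possible shape for such a key (purely hypothetical; RateKey and its fields are invented for this sketch): in Ignite you would mark the collocation field with @AffinityKeyMapped so only it feeds the affinity function. The plain-Java version below shows the intended effect, that all rates sharing a region map to the same partition, and therefore the same node.

```java
import java.util.Objects;

/** Hypothetical composite key: rates collocate by region. With Ignite you
 *  would annotate 'region' with @AffinityKeyMapped; here the effect is
 *  shown by deriving the partition from the region alone. */
public final class RateKey {
    final String productId; // identifies the individual rate
    final String region;    // affinity component: all rates for a region collocate

    RateKey(String productId, String region) {
        this.productId = productId;
        this.region = region;
    }

    /** Partition depends only on the region, so two rates in the same
     *  region always land in the same partition (and thus same node). */
    int partition(int partitions) {
        return Math.floorMod(region.hashCode(), partitions);
    }

    @Override public boolean equals(Object o) {
        if (!(o instanceof RateKey)) return false;
        RateKey r = (RateKey) o;
        return productId.equals(r.productId) && region.equals(r.region);
    }

    @Override public int hashCode() { return Objects.hash(productId, region); }
}
```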


Chris Berry wrote
> * Is it possible to configure Ignite to, essentially, “always serve data”.
> E.g. even it if split-brains?? In other words, since my RatesEngine is
> read-only, and really doesn’t care (Rates are assumed to be eventually
> consistent) – as long as it can find Rates it will be happy. Only the
> Rates Injector is really concerned about fidelity. And may blow up. And as
> we piece back together the world, the Rates Engine can keep chugging along
> somehow. Does that make sense?? Perhaps another way to ask this is, what
> are the failure scenarios for an Ignite Cluster??

In case of segmentation you will have two separate clusters, and clients will
be able to connect to either of them. For a read-only workload this is probably not
a big problem, but in any case Ignite does not support merging clusters back together;
you will have to manually restart one of them.

-Val


