You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@ignite.apache.org by Dmitriy Setrakyan <ds...@gridgain.com> on 2015/04/13 04:05:58 UTC

TTL-based expirations

Guys,

I have been playing with TTL-based expirations for streaming and am
noticing that often the TTL thread cannot cope with the load a
data-streamer can generate.

Our current approach for TTL is implemented with a thread that keeps all
cache entries in sorted-by-expire-time order and scans this ordered list
every time something expires. The scan is optimized, such that once an
entry that does not need to expire yet is touched, the thread goes into
wait mode until TTL time for that entry expires. However even with this
optimization, this thread cannot expire the entries fast enough.

I think if we handle TTL expirations the same way as we handle evictions,
through eviction policies, this problem would go away, because all the
threads that add entries to eviction queue are also responsible to evict
the excess entries as well.

I have created a parent ticket for this issue:
https://issues.apache.org/jira/browse/IGNITE-729

Would be nice to implement them in the next couple of weeks.

D.

Re: TTL-based expirations

Posted by Yakov Zhdanov <yz...@apache.org>.

Arti, please take a look here -
org.apache.ignite.internal.processors.cache.GridCacheEvictionManager. Pay
attention
to org.apache.ignite.internal.processors.cache.GridCacheEvictionManager#notifyPolicy

Thanks!

--Yakov

2015-04-14 20:34 GMT+03:00 Atri Sharma <at...@gmail.com>:

> I agree with you on the thrashing point.rethinking through the idea I
> agree.however, I am imagining a lot of threads blocked on data eviction
> which might be not good for scalable performance (this is an unrelated
> problem but might be relevant).
>
> Can you please guide on code pointers which handle this part please?
>
> I will add a comment.
> On 14 Apr 2015 22:53, "Dmitriy Setrakyan" <ds...@apache.org> wrote:
>
> > Thanks Atri,
> >
> > Interesting algorithm, reminds me of LIRS in some way. I still, however,
> > don't see how it fixes thrashing. The clock-sweep thread still can get
> > behind the loading threads, and we will still end up with the same
> problem.
> >
> > In my opinion the only way to handle it is to have the loading threads
> > themselves do the clean up. This way they won't move forward until the
> data
> > is evicted. The algorithm you suggested may be used in some ways by the
> > cleanup implementation though. Can you please add a comment in
> IGNITE-729?
> >
> > D.
> >
> > On Mon, Apr 13, 2015 at 3:12 AM, Atri Sharma <at...@gmail.com>
> wrote:
> >
> > > Clocksweep is essentially a two chance algorithm. What happens is that
> a
> > > counter is attached to each buffer page and initialized to a suitable
> > value
> > > (take 1 for eg). Now when an eviction needs to happen, a scan is done
> > > through the cache looking for any guy with 0 count. The essential part
> is
> > > that while the scan happens, the counter for *each* page is decremented
> > by
> > > one. The first page with counter 0 is evicted.
> > >
> > > Each time a page is hit, the counter is incremented. We can have a
> limit
> > > for the counter count.
> > >
> > > The idea is that popular pages are kept in the cache and there is no
> need
> > > to sort according to anything or manage expiration explicitly. All we
> > need
> > > to do is do a scan and decrement, and pull the one with 0 count.
> > >
> > > On Mon, Apr 13, 2015 at 3:33 PM, Dmitriy Setrakyan <
> > > dsetrakyan@gridgain.com>
> > > wrote:
> > >
> > > > On Mon, Apr 13, 2015 at 2:52 AM, Atri Sharma <at...@gmail.com>
> > > wrote:
> > > >
> > > > > I would feel that this problem might also be mitigated a bit if we
> > > used a
> > > > > different algorithm than simple LRU (I assume that is what we use).
> > > > > Something like Clocksweep...
> > > > >
> > > >
> > > > I am not too familiar with Clocksweep. Is this something like this?
> > > >
> > > >
> > >
> >
> https://www.usenix.org/legacy/event/usenix05/tech/general/full_papers/jiang/jiang_html/html.html
> > > >
> > > >
> > > >
> > > > >
> > > > > Thoughts?
> > > > >
> > > > > On Mon, Apr 13, 2015 at 3:16 PM, Dmitriy Setrakyan <
> > > > > dsetrakyan@gridgain.com>
> > > > > wrote:
> > > > >
> > > > > > Yes, it is possible for the system to thrash if the data is
> > composed
> > > > only
> > > > > > of unique keys being added and the only parameter to control
> cache
> > > size
> > > > > is
> > > > > > the time-based window. In this case cache can grow indefinitely
> > > because
> > > > > the
> > > > > > TTL thread cannot cope with the load. If you, in addition to TTL
> > also
> > > > > > configure an eviction policy, e.g. FifoEvictionPolicy, then there
> > is
> > > > not
> > > > > > thrashing.
> > > > > >
> > > > > > That's why I suggested that TTL expiration is also implemented
> > > through
> > > > > > eviction policies.
> > > > > >
> > > > > > D.
> > > > > >
> > > > > > On Sun, Apr 12, 2015 at 10:41 PM, Atri Sharma <
> atri.jiit@gmail.com
> > >
> > > > > wrote:
> > > > > >
> > > > > > > Out of curiosity, did we never face thrashing with this
> approach?
> > > > > > (assuming
> > > > > > > wild workloads that hit the same page again and again)
> > > > > > >
> > > > > > > On Mon, Apr 13, 2015 at 7:35 AM, Dmitriy Setrakyan <
> > > > > > > dsetrakyan@gridgain.com>
> > > > > > > wrote:
> > > > > > >
> > > > > > > > Guys,
> > > > > > > >
> > > > > > > > I have been playing with TTL-based expirations for streaming
> > and
> > > am
> > > > > > > > noticing that often the TTL thread cannot cope with the load
> a
> > > > > > > > data-streamer can generate.
> > > > > > > >
> > > > > > > > Our current approach for TTL is implemented with a thread
> that
> > > > keeps
> > > > > > all
> > > > > > > > cache entries in sorted-by-expire-time order and scans this
> > > ordered
> > > > > > list
> > > > > > > > every time something expires. The scan is optimized, such
> that
> > > once
> > > > > an
> > > > > > > > entry that does not need to expire yet is touched, the thread
> > > goes
> > > > > into
> > > > > > > > wait mode until TTL time for that entry expires. However even
> > > with
> > > > > this
> > > > > > > > optimization, this thread cannot expire the entries fast
> > enough.
> > > > > > > >
> > > > > > > > I think if we handle TTL expirations the same way as we
> handle
> > > > > > evictions,
> > > > > > > > through eviction policies, this problem would go away,
> because
> > > all
> > > > > the
> > > > > > > > threads that add entries to eviction queue are also
> responsible
> > > to
> > > > > > evict
> > > > > > > > the excess entries as well.
> > > > > > > >
> > > > > > > > I have created a parent ticket for this issue:
> > > > > > > > https://issues.apache.org/jira/browse/IGNITE-729
> > > > > > > >
> > > > > > > > Would be nice to implement them in the next couple of weeks.
> > > > > > > >
> > > > > > > > D.
> > > > > > > >
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Regards,
> > > > > > >
> > > > > > > Atri
> > > > > > > *l'apprenant*
> > > > > > >
> > > > > >
> > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Regards,
> > > > >
> > > > > Atri
> > > > > *l'apprenant*
> > > > >
> > > >
> > >
> > >
> > >
> > > --
> > > Regards,
> > >
> > > Atri
> > > *l'apprenant*
> > >
> >
>

Re: TTL-based expirations

Posted by Atri Sharma <at...@gmail.com>.

I agree with you on the thrashing point.rethinking through the idea I
agree.however, I am imagining a lot of threads blocked on data eviction
which might be not good for scalable performance (this is an unrelated
problem but might be relevant).

Can you please guide on code pointers which handle this part please?

I will add a comment.
On 14 Apr 2015 22:53, "Dmitriy Setrakyan" <ds...@apache.org> wrote:

> Thanks Atri,
>
> Interesting algorithm, reminds me of LIRS in some way. I still, however,
> don't see how it fixes thrashing. The clock-sweep thread still can get
> behind the loading threads, and we will still end up with the same problem.
>
> In my opinion the only way to handle it is to have the loading threads
> themselves do the clean up. This way they won't move forward until the data
> is evicted. The algorithm you suggested may be used in some ways by the
> cleanup implementation though. Can you please add a comment in IGNITE-729?
>
> D.
>
> On Mon, Apr 13, 2015 at 3:12 AM, Atri Sharma <at...@gmail.com> wrote:
>
> > Clocksweep is essentially a two chance algorithm. What happens is that a
> > counter is attached to each buffer page and initialized to a suitable
> value
> > (take 1 for eg). Now when an eviction needs to happen, a scan is done
> > through the cache looking for any guy with 0 count. The essential part is
> > that while the scan happens, the counter for *each* page is decremented
> by
> > one. The first page with counter 0 is evicted.
> >
> > Each time a page is hit, the counter is incremented. We can have a limit
> > for the counter count.
> >
> > The idea is that popular pages are kept in the cache and there is no need
> > to sort according to anything or manage expiration explicitly. All we
> need
> > to do is do a scan and decrement, and pull the one with 0 count.
> >
> > On Mon, Apr 13, 2015 at 3:33 PM, Dmitriy Setrakyan <
> > dsetrakyan@gridgain.com>
> > wrote:
> >
> > > On Mon, Apr 13, 2015 at 2:52 AM, Atri Sharma <at...@gmail.com>
> > wrote:
> > >
> > > > I would feel that this problem might also be mitigated a bit if we
> > used a
> > > > different algorithm than simple LRU (I assume that is what we use).
> > > > Something like Clocksweep...
> > > >
> > >
> > > I am not too familiar with Clocksweep. Is this something like this?
> > >
> > >
> >
> https://www.usenix.org/legacy/event/usenix05/tech/general/full_papers/jiang/jiang_html/html.html
> > >
> > >
> > >
> > > >
> > > > Thoughts?
> > > >
> > > > On Mon, Apr 13, 2015 at 3:16 PM, Dmitriy Setrakyan <
> > > > dsetrakyan@gridgain.com>
> > > > wrote:
> > > >
> > > > > Yes, it is possible for the system to thrash if the data is
> composed
> > > only
> > > > > of unique keys being added and the only parameter to control cache
> > size
> > > > is
> > > > > the time-based window. In this case cache can grow indefinitely
> > because
> > > > the
> > > > > TTL thread cannot cope with the load. If you, in addition to TTL
> also
> > > > > configure an eviction policy, e.g. FifoEvictionPolicy, then there
> is
> > > not
> > > > > thrashing.
> > > > >
> > > > > That's why I suggested that TTL expiration is also implemented
> > through
> > > > > eviction policies.
> > > > >
> > > > > D.
> > > > >
> > > > > On Sun, Apr 12, 2015 at 10:41 PM, Atri Sharma <atri.jiit@gmail.com
> >
> > > > wrote:
> > > > >
> > > > > > Out of curiosity, did we never face thrashing with this approach?
> > > > > (assuming
> > > > > > wild workloads that hit the same page again and again)
> > > > > >
> > > > > > On Mon, Apr 13, 2015 at 7:35 AM, Dmitriy Setrakyan <
> > > > > > dsetrakyan@gridgain.com>
> > > > > > wrote:
> > > > > >
> > > > > > > Guys,
> > > > > > >
> > > > > > > I have been playing with TTL-based expirations for streaming
> and
> > am
> > > > > > > noticing that often the TTL thread cannot cope with the load a
> > > > > > > data-streamer can generate.
> > > > > > >
> > > > > > > Our current approach for TTL is implemented with a thread that
> > > keeps
> > > > > all
> > > > > > > cache entries in sorted-by-expire-time order and scans this
> > ordered
> > > > > list
> > > > > > > every time something expires. The scan is optimized, such that
> > once
> > > > an
> > > > > > > entry that does not need to expire yet is touched, the thread
> > goes
> > > > into
> > > > > > > wait mode until TTL time for that entry expires. However even
> > with
> > > > this
> > > > > > > optimization, this thread cannot expire the entries fast
> enough.
> > > > > > >
> > > > > > > I think if we handle TTL expirations the same way as we handle
> > > > > evictions,
> > > > > > > through eviction policies, this problem would go away, because
> > all
> > > > the
> > > > > > > threads that add entries to eviction queue are also responsible
> > to
> > > > > evict
> > > > > > > the excess entries as well.
> > > > > > >
> > > > > > > I have created a parent ticket for this issue:
> > > > > > > https://issues.apache.org/jira/browse/IGNITE-729
> > > > > > >
> > > > > > > Would be nice to implement them in the next couple of weeks.
> > > > > > >
> > > > > > > D.
> > > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Regards,
> > > > > >
> > > > > > Atri
> > > > > > *l'apprenant*
> > > > > >
> > > > >
> > > >
> > > >
> > > >
> > > > --
> > > > Regards,
> > > >
> > > > Atri
> > > > *l'apprenant*
> > > >
> > >
> >
> >
> >
> > --
> > Regards,
> >
> > Atri
> > *l'apprenant*
> >
>

Re: TTL-based expirations

Posted by Dmitriy Setrakyan <ds...@apache.org>.

Thanks Atri,

Interesting algorithm, reminds me of LIRS in some way. I still, however,
don't see how it fixes thrashing. The clock-sweep thread still can get
behind the loading threads, and we will still end up with the same problem.

In my opinion the only way to handle it is to have the loading threads
themselves do the clean up. This way they won't move forward until the data
is evicted. The algorithm you suggested may be used in some ways by the
cleanup implementation though. Can you please add a comment in IGNITE-729?

D.

On Mon, Apr 13, 2015 at 3:12 AM, Atri Sharma <at...@gmail.com> wrote:

> Clocksweep is essentially a two chance algorithm. What happens is that a
> counter is attached to each buffer page and initialized to a suitable value
> (take 1 for eg). Now when an eviction needs to happen, a scan is done
> through the cache looking for any guy with 0 count. The essential part is
> that while the scan happens, the counter for *each* page is decremented by
> one. The first page with counter 0 is evicted.
>
> Each time a page is hit, the counter is incremented. We can have a limit
> for the counter count.
>
> The idea is that popular pages are kept in the cache and there is no need
> to sort according to anything or manage expiration explicitly. All we need
> to do is do a scan and decrement, and pull the one with 0 count.
>
> On Mon, Apr 13, 2015 at 3:33 PM, Dmitriy Setrakyan <
> dsetrakyan@gridgain.com>
> wrote:
>
> > On Mon, Apr 13, 2015 at 2:52 AM, Atri Sharma <at...@gmail.com>
> wrote:
> >
> > > I would feel that this problem might also be mitigated a bit if we
> used a
> > > different algorithm than simple LRU (I assume that is what we use).
> > > Something like Clocksweep...
> > >
> >
> > I am not too familiar with Clocksweep. Is this something like this?
> >
> >
> https://www.usenix.org/legacy/event/usenix05/tech/general/full_papers/jiang/jiang_html/html.html
> >
> >
> >
> > >
> > > Thoughts?
> > >
> > > On Mon, Apr 13, 2015 at 3:16 PM, Dmitriy Setrakyan <
> > > dsetrakyan@gridgain.com>
> > > wrote:
> > >
> > > > Yes, it is possible for the system to thrash if the data is composed
> > only
> > > > of unique keys being added and the only parameter to control cache
> size
> > > is
> > > > the time-based window. In this case cache can grow indefinitely
> because
> > > the
> > > > TTL thread cannot cope with the load. If you, in addition to TTL also
> > > > configure an eviction policy, e.g. FifoEvictionPolicy, then there is
> > not
> > > > thrashing.
> > > >
> > > > That's why I suggested that TTL expiration is also implemented
> through
> > > > eviction policies.
> > > >
> > > > D.
> > > >
> > > > On Sun, Apr 12, 2015 at 10:41 PM, Atri Sharma <at...@gmail.com>
> > > wrote:
> > > >
> > > > > Out of curiosity, did we never face thrashing with this approach?
> > > > (assuming
> > > > > wild workloads that hit the same page again and again)
> > > > >
> > > > > On Mon, Apr 13, 2015 at 7:35 AM, Dmitriy Setrakyan <
> > > > > dsetrakyan@gridgain.com>
> > > > > wrote:
> > > > >
> > > > > > Guys,
> > > > > >
> > > > > > I have been playing with TTL-based expirations for streaming and
> am
> > > > > > noticing that often the TTL thread cannot cope with the load a
> > > > > > data-streamer can generate.
> > > > > >
> > > > > > Our current approach for TTL is implemented with a thread that
> > keeps
> > > > all
> > > > > > cache entries in sorted-by-expire-time order and scans this
> ordered
> > > > list
> > > > > > every time something expires. The scan is optimized, such that
> once
> > > an
> > > > > > entry that does not need to expire yet is touched, the thread
> goes
> > > into
> > > > > > wait mode until TTL time for that entry expires. However even
> with
> > > this
> > > > > > optimization, this thread cannot expire the entries fast enough.
> > > > > >
> > > > > > I think if we handle TTL expirations the same way as we handle
> > > > evictions,
> > > > > > through eviction policies, this problem would go away, because
> all
> > > the
> > > > > > threads that add entries to eviction queue are also responsible
> to
> > > > evict
> > > > > > the excess entries as well.
> > > > > >
> > > > > > I have created a parent ticket for this issue:
> > > > > > https://issues.apache.org/jira/browse/IGNITE-729
> > > > > >
> > > > > > Would be nice to implement them in the next couple of weeks.
> > > > > >
> > > > > > D.
> > > > > >
> > > > >
> > > > >
> > > > >
> > > > > --
> > > > > Regards,
> > > > >
> > > > > Atri
> > > > > *l'apprenant*
> > > > >
> > > >
> > >
> > >
> > >
> > > --
> > > Regards,
> > >
> > > Atri
> > > *l'apprenant*
> > >
> >
>
>
>
> --
> Regards,
>
> Atri
> *l'apprenant*
>

Re: TTL-based expirations

Posted by Atri Sharma <at...@gmail.com>.

Clocksweep is essentially a two chance algorithm. What happens is that a
counter is attached to each buffer page and initialized to a suitable value
(take 1 for eg). Now when an eviction needs to happen, a scan is done
through the cache looking for any guy with 0 count. The essential part is
that while the scan happens, the counter for *each* page is decremented by
one. The first page with counter 0 is evicted.

Each time a page is hit, the counter is incremented. We can have a limit
for the counter count.

The idea is that popular pages are kept in the cache and there is no need
to sort according to anything or manage expiration explicitly. All we need
to do is do a scan and decrement, and pull the one with 0 count.

On Mon, Apr 13, 2015 at 3:33 PM, Dmitriy Setrakyan <ds...@gridgain.com>
wrote:

> On Mon, Apr 13, 2015 at 2:52 AM, Atri Sharma <at...@gmail.com> wrote:
>
> > I would feel that this problem might also be mitigated a bit if we used a
> > different algorithm than simple LRU (I assume that is what we use).
> > Something like Clocksweep...
> >
>
> I am not too familiar with Clocksweep. Is this something like this?
>
> https://www.usenix.org/legacy/event/usenix05/tech/general/full_papers/jiang/jiang_html/html.html
>
>
>
> >
> > Thoughts?
> >
> > On Mon, Apr 13, 2015 at 3:16 PM, Dmitriy Setrakyan <
> > dsetrakyan@gridgain.com>
> > wrote:
> >
> > > Yes, it is possible for the system to thrash if the data is composed
> only
> > > of unique keys being added and the only parameter to control cache size
> > is
> > > the time-based window. In this case cache can grow indefinitely because
> > the
> > > TTL thread cannot cope with the load. If you, in addition to TTL also
> > > configure an eviction policy, e.g. FifoEvictionPolicy, then there is
> not
> > > thrashing.
> > >
> > > That's why I suggested that TTL expiration is also implemented through
> > > eviction policies.
> > >
> > > D.
> > >
> > > On Sun, Apr 12, 2015 at 10:41 PM, Atri Sharma <at...@gmail.com>
> > wrote:
> > >
> > > > Out of curiosity, did we never face thrashing with this approach?
> > > (assuming
> > > > wild workloads that hit the same page again and again)
> > > >
> > > > On Mon, Apr 13, 2015 at 7:35 AM, Dmitriy Setrakyan <
> > > > dsetrakyan@gridgain.com>
> > > > wrote:
> > > >
> > > > > Guys,
> > > > >
> > > > > I have been playing with TTL-based expirations for streaming and am
> > > > > noticing that often the TTL thread cannot cope with the load a
> > > > > data-streamer can generate.
> > > > >
> > > > > Our current approach for TTL is implemented with a thread that
> keeps
> > > all
> > > > > cache entries in sorted-by-expire-time order and scans this ordered
> > > list
> > > > > every time something expires. The scan is optimized, such that once
> > an
> > > > > entry that does not need to expire yet is touched, the thread goes
> > into
> > > > > wait mode until TTL time for that entry expires. However even with
> > this
> > > > > optimization, this thread cannot expire the entries fast enough.
> > > > >
> > > > > I think if we handle TTL expirations the same way as we handle
> > > evictions,
> > > > > through eviction policies, this problem would go away, because all
> > the
> > > > > threads that add entries to eviction queue are also responsible to
> > > evict
> > > > > the excess entries as well.
> > > > >
> > > > > I have created a parent ticket for this issue:
> > > > > https://issues.apache.org/jira/browse/IGNITE-729
> > > > >
> > > > > Would be nice to implement them in the next couple of weeks.
> > > > >
> > > > > D.
> > > > >
> > > >
> > > >
> > > >
> > > > --
> > > > Regards,
> > > >
> > > > Atri
> > > > *l'apprenant*
> > > >
> > >
> >
> >
> >
> > --
> > Regards,
> >
> > Atri
> > *l'apprenant*
> >
>



-- 
Regards,

Atri
*l'apprenant*

Re: TTL-based expirations

Posted by Dmitriy Setrakyan <ds...@gridgain.com>.

On Mon, Apr 13, 2015 at 2:52 AM, Atri Sharma <at...@gmail.com> wrote:

> I would feel that this problem might also be mitigated a bit if we used a
> different algorithm than simple LRU (I assume that is what we use).
> Something like Clocksweep...
>

I am not too familiar with Clocksweep. Is this something like this?
https://www.usenix.org/legacy/event/usenix05/tech/general/full_papers/jiang/jiang_html/html.html



>
> Thoughts?
>
> On Mon, Apr 13, 2015 at 3:16 PM, Dmitriy Setrakyan <
> dsetrakyan@gridgain.com>
> wrote:
>
> > Yes, it is possible for the system to thrash if the data is composed only
> > of unique keys being added and the only parameter to control cache size
> is
> > the time-based window. In this case cache can grow indefinitely because
> the
> > TTL thread cannot cope with the load. If you, in addition to TTL also
> > configure an eviction policy, e.g. FifoEvictionPolicy, then there is not
> > thrashing.
> >
> > That's why I suggested that TTL expiration is also implemented through
> > eviction policies.
> >
> > D.
> >
> > On Sun, Apr 12, 2015 at 10:41 PM, Atri Sharma <at...@gmail.com>
> wrote:
> >
> > > Out of curiosity, did we never face thrashing with this approach?
> > (assuming
> > > wild workloads that hit the same page again and again)
> > >
> > > On Mon, Apr 13, 2015 at 7:35 AM, Dmitriy Setrakyan <
> > > dsetrakyan@gridgain.com>
> > > wrote:
> > >
> > > > Guys,
> > > >
> > > > I have been playing with TTL-based expirations for streaming and am
> > > > noticing that often the TTL thread cannot cope with the load a
> > > > data-streamer can generate.
> > > >
> > > > Our current approach for TTL is implemented with a thread that keeps
> > all
> > > > cache entries in sorted-by-expire-time order and scans this ordered
> > list
> > > > every time something expires. The scan is optimized, such that once
> an
> > > > entry that does not need to expire yet is touched, the thread goes
> into
> > > > wait mode until TTL time for that entry expires. However even with
> this
> > > > optimization, this thread cannot expire the entries fast enough.
> > > >
> > > > I think if we handle TTL expirations the same way as we handle
> > evictions,
> > > > through eviction policies, this problem would go away, because all
> the
> > > > threads that add entries to eviction queue are also responsible to
> > evict
> > > > the excess entries as well.
> > > >
> > > > I have created a parent ticket for this issue:
> > > > https://issues.apache.org/jira/browse/IGNITE-729
> > > >
> > > > Would be nice to implement them in the next couple of weeks.
> > > >
> > > > D.
> > > >
> > >
> > >
> > >
> > > --
> > > Regards,
> > >
> > > Atri
> > > *l'apprenant*
> > >
> >
>
>
>
> --
> Regards,
>
> Atri
> *l'apprenant*
>

Re: TTL-based expirations

Posted by Atri Sharma <at...@gmail.com>.

I would feel that this problem might also be mitigated a bit if we used a
different algorithm than simple LRU (I assume that is what we use).
Something like Clocksweep...

Thoughts?

On Mon, Apr 13, 2015 at 3:16 PM, Dmitriy Setrakyan <ds...@gridgain.com>
wrote:

> Yes, it is possible for the system to thrash if the data is composed only
> of unique keys being added and the only parameter to control cache size is
> the time-based window. In this case cache can grow indefinitely because the
> TTL thread cannot cope with the load. If you, in addition to TTL also
> configure an eviction policy, e.g. FifoEvictionPolicy, then there is not
> thrashing.
>
> That's why I suggested that TTL expiration is also implemented through
> eviction policies.
>
> D.
>
> On Sun, Apr 12, 2015 at 10:41 PM, Atri Sharma <at...@gmail.com> wrote:
>
> > Out of curiosity, did we never face thrashing with this approach?
> (assuming
> > wild workloads that hit the same page again and again)
> >
> > On Mon, Apr 13, 2015 at 7:35 AM, Dmitriy Setrakyan <
> > dsetrakyan@gridgain.com>
> > wrote:
> >
> > > Guys,
> > >
> > > I have been playing with TTL-based expirations for streaming and am
> > > noticing that often the TTL thread cannot cope with the load a
> > > data-streamer can generate.
> > >
> > > Our current approach for TTL is implemented with a thread that keeps
> all
> > > cache entries in sorted-by-expire-time order and scans this ordered
> list
> > > every time something expires. The scan is optimized, such that once an
> > > entry that does not need to expire yet is touched, the thread goes into
> > > wait mode until TTL time for that entry expires. However even with this
> > > optimization, this thread cannot expire the entries fast enough.
> > >
> > > I think if we handle TTL expirations the same way as we handle
> evictions,
> > > through eviction policies, this problem would go away, because all the
> > > threads that add entries to eviction queue are also responsible to
> evict
> > > the excess entries as well.
> > >
> > > I have created a parent ticket for this issue:
> > > https://issues.apache.org/jira/browse/IGNITE-729
> > >
> > > Would be nice to implement them in the next couple of weeks.
> > >
> > > D.
> > >
> >
> >
> >
> > --
> > Regards,
> >
> > Atri
> > *l'apprenant*
> >
>



-- 
Regards,

Atri
*l'apprenant*

Re: TTL-based expirations

Posted by Dmitriy Setrakyan <ds...@gridgain.com>.

Yes, it is possible for the system to thrash if the data is composed only
of unique keys being added and the only parameter to control cache size is
the time-based window. In this case cache can grow indefinitely because the
TTL thread cannot cope with the load. If you, in addition to TTL also
configure an eviction policy, e.g. FifoEvictionPolicy, then there is not
thrashing.

That's why I suggested that TTL expiration is also implemented through
eviction policies.

D.

On Sun, Apr 12, 2015 at 10:41 PM, Atri Sharma <at...@gmail.com> wrote:

> Out of curiosity, did we never face thrashing with this approach? (assuming
> wild workloads that hit the same page again and again)
>
> On Mon, Apr 13, 2015 at 7:35 AM, Dmitriy Setrakyan <
> dsetrakyan@gridgain.com>
> wrote:
>
> > Guys,
> >
> > I have been playing with TTL-based expirations for streaming and am
> > noticing that often the TTL thread cannot cope with the load a
> > data-streamer can generate.
> >
> > Our current approach for TTL is implemented with a thread that keeps all
> > cache entries in sorted-by-expire-time order and scans this ordered list
> > every time something expires. The scan is optimized, such that once an
> > entry that does not need to expire yet is touched, the thread goes into
> > wait mode until TTL time for that entry expires. However even with this
> > optimization, this thread cannot expire the entries fast enough.
> >
> > I think if we handle TTL expirations the same way as we handle evictions,
> > through eviction policies, this problem would go away, because all the
> > threads that add entries to eviction queue are also responsible to evict
> > the excess entries as well.
> >
> > I have created a parent ticket for this issue:
> > https://issues.apache.org/jira/browse/IGNITE-729
> >
> > Would be nice to implement them in the next couple of weeks.
> >
> > D.
> >
>
>
>
> --
> Regards,
>
> Atri
> *l'apprenant*
>

Re: TTL-based expirations

Posted by Atri Sharma <at...@gmail.com>.

Out of curiosity, did we never face thrashing with this approach? (assuming
wild workloads that hit the same page again and again)

On Mon, Apr 13, 2015 at 7:35 AM, Dmitriy Setrakyan <ds...@gridgain.com>
wrote:

> Guys,
>
> I have been playing with TTL-based expirations for streaming and am
> noticing that often the TTL thread cannot cope with the load a
> data-streamer can generate.
>
> Our current approach for TTL is implemented with a thread that keeps all
> cache entries in sorted-by-expire-time order and scans this ordered list
> every time something expires. The scan is optimized, such that once an
> entry that does not need to expire yet is touched, the thread goes into
> wait mode until TTL time for that entry expires. However even with this
> optimization, this thread cannot expire the entries fast enough.
>
> I think if we handle TTL expirations the same way as we handle evictions,
> through eviction policies, this problem would go away, because all the
> threads that add entries to eviction queue are also responsible to evict
> the excess entries as well.
>
> I have created a parent ticket for this issue:
> https://issues.apache.org/jira/browse/IGNITE-729
>
> Would be nice to implement them in the next couple of weeks.
>
> D.
>



-- 
Regards,

Atri
*l'apprenant*