You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@stratos.apache.org by "Pranavan (JIRA)" <ji...@apache.org> on 2015/07/17 17:39:04 UTC

[jira] [Commented] (STRATOS-1211) Introducing "curve fitting" for stat prediction algorithm of Autoscaler

    [ https://issues.apache.org/jira/browse/STRATOS-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14631481#comment-14631481 ] 

Pranavan commented on STRATOS-1211:
-----------------------------------

The branch that I am working - https://github.com/pranavan135/stratos/tree/CurveFittingGSoC
Tentative PR - https://github.com/apache/stratos/pull/382

> Introducing "curve fitting" for stat prediction algorithm of Autoscaler
> -----------------------------------------------------------------------
>
>                 Key: STRATOS-1211
>                 URL: https://issues.apache.org/jira/browse/STRATOS-1211
>             Project: Stratos
>          Issue Type: Bug
>          Components: Autoscaler, CEP
>            Reporter: Lahiru Sandaruwan
>              Labels: gsoc2015
>             Fix For: FUTURE
>
>
> This is a summery of a mail sent to Stratos dev under "[Autoscaling] [Improvement] Introducing "curve fitting" for stat prediction algorithm of Autoscaler" subject.
> Current implementation
> Currently CEP calculates average, gradient, and second derivative and send those values to Autoscaler. Then Autoscaler predicts the values using S = u*t + 0.5*a*t*t.
> In this method CEP calculation is not very much accurate as it does not consider all the events when calculating the gradient and second derivative. Therefore the equation we apply doesn't yield the best prediction.
> Proposed Implementation
> CEP's task
> I think best approach is to do "curve fitting"[1] for received event sample in a particular time window. Refer "Locally weighted linear regression" section at [2] for more details.
> We would need a second degree polynomial fitter for this, where we can use Apache commons math library for this. Refer the sample at [3], we can run this with any degree. e.g. 2, 3. Just increase the degree to increase the accuracy.
> E.g.
> So if get degree 2 polynomial fitter, we will have an equation like below where value(v) is our statistic value and time(t) is the time of event.
> Equation we get from received events,
> v = a*t*t + b*t + c
> So the solution is,
> Find memberwise curves that fits events received in specific window(say 10 minutes) at CEP
> Send the parameters of fitted line(a, b, and c in above equation) with the timestamp of last event(T) in the window, to Autoscaler
> Autoscaler's task
> Autoscaler use v = a*t*t + b*t + c function to predict the value in any timestamp from the last timestamp
> E.g. Say we need to find the value(v) after 1 minute(assuming we carried all the calculations in milliseconds),
> v = a * (T+60000) * (T+60000) + b * (T+60000) + c
> So we have memberwise predictions and we can find clusterwise prediction by averaging all the memberwise values.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)