You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@knox.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/03/27 00:22:00 UTC

[jira] [Work logged] (KNOX-2310) Add aggregate method to KnoxShellTable

     [ https://issues.apache.org/jira/browse/KNOX-2310?focusedWorklogId=410738&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-410738 ]

ASF GitHub Bot logged work on KNOX-2310:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 27/Mar/20 00:21
            Start Date: 27/Mar/20 00:21
    Worklog Time Spent: 10m 
      Work Description: lmccay commented on pull request #302: KNOX-2310
URL: https://github.com/apache/knox/pull/302
 
 
   (It is very **important** that you created an Apache Knox JIRA for this change and that the PR title/commit message includes the Apache Knox JIRA ID!)
   
   ## What changes were proposed in this pull request?
   
   A simple method for producing a meaningful mathematical report of a dataset would allow the user to quickly determine some basic metrics of a dataset.
   
   For instance, given a dataset with time series data across columns, an aggregation of:
   
       min
       max
       mean
       median
       mode
       sum
   
   for specific columns would provide a way to visualize the changes across time.
   
   knox:000> test.aggregate() columns "A,B,C" functions "min, max, mean, median, mode,sum"
   ===> +----------+----------------------+----------+----------+
   |          |          A           |    B     |    C     |
   +----------+----------------------+----------+----------+
   |   min    |        100.0         |  200.0   |  300.0   |
   |   max    |        200.0         |  400.0   |  500.0   |
   |   mean   |  166.66666666666666  |  300.0   |  400.0   |
   |  median  |        200.0         |  300.0   |  400.0   |
   |   mode   |        200.0         |  200.0   |  300.0   |
   |   sum    |        500.0         |  900.0   |  1200.0  |
   +----------+----------------------+----------+----------+
   
   ## How was this patch tested?
   
   Added new unit test and ran existing unit tests.
   Tested manually in KnoxShell.
   
   (Please explain how this patch was tested. For instance: running automated unit/integration tests, manual tests. Please write down your test steps as detailed as possible)
   (If this patch involves UI changes, please attach a screen-shot; otherwise, remove this)
   
   Please review [Knox Contributing Process](https://cwiki.apache.org/confluence/display/KNOX/Contribution+Process#ContributionProcess-GithubWorkflow) before opening a pull request.
   
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

            Worklog Id:     (was: 410738)
    Remaining Estimate: 0h
            Time Spent: 10m

> Add aggregate method to KnoxShellTable
> --------------------------------------
>
>                 Key: KNOX-2310
>                 URL: https://issues.apache.org/jira/browse/KNOX-2310
>             Project: Apache Knox
>          Issue Type: Improvement
>          Components: KnoxShell
>            Reporter: Larry McCay
>            Assignee: Larry McCay
>            Priority: Major
>             Fix For: 1.4.0
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> A simple method for producing a meaningful mathematical report of a dataset would allow the user to quickly determine some basic metrics of a dataset.
> For instance, given a dataset with time series data across columns, an aggregation of:
> * min
> * max
> * mean
> * median
> * mode
> * sum
> for specific columns would provide a way to visualize the changes across time.
> {code}
> knox:000> hopkinsC19.filter() name "Country/Region" equalTo "US" aggregate() columns "3/15/20, 3/16/20, 3/17/20, 3/18/20, 3/19/20, 3/20/20, 3/21/20" functions "min, max, mean, median, mode, sum" title "Confirmed COVID-19 US Cases over the last Week"
> ===> Confirmed COVID-19 US Cases over the last Week
> +----------+----------------------+----------------------+----------------------+----------------------+----------------------+----------------------+----------------------+
> |          |       3/15/20        |       3/16/20        |       3/17/20        |       3/18/20        |       3/19/20        |       3/20/20        |       3/21/20        |
> +----------+----------------------+----------------------+----------------------+----------------------+----------------------+----------------------+----------------------+
> |   min    |         0.0          |         0.0          |         1.0          |         1.0          |         2.0          |         3.0          |         3.0          |
> |   max    |        732.0         |        967.0         |        1706.0        |        2495.0        |        5365.0        |        8310.0        |       11710.0        |
> |   mean   |  62.482142857142826  |  82.71428571428568   |  114.66071428571433  |  138.98214285714295  |  244.23214285714275  |  341.0714285714283   |  455.1607142857145   |
> |  median  |         20.0         |         24.0         |         31.0         |         40.5         |         56.0         |         79.0         |        120.0         |
> |   mode   |         1.0          |         1.0          |         3.0          |         18.0         |         44.0         |         14.0         |         15.0         |
> |   sum    |        3499.0        |        4632.0        |        6421.0        |        7783.0        |       13677.0        |       19100.0        |       25489.0        |
> +----------+----------------------+----------------------+----------------------+----------------------+----------------------+----------------------+----------------------+
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)