You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@carbondata.apache.org by ma...@apache.org on 2018/02/24 11:55:19 UTC
carbondata git commit: [CARBONDATA-2135] Documentation for Table
comment and Column Comment
Repository: carbondata
Updated Branches:
refs/heads/master e9430312d -> 4dbd0e5e7
[CARBONDATA-2135] Documentation for Table comment and Column Comment
Documentation for table comment and column comment
This closes #1936
Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo
Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/4dbd0e5e
Tree: http://git-wip-us.apache.org/repos/asf/carbondata/tree/4dbd0e5e
Diff: http://git-wip-us.apache.org/repos/asf/carbondata/diff/4dbd0e5e
Branch: refs/heads/master
Commit: 4dbd0e5e74ff44067bb2c6808a09e947df91607d
Parents: e943031
Author: sgururajshetty <sg...@gmail.com>
Authored: Tue Feb 6 16:06:42 2018 +0530
Committer: manishgupta88 <to...@gmail.com>
Committed: Sat Feb 24 17:28:06 2018 +0530
----------------------------------------------------------------------
docs/data-management-on-carbondata.md | 99 +++++++++++++++++++++---------
1 file changed, 71 insertions(+), 28 deletions(-)
----------------------------------------------------------------------
http://git-wip-us.apache.org/repos/asf/carbondata/blob/4dbd0e5e/docs/data-management-on-carbondata.md
----------------------------------------------------------------------
diff --git a/docs/data-management-on-carbondata.md b/docs/data-management-on-carbondata.md
index 78ab010..9678a32 100644
--- a/docs/data-management-on-carbondata.md
+++ b/docs/data-management-on-carbondata.md
@@ -286,7 +286,40 @@ This tutorial is going to introduce all commands and data operations on CarbonDa
* If the table is aggregate table, then all the aggregate tables should be copied to the new database location.
* For old store, the time zone of the source and destination cluster should be same.
* If old cluster used HIVE meta store to store schema, refresh will not work as schema file does not exist in file system.
+
+### Table and Column Comment
+
+ You can provide more information on table by using table comment. Similarly you can provide more information about a particular column using column comment.
+ You can see the column comment of an existing table using describe formatted command.
+ ```
+ CREATE TABLE [IF NOT EXISTS] [db_name.]table_name[(col_name data_type [COMMENT col_comment], ...)]
+ [COMMENT table_comment]
+ STORED BY 'carbondata'
+ [TBLPROPERTIES (property_name=property_value, ...)]
+ ```
+
+ Example:
+ ```
+ CREATE TABLE IF NOT EXISTS productSchema.productSalesTable (
+ productNumber Int COMMENT 'unique serial number for product')
+ COMMENT “This is table comment”
+ STORED BY 'carbondata'
+ TBLPROPERTIES ('DICTIONARY_INCLUDE'='productNumber')
+ ```
+ You can also SET and UNSET table comment using ALTER command.
+
+ Example to SET table comment:
+
+ ```
+ ALTER TABLE carbon SET TBLPROPERTIES ('comment'='this table comment is modified');
+ ```
+
+ Example to UNSET table comment:
+
+ ```
+ ALTER TABLE carbon UNSET TBLPROPERTIES ('comment');
+ ```
## LOAD DATA
@@ -954,9 +987,9 @@ roll-up for the queries on these hierarchies.
ON TABLE sales
USING "timeseries"
DMPROPERTIES (
- 'event_time’=’order_time’,
- 'year_granularity’=’1’)
- AS
+ 'event_time'='order_time',
+ 'year_granualrity'='1',
+ ) AS
SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
avg(price) FROM sales GROUP BY order_time, country, sex
@@ -964,9 +997,9 @@ roll-up for the queries on these hierarchies.
ON TABLE sales
USING "timeseries"
DMPROPERTIES (
- 'event_time’=’order_time’,
- 'month_granularity’=’1’)
- AS
+ 'event_time'='order_time',
+ 'month_granualrity'='1',
+ ) AS
SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
avg(price) FROM sales GROUP BY order_time, country, sex
@@ -974,9 +1007,9 @@ roll-up for the queries on these hierarchies.
ON TABLE sales
USING "timeseries"
DMPROPERTIES (
- 'event_time’=’order_time’,
- 'day_granularity’=’1’)
- AS
+ 'event_time'='order_time',
+ 'day_granualrity'='1',
+ ) AS
SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
avg(price) FROM sales GROUP BY order_time, country, sex
@@ -984,9 +1017,9 @@ roll-up for the queries on these hierarchies.
ON TABLE sales
USING "timeseries"
DMPROPERTIES (
- 'event_time’=’order_time’,
- 'hour_granularity’=’1’)
- AS
+ 'event_time'='order_time',
+ 'hour_granualrity'='1',
+ ) AS
SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
avg(price) FROM sales GROUP BY order_time, country, sex
@@ -994,9 +1027,19 @@ roll-up for the queries on these hierarchies.
ON TABLE sales
USING "timeseries"
DMPROPERTIES (
- 'event_time’=’order_time’,
- 'minute_granularity’=’1’)
- AS
+ 'event_time'='order_time',
+ 'minute_granualrity'='1',
+ ) AS
+ SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
+ avg(price) FROM sales GROUP BY order_time, country, sex
+
+ CREATE DATAMAP agg_minute
+ ON TABLE sales
+ USING "timeseries"
+ DMPROPERTIES (
+ 'event_time'='order_time',
+ 'minute_granualrity'='1',
+ ) AS
SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
avg(price) FROM sales GROUP BY order_time, country, sex
```
@@ -1004,11 +1047,11 @@ roll-up for the queries on these hierarchies.
For Querying data and automatically roll-up to the desired aggregation level,Carbondata supports
UDF as
```
- timeseries(timeseries column name, ‘aggregation level’)
+ timeseries(timeseries column name, 'aggregation level')
```
```
- Select timeseries(order_time, ‘hour’), sum(quantity) from sales group by timeseries(order_time,
- ’hour’)
+ Select timeseries(order_time, 'hour'), sum(quantity) from sales group by timeseries(order_time,
+ 'hour')
```
It is **not necessary** to create pre-aggregate tables for each granularity unless required for
@@ -1021,9 +1064,9 @@ roll-up for the queries on these hierarchies.
ON TABLE sales
USING "timeseries"
DMPROPERTIES (
- 'event_time’=’order_time’,
- 'day_granularity’=’1’)
- AS
+ 'event_time'='order_time',
+ 'day_granualrity'='1',
+ ) AS
SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
avg(price) FROM sales GROUP BY order_time, country, sex
@@ -1031,20 +1074,20 @@ roll-up for the queries on these hierarchies.
ON TABLE sales
USING "timeseries"
DMPROPERTIES (
- 'event_time’=’order_time’,
- 'hour_granularity’=’1’)
- AS
+ 'event_time'='order_time',
+ 'hour_granualrity'='1',
+ ) AS
SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
avg(price) FROM sales GROUP BY order_time, country, sex
```
Queries like below will be rolled-up and fetched from pre-aggregate tables
```
- Select timeseries(order_time, ‘month’), sum(quantity) from sales group by timeseries(order_time,
- ’month’)
+ Select timeseries(order_time, 'month'), sum(quantity) from sales group by timeseries(order_time,
+ 'month')
- Select timeseries(order_time, ‘year’), sum(quantity) from sales group by timeseries(order_time,
- ’year’)
+ Select timeseries(order_time, 'year'), sum(quantity) from sales group by timeseries(order_time,
+ 'year')
```
NOTE (<b>RESTRICTION</b>):