You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@carbondata.apache.org by ma...@apache.org on 2018/02/24 11:55:19 UTC

carbondata git commit: [CARBONDATA-2135] Documentation for Table comment and Column Comment

Repository: carbondata
Updated Branches:
  refs/heads/master e9430312d -> 4dbd0e5e7


[CARBONDATA-2135] Documentation for Table comment and Column Comment

Documentation for table comment and column comment

This closes #1936


Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo
Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/4dbd0e5e
Tree: http://git-wip-us.apache.org/repos/asf/carbondata/tree/4dbd0e5e
Diff: http://git-wip-us.apache.org/repos/asf/carbondata/diff/4dbd0e5e

Branch: refs/heads/master
Commit: 4dbd0e5e74ff44067bb2c6808a09e947df91607d
Parents: e943031
Author: sgururajshetty <sg...@gmail.com>
Authored: Tue Feb 6 16:06:42 2018 +0530
Committer: manishgupta88 <to...@gmail.com>
Committed: Sat Feb 24 17:28:06 2018 +0530

----------------------------------------------------------------------
 docs/data-management-on-carbondata.md | 99 +++++++++++++++++++++---------
 1 file changed, 71 insertions(+), 28 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/carbondata/blob/4dbd0e5e/docs/data-management-on-carbondata.md
----------------------------------------------------------------------
diff --git a/docs/data-management-on-carbondata.md b/docs/data-management-on-carbondata.md
index 78ab010..9678a32 100644
--- a/docs/data-management-on-carbondata.md
+++ b/docs/data-management-on-carbondata.md
@@ -286,7 +286,40 @@ This tutorial is going to introduce all commands and data operations on CarbonDa
   * If the table is aggregate table, then all the aggregate tables should be copied to the new database location.
   * For old store, the time zone of the source and destination cluster should be same.
   * If old cluster used HIVE meta store to store schema, refresh will not work as schema file does not exist in file system.
+
+### Table and Column Comment
+
+  You can provide more information on table by using table comment. Similarly you can provide more information about a particular column using column comment. 
+  You can see the column comment of an existing table using describe formatted command.
   
+  ```
+  CREATE TABLE [IF NOT EXISTS] [db_name.]table_name[(col_name data_type [COMMENT col_comment], ...)]
+    [COMMENT table_comment]
+  STORED BY 'carbondata'
+  [TBLPROPERTIES (property_name=property_value, ...)]
+  ```
+  
+  Example:
+  ```
+  CREATE TABLE IF NOT EXISTS productSchema.productSalesTable (
+                                productNumber Int COMMENT 'unique serial number for product')
+  COMMENT “This is table comment”
+   STORED BY 'carbondata'
+   TBLPROPERTIES ('DICTIONARY_INCLUDE'='productNumber')
+  ```
+  You can also SET and UNSET table comment using ALTER command.
+
+  Example to SET table comment:
+  
+  ```
+  ALTER TABLE carbon SET TBLPROPERTIES ('comment'='this table comment is modified');
+  ```
+  
+  Example to UNSET table comment:
+  
+  ```
+  ALTER TABLE carbon UNSET TBLPROPERTIES ('comment');
+  ```
 
 ## LOAD DATA
 
@@ -954,9 +987,9 @@ roll-up for the queries on these hierarchies.
   ON TABLE sales
   USING "timeseries"
   DMPROPERTIES (
-  'event_time’=’order_time’,
-  'year_granularity’=’1’)
-  AS
+  'event_time'='order_time',
+  'year_granualrity'='1',
+  ) AS
   SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
    avg(price) FROM sales GROUP BY order_time, country, sex
     
@@ -964,9 +997,9 @@ roll-up for the queries on these hierarchies.
   ON TABLE sales
   USING "timeseries"
   DMPROPERTIES (
-  'event_time’=’order_time’,
-  'month_granularity’=’1’)
-  AS
+  'event_time'='order_time',
+  'month_granualrity'='1',
+  ) AS
   SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
    avg(price) FROM sales GROUP BY order_time, country, sex
     
@@ -974,9 +1007,9 @@ roll-up for the queries on these hierarchies.
   ON TABLE sales
   USING "timeseries"
   DMPROPERTIES (
-  'event_time’=’order_time’,
-  'day_granularity’=’1’)
-  AS
+  'event_time'='order_time',
+  'day_granualrity'='1',
+  ) AS
   SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
    avg(price) FROM sales GROUP BY order_time, country, sex
         
@@ -984,9 +1017,9 @@ roll-up for the queries on these hierarchies.
   ON TABLE sales
   USING "timeseries"
   DMPROPERTIES (
-  'event_time’=’order_time’,
-  'hour_granularity’=’1’)
-  AS
+  'event_time'='order_time',
+  'hour_granualrity'='1',
+  ) AS
   SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
    avg(price) FROM sales GROUP BY order_time, country, sex
   
@@ -994,9 +1027,19 @@ roll-up for the queries on these hierarchies.
   ON TABLE sales
   USING "timeseries"
   DMPROPERTIES (
-  'event_time’=’order_time’,
-  'minute_granularity’=’1’)
-  AS
+  'event_time'='order_time',
+  'minute_granualrity'='1',
+  ) AS
+  SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
+   avg(price) FROM sales GROUP BY order_time, country, sex
+    
+  CREATE DATAMAP agg_minute
+  ON TABLE sales
+  USING "timeseries"
+  DMPROPERTIES (
+  'event_time'='order_time',
+  'minute_granualrity'='1',
+  ) AS
   SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
    avg(price) FROM sales GROUP BY order_time, country, sex
   ```
@@ -1004,11 +1047,11 @@ roll-up for the queries on these hierarchies.
   For Querying data and automatically roll-up to the desired aggregation level,Carbondata supports 
   UDF as
   ```
-  timeseries(timeseries column name, ‘aggregation level’)
+  timeseries(timeseries column name, 'aggregation level')
   ```
   ```
-  Select timeseries(order_time, ‘hour’), sum(quantity) from sales group by timeseries(order_time,
-  ’hour’)
+  Select timeseries(order_time, 'hour'), sum(quantity) from sales group by timeseries(order_time,
+  'hour')
   ```
   
   It is **not necessary** to create pre-aggregate tables for each granularity unless required for 
@@ -1021,9 +1064,9 @@ roll-up for the queries on these hierarchies.
     ON TABLE sales
     USING "timeseries"
     DMPROPERTIES (
-    'event_time’=’order_time’,
-    'day_granularity’=’1’)
-    AS
+    'event_time'='order_time',
+    'day_granualrity'='1',
+    ) AS
     SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
      avg(price) FROM sales GROUP BY order_time, country, sex
           
@@ -1031,20 +1074,20 @@ roll-up for the queries on these hierarchies.
     ON TABLE sales
     USING "timeseries"
     DMPROPERTIES (
-    'event_time’=’order_time’,
-    'hour_granularity’=’1’)
-    AS
+    'event_time'='order_time',
+    'hour_granualrity'='1',
+    ) AS
     SELECT order_time, country, sex, sum(quantity), max(quantity), count(user_id), sum(price),
      avg(price) FROM sales GROUP BY order_time, country, sex
   ```
   
   Queries like below will be rolled-up and fetched from pre-aggregate tables
   ```
-  Select timeseries(order_time, ‘month’), sum(quantity) from sales group by timeseries(order_time,
-    ’month’)
+  Select timeseries(order_time, 'month'), sum(quantity) from sales group by timeseries(order_time,
+    'month')
     
-  Select timeseries(order_time, ‘year’), sum(quantity) from sales group by timeseries(order_time,
-    ’year’)
+  Select timeseries(order_time, 'year'), sum(quantity) from sales group by timeseries(order_time,
+    'year')
   ```
   
   NOTE (<b>RESTRICTION</b>):