You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@carbondata.apache.org by in...@apache.org on 2020/10/13 06:14:59 UTC

[carbondata] branch master updated: [CARBONDATA-3932] [CARBONDATA-3903] change discovery.uri in presto guide and dml document update

This is an automated email from the ASF dual-hosted git repository.

indhumuthumurugesh pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/carbondata.git


The following commit(s) were added to refs/heads/master by this push:
     new bdc8c91  [CARBONDATA-3932] [CARBONDATA-3903] change discovery.uri in presto guide and dml document update
bdc8c91 is described below

commit bdc8c91d77c0407f66cc30a90a931035f28bd8d9
Author: ShreelekhyaG <sh...@yahoo.com>
AuthorDate: Tue Oct 6 20:12:10 2020 +0530

    [CARBONDATA-3932] [CARBONDATA-3903] change discovery.uri in presto guide and dml document update
    
    Why is this PR needed?
    Few document changes in dml, presto-guide.
    
    What changes were proposed in this PR?
    change discovery.uri=<coordinator_ip>:8086 to discovery.uri=http://<coordinator_ip>:8086 in presto guide.
    dml document update with delta file information.
    
    Does this PR introduce any user interface change?
    No
    
    Is any new testcase added?
    No
    
    This closes #3969
---
 docs/dml-of-carbondata.md | 10 ++++++++++
 docs/language-manual.md   |  2 +-
 docs/prestodb-guide.md    |  4 ++--
 docs/prestosql-guide.md   |  4 ++--
 4 files changed, 15 insertions(+), 5 deletions(-)

diff --git a/docs/dml-of-carbondata.md b/docs/dml-of-carbondata.md
index 0a8163a..278ccf8 100644
--- a/docs/dml-of-carbondata.md
+++ b/docs/dml-of-carbondata.md
@@ -402,6 +402,11 @@ CarbonData DML statements are documented here,which includes:
 
 ## UPDATE AND DELETE
 
+  Since the data stored in a file system like HDFS is immutable, the update and delete in carbondata are done via maintaining two files namely:
+  
+  * Insert Delta: Stores newly added rows (CarbonData file format)
+  * Delete Delta: Store RowId of rows that are deleted (Bitmap file format)
+  
 ### UPDATE
 
   This command will allow to update the CarbonData table based on the column expression and optional filter conditions.
@@ -451,10 +456,15 @@ CarbonData DML statements are documented here,which includes:
   ```
   DELETE FROM table_name [WHERE expression]
   ```
+  Note: If the WHERE clause with expression is not provided in the above syntax, all of the records from the table will be deleted.
 
   Examples:
 
   ```
+  DELETE FROM carbontable
+  ```
+
+  ```
   DELETE FROM carbontable WHERE column1  = 'china'
   ```
 
diff --git a/docs/language-manual.md b/docs/language-manual.md
index f533e2c..ad82124 100644
--- a/docs/language-manual.md
+++ b/docs/language-manual.md
@@ -31,7 +31,7 @@ CarbonData has its own parser, in addition to Spark's SQL Parser, to parse and p
   - [Materialized Views](./mv-guide.md)
   - [Streaming](./streaming-guide.md)
 - Data Manipulation Statements
-  - [DML:](./dml-of-carbondata.md) [Load](./dml-of-carbondata.md#load-data), [Insert](./dml-of-carbondata.md#insert-data-into-carbondata-table), [Update](./dml-of-carbondata.md#update), [Delete](./dml-of-carbondata.md#delete)
+  - [DML:](./dml-of-carbondata.md) [Load](./dml-of-carbondata.md#load-data), [Insert](./dml-of-carbondata.md#insert-data-into-carbondata-table), [Update](./dml-of-carbondata.md#update), [Delete](./dml-of-carbondata.md#delete), [Compaction](./dml-of-carbondata.md#compaction)
   - [Segment Management](./segment-management-on-carbondata.md)
 - [CarbonData as Spark's Datasource](./carbon-as-spark-datasource-guide.md)
 - [Configuration Properties](./configuration-parameters.md)
diff --git a/docs/prestodb-guide.md b/docs/prestodb-guide.md
index 0e45108..ec2e5af 100644
--- a/docs/prestodb-guide.md
+++ b/docs/prestodb-guide.md
@@ -108,7 +108,7 @@ Currently carbondata supports only one version of presto, cannot handle multiple
   query.max-memory-per-node=3GB
   memory.heap-headroom-per-node=1GB
   discovery-server.enabled=true
-  discovery.uri=<coordinator_ip>:8086
+  discovery.uri=http://<coordinator_ip>:8086
   ```
 The options `node-scheduler.include-coordinator=false` and `coordinator=true` indicate that the node is the coordinator and tells the coordinator not to do any of the computation work itself and to use the workers.
 
@@ -128,7 +128,7 @@ Then, `query.max-memory=<30GB * number of nodes>`.
   http-server.http.port=8086
   query.max-memory=5GB
   query.max-memory-per-node=2GB
-  discovery.uri=<coordinator_ip>:8086
+  discovery.uri=http://<coordinator_ip>:8086
   ```
 
 **Note**: `jvm.config` and `node.properties` files are same for all the nodes (worker + coordinator). All the nodes should have different `node.id`.
diff --git a/docs/prestosql-guide.md b/docs/prestosql-guide.md
index ff05379..5fc83d8 100644
--- a/docs/prestosql-guide.md
+++ b/docs/prestosql-guide.md
@@ -110,7 +110,7 @@ Currently carbondata supports only one version of presto, cannot handle multiple
   query.max-memory-per-node=3GB
   memory.heap-headroom-per-node=1GB
   discovery-server.enabled=true
-  discovery.uri=<coordinator_ip>:8086
+  discovery.uri=http://<coordinator_ip>:8086
   ```
 The options `node-scheduler.include-coordinator=false` and `coordinator=true` indicate that the node is the coordinator and tells the coordinator not to do any of the computation work itself and to use the workers.
 
@@ -130,7 +130,7 @@ Then, `query.max-memory=<30GB * number of nodes>`.
   http-server.http.port=8086
   query.max-memory=5GB
   query.max-memory-per-node=2GB
-  discovery.uri=<coordinator_ip>:8086
+  discovery.uri=http://<coordinator_ip>:8086
   ```
 
 **Note**: `jvm.config` and `node.properties` files are same for all the nodes (worker + coordinator). All the nodes should have different `node.id`.