You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@carbondata.apache.org by ja...@apache.org on 2018/06/18 16:34:47 UTC

carbondata git commit: [CARBONDATA-1787] Updated data-management-on-carbondata.md for GLOBAL_SORT_PARTITIONS

Repository: carbondata
Updated Branches:
  refs/heads/master dc4f87ba5 -> ca466d9f4


[CARBONDATA-1787] Updated data-management-on-carbondata.md for GLOBAL_SORT_PARTITIONS

This closes #1668


Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo
Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/ca466d9f
Tree: http://git-wip-us.apache.org/repos/asf/carbondata/tree/ca466d9f
Diff: http://git-wip-us.apache.org/repos/asf/carbondata/diff/ca466d9f

Branch: refs/heads/master
Commit: ca466d9f4b07db1a088dc62cebaf6b4733c28a9b
Parents: dc4f87b
Author: vandana <va...@gmail.com>
Authored: Fri Dec 15 18:16:05 2017 +0530
Committer: Jacky Li <ja...@qq.com>
Committed: Tue Jun 19 00:34:39 2018 +0800

----------------------------------------------------------------------
 docs/data-management-on-carbondata.md | 10 ++++++++++
 1 file changed, 10 insertions(+)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/carbondata/blob/ca466d9f/docs/data-management-on-carbondata.md
----------------------------------------------------------------------
diff --git a/docs/data-management-on-carbondata.md b/docs/data-management-on-carbondata.md
index 3326e9b..7e171aa 100644
--- a/docs/data-management-on-carbondata.md
+++ b/docs/data-management-on-carbondata.md
@@ -560,6 +560,16 @@ This tutorial is going to introduce all commands and data operations on CarbonDa
   'BAD_RECORDS_ACTION'='REDIRECT','IS_EMPTY_DATA_BAD_RECORD'='false')
   ```
 
+  - **GLOBAL_SORT_PARTITIONS:** If the SORT_SCOPE is defined as GLOBAL_SORT, then user can specify the number of partitions to use while shuffling data for sort using GLOBAL_SORT_PARTITIONS. If it is not configured, or configured less than 1, then it uses the number of map task as reduce task. It is recommended that each reduce task deal with 512MB-1GB data.
+
+  ```
+  OPTIONS('GLOBAL_SORT_PARTITIONS'='2')
+  ```
+
+   NOTE:
+   * GLOBAL_SORT_PARTITIONS should be Integer type, the range is [1,Integer.MaxValue].
+   * It is only used when the SORT_SCOPE is GLOBAL_SORT.
+
 ### INSERT DATA INTO CARBONDATA TABLE
 
   This command inserts data into a CarbonData table, it is defined as a combination of two queries Insert and Select query respectively.