You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@carbondata.apache.org by jackylk <gi...@git.apache.org> on 2018/02/08 13:45:27 UTC

[GitHub] carbondata pull request #1808: [CARBONDATA-2023][DataLoad] Add size base blo...

Github user jackylk commented on a diff in the pull request:

    https://github.com/apache/carbondata/pull/1808#discussion_r166936712
  
    --- Diff: core/src/main/java/org/apache/carbondata/core/constants/CarbonLoadOptionConstants.java ---
    @@ -114,4 +114,14 @@
        */
       public static final int MAX_EXTERNAL_DICTIONARY_SIZE = 10000000;
     
    +  /**
    +   * enable block size based block allocation while loading data. By default, carbondata assigns
    +   * blocks to node based on block number. If this option is set to `true`, carbondata will
    +   * consider block size first and make sure that all the nodes will process almost equal size of
    +   * data. This option is especially useful when you encounter skewed data.
    +   */
    +  @CarbonProperty
    +  public static final String ENABLE_CARBON_LOAD_SKEWED_DATA_OPTIMIZATION
    +      = "carbon.load.skewed.data.optimization";
    --- End diff --
    
    change to `carbon.load.skewedDataOptimization.enabled`


---