Posted to dev@accumulo.apache.org by "Eric Newton (Created) (JIRA)" <ji...@apache.org> on 2011/11/07 14:50:51 UTC

[jira] [Created] (ACCUMULO-121) document detailed bulk ingest best practices

document detailed bulk ingest best practices
--------------------------------------------

                 Key: ACCUMULO-121
                 URL: https://issues.apache.org/jira/browse/ACCUMULO-121
             Project: Accumulo
          Issue Type: Improvement
          Components: docs
    Affects Versions: 1.5.0
            Reporter: Eric Newton
            Assignee: Adam Fuchs


The users' manual should advise best practices for generating and importing bulk files, as well as the effects of bulk imported files:  

 1. Recommended file sizes
 1. Side effects of bypassing constraints
 1. Proper use of range partitioners
 1. Preemptive splitting of tables
 1. How major compaction/garbage collection affects bulk files
 1. Setting the timestamp
 1. Invalid visibility fields make Accumulo fail in strange ways
 1. Bulk import doesn't increase entry counts on the monitor page until the files are compacted
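
As a rough illustration of the range-partitioner and pre-splitting items above, the sketch below shows one way to assign each row to the partition that matches a table's split points, so that every generated bulk file falls within a single tablet's range. It is not taken from the Accumulo codebase: the function name, the split points, and the sample rows are all hypothetical. It assumes each tablet's end row is inclusive and that rows compare as plain ASCII strings (Accumulo itself compares raw bytes).

```python
from bisect import bisect_left


def partition_for(row, split_points):
    """Return the index of the partition whose range would hold `row`.

    `split_points` must be sorted. Partition i covers rows in the range
    (split_points[i-1], split_points[i]], and the final partition covers
    every row after the last split point.
    """
    # bisect_left returns the first index whose split point is >= row,
    # so a row equal to a split point stays in the range that ends there.
    return bisect_left(split_points, row)


# Hypothetical split points for a pre-split table (three splits, four ranges).
splits = ["g", "n", "t"]

# Hypothetical row IDs to be written into bulk files.
rows = ["apple", "g", "grape", "n", "zebra"]

assignments = {row: partition_for(row, splits) for row in rows}
for row, partition in assignments.items():
    print(f"{row} -> partition {partition}")
```

If the partitioner's split points do not match the table's actual splits, a single file can span several tablets, which is presumably one of the pitfalls the manual should describe.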

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (ACCUMULO-121) document detailed bulk ingest best practices

Posted by "Keith Turner (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/ACCUMULO-121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Keith Turner updated ACCUMULO-121:
----------------------------------

    Affects Version/s:     (was: 1.5.0)
        Fix Version/s: 1.5.0
    
> document detailed bulk ingest best practices
> --------------------------------------------
>
>                 Key: ACCUMULO-121
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-121
>             Project: Accumulo
>          Issue Type: Improvement
>          Components: docs
>            Reporter: Eric Newton
>            Assignee: Adam Fuchs
>             Fix For: 1.5.0
>
>
> The users' manual should advise best practices for generating and importing bulk files, as well as the effects of bulk imported files:  
>  # Recommended file sizes
>  # Side effects of bypassing constraints
>  # Proper use of range partitioners
>  # Preemptive splitting of tables
>  # How major compaction/garbage collection affects bulk files
>  # Setting the timestamp
>  # Invalid visibility fields make Accumulo fail in strange ways
>  # Bulk import doesn't increase entry counts on the monitor page until the files are compacted


        

[jira] [Updated] (ACCUMULO-121) document detailed bulk ingest best practices

Posted by "Eric Newton (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/ACCUMULO-121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Eric Newton updated ACCUMULO-121:
---------------------------------

    Description: 
The users' manual should advise best practices for generating and importing bulk files, as well as the effects of bulk imported files:  

 # Recommended file sizes
 # Side effects of bypassing constraints
 # Proper use of range partitioners
 # Preemptive splitting of tables
 # How major compaction/garbage collection affects bulk files
 # Setting the timestamp
 # Invalid visibility fields make Accumulo fail in strange ways
 # Bulk import doesn't increase entry counts on the monitor page until the files are compacted

  was:
The users' manual should advise best practices for generating and importing bulk files, as well as the effects of bulk imported files:  

 1. Recommended file sizes
 1. Side effects of bypassing constraints
 1. Proper use of range partitioners
 1. Preemptive splitting of tables
 1. How major compaction/garbage collection affects bulk files
 1. Setting the timestamp
 1. Invalid visibility fields make Accumulo fail in strange ways
 1. Bulk import doesn't increase entry counts on the monitor page until the files are compacted

    
> document detailed bulk ingest best practices
> --------------------------------------------
>
>                 Key: ACCUMULO-121
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-121
>             Project: Accumulo
>          Issue Type: Improvement
>          Components: docs
>    Affects Versions: 1.5.0
>            Reporter: Eric Newton
>            Assignee: Adam Fuchs
>
> The users' manual should advise best practices for generating and importing bulk files, as well as the effects of bulk imported files:  
>  # Recommended file sizes
>  # Side effects of bypassing constraints
>  # Proper use of range partitioners
>  # Preemptive splitting of tables
>  # How major compaction/garbage collection affects bulk files
>  # Setting the timestamp
>  # Invalid visibility fields make Accumulo fail in strange ways
>  # Bulk import doesn't increase entry counts on the monitor page until the files are compacted
